This is a Next.js/Shadcn dashboard built on Amplify Gen2 for Plexus, along with a Python API client and a CLI tool.
Install the Plexus client Python module in development mode:

```bash
pip install -e .
```
Set up your environment variables in a `.env` file:

```
AWS_ACCESS_KEY_ID=...
AWS_SECRET_ACCESS_KEY=...
AWS_REGION_NAME=...
PLEXUS_API_URL=...
PLEXUS_API_KEY=...
```
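If you want these variables loaded inside Python rather than exported by your shell, `python-dotenv` is one common option. A minimal sketch, assuming `python-dotenv` is installed and the `.env` file sits in the working directory:

```python
# Minimal sketch using python-dotenv (installed separately).
from dotenv import load_dotenv
import os

load_dotenv()  # reads .env from the current directory into os.environ

api_url = os.environ["PLEXUS_API_URL"]
api_key = os.environ["PLEXUS_API_KEY"]
```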
The library provides a client that mirrors the GraphQL schema structure:
```python
from plexus_dashboard.api.client import PlexusDashboardClient

# Initialize client with optional context
client = PlexusDashboardClient(
    context={
        'account_key': 'call-criteria',
        'scorecard_key': 'agent-performance'
    }
)

# Create a score result for an agent's call quality
client.ScoreResult.create(
    value=0.95,                   # 95% quality score
    item_id="call_20240309_123",  # Specific call being scored
    metadata={
        "duration_seconds": 342,
        "customer_sentiment": "positive",
        "topics": ["billing", "upgrade"]
    }
)

# Create a compliance score with immediate processing
client.ScoreResult.create(
    value=0.82,                   # 82% compliance score
    item_id="call_20240309_123",  # Same call, different score
    immediate=True,               # Process right away
    metadata={
        "regulations": ["pci", "hipaa"],
        "violations": []
    }
)

# Batch create multiple scores
client.ScoreResult.batch_create([
    {
        "value": 0.95,
        "item_id": "call_20240309_124",
        "metadata": {"agent_id": "agent_456"}
    },
    {
        "value": 0.88,
        "item_id": "call_20240309_125",
        "metadata": {"agent_id": "agent_457"}
    }
])

# Look up a scorecard by key
scorecard = client.Scorecard.get_by_key("agent-performance")

# Get an account by ID
account = client.Account.get_by_id("acc_123")
```
Use `plexus-dashboard` to interact with the API:
```bash
# Create a score result
plexus-dashboard score-result create \
  --value 0.95 \
  --item-id call_20240309_123 \
  --account-id acc-123 \
  --scoring-job-id job-123 \
  --scorecard-id card-123 \
  --confidence 0.87 \
  --metadata '{"duration_seconds": 342, "topics": ["billing"]}'

# Update a score result
plexus-dashboard score-result update abc123 \
  --value 0.98 \
  --metadata '{"reviewed": true}'
```
The library provides background processing for Evaluation operations:
```python
from plexus_dashboard.api.client import PlexusDashboardClient

client = PlexusDashboardClient(
    context={
        'account_key': 'call-criteria',
        'scorecard_key': 'agent-performance'
    }
)

# Create an Evaluation (non-blocking)
client.Evaluation.create(
    type="accuracy",
    accountId="acc-123",
    scorecardId="card-123",
    parameters={
        "model": "gpt-4",
        "threshold": 0.8
    }
)

# Get an existing Evaluation
evaluation = client.Evaluation.get_by_id("exp-123")

# Update Evaluation status (non-blocking)
evaluation.update(
    status="RUNNING",
    progress=0.45,
    processedItems=45,
    totalItems=100
)

# Add Evaluation results (non-blocking)
evaluation.update(
    status="COMPLETED",
    accuracy=0.95,
    metrics={
        "precision": 0.94,
        "recall": 0.96,
        "f1": 0.95
    },
    confusionMatrix={
        "true_positive": 85,
        "false_positive": 5,
        "true_negative": 90,
        "false_negative": 4
    }
)

# Log Evaluation failure (non-blocking)
evaluation.update(
    status="FAILED",
    errorMessage="Model API timeout",
    errorDetails={
        "timestamp": "2024-03-09T08:45:23Z",
        "request_id": "req-789",
        "error_code": "TIMEOUT"
    }
)
```
The CLI provides a simulation command for testing and demonstration purposes:
```bash
# Simulate an Evaluation
plexus-dashboard Evaluation simulate \
  --account-key call-criteria \
  --scorecard-key agent-scorecard \
  --num-items 100 \
  --accuracy 0.85
```
The simulate command:
- Creates an Evaluation record
- Generates synthetic binary classification results
- Computes standard ML metrics in real-time
- Updates the Evaluation with metrics as results are generated
This is useful for:
- Testing the Evaluation tracking system
- Demonstrating the metrics calculation pipeline
- Generating sample data for UI development
- Validating metric calculations
The simulation parameters:
- `num_items`: Number of synthetic results to generate (default: 100)
- `accuracy`: Target accuracy for synthetic data (default: 0.85)
- Random delays between results (0.1-1.0 seconds)
- Random confidence scores (0.7-0.99)
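To make the flow concrete, here is an illustrative sketch of such a simulation loop; it is not the actual implementation, and it assumes `Evaluation.create` returns the created record (the real client examples above do not capture the return value):

```python
# Illustrative sketch of the simulation loop; not the actual implementation.
# Assumes Evaluation.create returns the created record.
import random
import time

def simulate(client, account_id, scorecard_id, num_items=100, accuracy=0.85):
    evaluation = client.Evaluation.create(
        type="accuracy", accountId=account_id, scorecardId=scorecard_id
    )
    correct = 0
    for i in range(num_items):
        true_value = random.choice(["Yes", "No"])
        # With probability `accuracy`, predict correctly; otherwise flip.
        if random.random() < accuracy:
            predicted = true_value
        else:
            predicted = "No" if true_value == "Yes" else "Yes"
        correct += int(predicted == true_value)
        # (The real command also attaches a random 0.7-0.99 confidence
        # score to each result.)
        time.sleep(random.uniform(0.1, 1.0))  # random delay between results
        evaluation.update(
            status="RUNNING",
            progress=(i + 1) / num_items,
            processedItems=i + 1,
            totalItems=num_items,
        )
    evaluation.update(status="COMPLETED", accuracy=correct / num_items)
```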
The simulation computes and tracks:
- Overall accuracy
- Precision
- Sensitivity (Recall)
- Specificity
- Confusion matrix
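These follow the standard definitions over confusion-matrix counts; a small self-contained sketch, checked against the example matrix shown earlier:

```python
# Standard binary-classification metrics from confusion-matrix counts.
def metrics(tp, fp, tn, fn):
    return {
        "accuracy":    (tp + tn) / (tp + fp + tn + fn),
        "precision":   tp / (tp + fp) if tp + fp else 0.0,
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,  # recall
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
    }

# With the confusion matrix shown earlier (85 / 5 / 90 / 4):
print(metrics(tp=85, fp=5, tn=90, fn=4))
# accuracy ~0.951, precision ~0.944, sensitivity ~0.955, specificity ~0.947
```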
Each result includes:
- Binary prediction (Yes/No)
- Confidence score
- Correctness flag
- Metadata with true/predicted values
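As a concrete illustration of that shape (the field names here are assumptions for illustration, not a documented schema):

```python
# Hypothetical shape of one synthetic result; field names are illustrative.
result = {
    "prediction": "Yes",       # binary prediction
    "confidence": 0.91,        # random confidence in the 0.7-0.99 range
    "correct": False,          # correctness flag
    "metadata": {
        "true_value": "No",    # ground truth for the synthetic item
        "predicted_value": "Yes"
    },
}
```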
Score result writes are batched in the background:
- Configurable batch sizes (default: 10 items)
- Configurable timeouts (default: 1 second)
- Immediate processing option for urgent data
- Automatic flushing on shutdown
- Error resilient (errors are logged but don't affect the main thread)
The client context provides:
- Lazy resolution of IDs from keys/names
- Caching of resolved IDs
- Thread-safe operations
- Graceful error handling
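The batching behavior above can be pictured as a small thread-safe buffer that flushes on size or timeout. This is a generic sketch of the technique, not the library's internal code:

```python
# Illustrative sketch of batched background writes; not the library's code.
import threading

class BatchBuffer:
    def __init__(self, flush_fn, batch_size=10, timeout=1.0):
        self._flush_fn = flush_fn    # called with a list of pending items
        self._batch_size = batch_size
        self._timeout = timeout
        self._items = []
        self._lock = threading.Lock()
        self._timer = None

    def add(self, item, immediate=False):
        with self._lock:
            self._items.append(item)
            if immediate or len(self._items) >= self._batch_size:
                self._flush_locked()
            elif self._timer is None:
                # Flush whatever has accumulated once the timeout elapses.
                self._timer = threading.Timer(self._timeout, self.flush)
                self._timer.daemon = True
                self._timer.start()

    def flush(self):
        with self._lock:
            self._flush_locked()

    def _flush_locked(self):
        if self._timer is not None:
            self._timer.cancel()
            self._timer = None
        items, self._items = self._items, []
        if items:
            try:
                self._flush_fn(items)
            except Exception as exc:  # errors are logged, never raised
                print(f"batch flush failed: {exc}")
```

Automatic flushing on shutdown can then be wired up with `atexit.register(buffer.flush)`, matching the behavior listed above.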
All client operations are thread-safe and can be used in concurrent environments.
All mutations (create/update) in the Evaluation model are performed in background threads for non-blocking operation. This allows the main application to continue while Evaluation data is being saved.
- Create operations spawn a new thread
- Update operations spawn a new thread
- Errors are logged but don't affect the main thread
- No batching (Evaluations are processed individually)
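Since all client operations are thread-safe, they can be driven from a worker pool. A minimal sketch using only the standard library (the item IDs and values here are illustrative):

```python
# Minimal sketch: posting score results from multiple worker threads.
from concurrent.futures import ThreadPoolExecutor

from plexus_dashboard.api.client import PlexusDashboardClient

client = PlexusDashboardClient(context={'account_key': 'call-criteria'})

def score_call(item_id, value):
    # Thread-safe: each call queues a non-blocking background write.
    client.ScoreResult.create(value=value, item_id=item_id)

calls = [("call_20240309_201", 0.91), ("call_20240309_202", 0.88)]
with ThreadPoolExecutor(max_workers=4) as pool:
    for item_id, value in calls:
        pool.submit(score_call, item_id, value)
```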
Background operations handle errors gracefully:
```python
# This won't block or raise errors in the main thread
evaluation.update(
    status="RUNNING",
    progress=0.5
)

# Continue immediately while update happens in background
print("Continuing with other work...")
```
This library is licensed under the MIT-0 License. See the LICENSE file.
If TypeScript type-checking suddenly becomes very slow (minutes instead of seconds), it's likely due to type complexity explosion. Here are the patterns that cause slowdowns:
- Changing interface inheritance relationships
  - Modifying which interfaces extend other interfaces
  - Changing the inheritance chain
  - Adding new type parameters to inherited interfaces
  - Importing types from other components and using them in interfaces
- Adding type assertions or complex type conversions
  - Using `as` to convert between types
  - Adding runtime type checks that affect type inference
  - Complex conditional types
  - Creating new type relationships through imports
- Adding conditional type logic
  - Making properties conditionally optional/required
  - Using complex union or intersection types
  - Nested conditional types
  - Cross-component type dependencies
When running `tsc --noEmit --pretty --diagnostics`, watch these metrics:
- Types: Should stay under 100
- Instantiations: Should stay at 0 if possible
- Check time: Should stay under 1 second
If these metrics explode (Types > 1000, Instantiations > 0, Check time > 10s), you've likely introduced a type relationship that's causing combinatorial explosion.
When you make changes to the Amplify schema, you will usually need to regenerate the JSON outputs and re-run the type check:

```bash
npx ampx generate outputs --app-id depfj4eia0tcf --branch main
npx tsc --noEmit --pretty --diagnostics
```
Run Jest tests:

```bash
npm test
```

For development with watch mode:

```bash
npm run test:watch
```

For a test coverage report:

```bash
npm run test:coverage
```

Run Storybook tests:

```bash
npm run test:storybook
```
Run pytest:

```bash
pytest plexus_dashboard/
```