NovaPilot - Intelligent Analysis Orchestrator
Automated analysis of AI agent and conversational datasets, with specialized AI agents and comprehensive reporting
NovaPilot is Noveum's intelligent orchestrator for automated analysis of AI agent and conversational datasets. It uses specialized AI agents to identify issues, validate reasoning, and generate actionable insights from evaluation scores.
What is NovaPilot?
NovaPilot is an advanced analysis engine that automatically examines evaluation results from NovaEval to identify patterns, issues, and optimization opportunities. It acts as an AI analyst that understands your agent's behavior and provides detailed recommendations for improvement.
🤖 Specialized AI Agents
Four specialized agents analyze different aspects: flow logic, prompts, tools, and general patterns
🎯 Automatic Detection
Automatically detects dataset type (agent vs conversational) and applies appropriate analysis strategies
📊 Streaming Statistics
Memory-efficient analysis of large datasets using streaming algorithms
✅ Reasoning Validation
Validates and extracts meaningful insights from evaluation reasoning
Key Features
Specialized Analysis Agents
NovaPilot employs four specialized AI agents, each focusing on different aspects of your AI system:
1. Flow Analyzer Agent
- Analyzes agent execution flow and decision-making patterns
- Identifies issues with state transitions and control flow
- Detects loops, dead ends, and inefficient paths
2. Prompt Analyzer Agent
- Examines prompt quality and effectiveness
- Identifies ambiguous or problematic prompts
- Suggests prompt improvements for better performance
3. Tool Analyzer Agent
- Evaluates tool usage patterns and effectiveness
- Identifies tool selection issues and misuse patterns
- Recommends tool configuration improvements
4. General Analyzer Agent
- Performs comprehensive cross-cutting analysis
- Identifies systemic issues and patterns
- Provides holistic recommendations
Automatic Dataset Type Detection
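The detection logic itself is not shown on this page. As a rough illustration of how an agent-versus-conversational check could work, the sketch below votes over a sample of records using marker fields. The field names (`tool_calls`, `agent_name`, `agent_task`, `messages`, `turns`, `conversation_id`) and the function `detect_dataset_type` are hypothetical and are not taken from NovaPilot.

```python
from collections import Counter

# Hypothetical marker fields; NovaPilot's real schema may differ.
AGENT_KEYS = {"tool_calls", "agent_name", "agent_task"}
CONVERSATIONAL_KEYS = {"messages", "turns", "conversation_id"}


def detect_dataset_type(records, sample_size=100):
    """Guess 'agent' or 'conversational' by voting over a sample of records."""
    votes = Counter()
    for record in records[:sample_size]:
        keys = set(record)
        agent_hits = len(keys & AGENT_KEYS)
        conv_hits = len(keys & CONVERSATIONAL_KEYS)
        if agent_hits > conv_hits:
            votes["agent"] += 1
        elif conv_hits > agent_hits:
            votes["conversational"] += 1
    if not votes:
        return "unknown"
    top = votes.most_common(2)
    # A tie between the two types is reported as unknown rather than guessed.
    if len(top) == 2 and top[0][1] == top[1][1]:
        return "unknown"
    return top[0][0]
```

Sampling only the first `sample_size` records keeps the check cheap on large datasets, at the cost of misjudging a dataset whose early records are unrepresentative.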
Batch Processing with Parallel Execution
NovaPilot processes large datasets by splitting them into batches and running those batches in parallel.
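The general pattern of batching plus parallel execution can be sketched with the Python standard library. This is an illustration of the idea only, not NovaPilot's implementation: `batched`, `analyze_batch`, and `analyze_in_parallel` are hypothetical names, and `analyze_batch` is a stand-in for whatever work an agent would do on a batch.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice


def batched(items, batch_size):
    """Yield successive lists of at most batch_size items."""
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def analyze_batch(batch):
    """Placeholder for an agent call; here it just averages the scores."""
    return sum(item["score"] for item in batch) / len(batch)


def analyze_in_parallel(items, batch_size=2, max_workers=4):
    """Run analyze_batch over all batches concurrently, preserving batch order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(analyze_batch, batched(items, batch_size)))
```

Threads suit this shape of workload when each batch spends most of its time waiting on a remote model call. CPU-bound work would usually call for processes instead.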
Comprehensive Reporting
NovaPilot generates detailed JSON reports with:
- Pre-analysis Statistics: Score distributions, pass/fail rates, statistical summaries per scorer
- Bad Score Identification: Automatically filters and prioritizes low-scoring items
- Agent Analysis: Detailed insights from each specialized agent
- Reasoning Validation: Extracted and validated reasoning from evaluation scores
- Actionable Recommendations: Concrete steps to improve your AI system
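Per-scorer statistics such as the mean, variance, and pass rate can be computed in a single pass without holding the dataset in memory. The sketch below uses Welford's online algorithm, a standard streaming technique. Whether NovaPilot uses this particular algorithm is not stated here, and the row fields `scorer` and `score`, the class `RunningStats`, and the function `pre_analyze` are assumptions made for the example.

```python
from collections import defaultdict


class RunningStats:
    """Welford's online algorithm: mean and variance in one pass, O(1) memory."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean
        self.passed = 0

    def update(self, score, threshold):
        self.count += 1
        delta = score - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (score - self.mean)
        if score >= threshold:
            self.passed += 1

    @property
    def variance(self):
        """Sample variance; 0.0 when fewer than two scores were seen."""
        return self._m2 / (self.count - 1) if self.count > 1 else 0.0

    @property
    def pass_rate(self):
        return self.passed / self.count if self.count else 0.0


def pre_analyze(rows, threshold=0.5):
    """Consume any iterable of rows and return one RunningStats per scorer."""
    stats = defaultdict(RunningStats)
    for row in rows:
        stats[row["scorer"]].update(row["score"], threshold)
    return dict(stats)
```

Because `pre_analyze` accepts any iterable, it can be fed a generator that reads rows lazily from disk.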
Analysis Workflow
NovaPilot follows a systematic four-stage analysis process:
1. Load and Pre-analyze
2. Filter and Validate
3. Analyze with Agents
4. Generate Report
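The four stages can be pictured as a small pipeline of functions. The sketch below is a simplified stand-in and not NovaPilot's API: every function name is hypothetical, the input is assumed to be JSONL lines with `score` and `reasoning` fields, and the agents are plain callables.

```python
import json


def load_scores(lines):
    """Stage 1: parse JSONL lines into score records, skipping blank lines."""
    return [json.loads(line) for line in lines if line.strip()]


def filter_and_validate(records, threshold):
    """Stage 2: keep low scores whose reasoning text is non-empty."""
    return [
        r for r in records
        if r["score"] < threshold and (r.get("reasoning") or "").strip()
    ]


def analyze(records, agents):
    """Stage 3: run every agent callable over the filtered records."""
    return {name: agent(records) for name, agent in agents.items()}


def build_report(all_records, bad_records, findings):
    """Stage 4: assemble a JSON-serializable report."""
    return {
        "total_items": len(all_records),
        "bad_items": len(bad_records),
        "findings": findings,
    }


def run_pipeline(lines, agents, threshold=0.5):
    records = load_scores(lines)
    bad = filter_and_validate(records, threshold)
    return build_report(records, bad, analyze(bad, agents))
```

Keeping each stage a separate function makes it possible to test or replace one stage without touching the others.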
Configuration Options
Score Thresholds
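The actual threshold options are not listed on this page. One plausible shape, shown purely as an assumption, is a default cutoff with per-scorer overrides. `ThresholdConfig` and its fields are hypothetical names, and scores are assumed to lie on a 0 to 1 scale.

```python
from dataclasses import dataclass, field


@dataclass
class ThresholdConfig:
    """Hypothetical config: one default cutoff plus per-scorer overrides."""

    default: float = 0.5
    per_scorer: dict = field(default_factory=dict)

    def cutoff(self, scorer):
        """Return the override for this scorer, or the default cutoff."""
        return self.per_scorer.get(scorer, self.default)

    def is_bad(self, scorer, score):
        """A score strictly below its cutoff counts as bad."""
        return score < self.cutoff(scorer)
```

A stricter override for a safety-critical scorer, for example `ThresholdConfig(per_scorer={"safety": 0.9})`, flags more of that scorer's items without changing how other scorers are treated.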
Batch Sizes
Custom Model Configuration
Dataset Format Support
Agent Datasets
NovaPilot automatically detects agent datasets with this structure:
Conversational Datasets
For conversational datasets:
Error Handling
NovaPilot raises custom exceptions so that failures can be caught and handled explicitly.
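The exception classes NovaPilot defines are not listed here. The sketch below shows the general pattern of a custom exception hierarchy with a shared base class. All class and function names (`AnalysisError`, `DatasetLoadError`, `DatasetFormatError`, `parse_dataset`, `safe_parse`) are hypothetical.

```python
import json


class AnalysisError(Exception):
    """Hypothetical base class for all analysis failures."""


class DatasetLoadError(AnalysisError):
    """Raised when the dataset cannot be read or parsed."""


class DatasetFormatError(AnalysisError):
    """Raised when the dataset parses but lacks required fields."""


def parse_dataset(text):
    """Parse a JSON array of score records, raising typed errors on failure."""
    try:
        records = json.loads(text)
    except json.JSONDecodeError as exc:
        raise DatasetLoadError(f"invalid JSON: {exc}") from exc
    well_formed = isinstance(records, list) and all(
        isinstance(r, dict) and "score" in r for r in records
    )
    if not well_formed:
        raise DatasetFormatError("expected a list of objects with a 'score' field")
    return records


def safe_parse(text):
    """Return (records, error_message); callers catch one base class."""
    try:
        return parse_dataset(text), None
    except AnalysisError as exc:
        return [], f"{type(exc).__name__}: {exc}"
```

A shared base class lets callers handle every analysis failure with one `except` clause while still distinguishing specific causes when they need to.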
Example Reports
NovaPilot generates comprehensive JSON reports saved to your output directory:
Report Structure
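The exact report schema is not reproduced on this page. As an assumption-laden sketch, a report covering the sections listed under Comprehensive Reporting could be assembled and written as follows. The key names and the functions `assemble_report` and `save_report` are hypothetical.

```python
import json
from pathlib import Path


def assemble_report(pre_analysis, bad_scores, agent_analysis,
                    reasoning_validation, recommendations):
    """Collect the report sections into one JSON-serializable dict."""
    return {
        "pre_analysis": pre_analysis,
        "bad_scores": bad_scores,
        "agent_analysis": agent_analysis,
        "reasoning_validation": reasoning_validation,
        "recommendations": recommendations,
    }


def save_report(report, output_dir, name="report.json"):
    """Write the report as indented JSON, creating the directory if needed."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    path = out / name
    path.write_text(json.dumps(report, indent=2), encoding="utf-8")
    return path
```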
Integration with NovaEval
NovaPilot works seamlessly with NovaEval evaluation results:
Best Practices
1. Start with Pre-analysis
Always enable pre-analysis to understand your dataset before deep analysis:
2. Use Appropriate Thresholds
Adjust thresholds based on your quality requirements:
3. Optimize Batch Sizes
Tune batch sizes based on your dataset size and available memory:
4. Organize Reports
Use descriptive output directories:
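One way to keep report directories descriptive is to combine a timestamp with a slug of the dataset name. The helper below is a generic sketch using only the standard library. `report_dir` is a hypothetical name and the naming scheme is a suggestion, not a NovaPilot convention.

```python
import re
from datetime import datetime, timezone
from pathlib import Path


def report_dir(base, dataset_name, when=None):
    """Build a path like <base>/2024-01-15_093000_support-bot-v2."""
    when = when or datetime.now(timezone.utc)
    # Lowercase the name and collapse anything non-alphanumeric into hyphens.
    slug = re.sub(r"[^a-z0-9]+", "-", dataset_name.lower()).strip("-")
    return Path(base) / f"{when:%Y-%m-%d_%H%M%S}_{slug}"
```

Putting the timestamp first means directories sort chronologically in a file listing.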
Support
- Integration: Works with NovaEval evaluation results
- Tracing: Requires data from noveum-trace
- Platform: https://noveum.ai/
- Email: support@noveum.ai
Next Steps
- NovaEval Overview - Learn about evaluation scorers
- Getting Started - Set up your first project
- Integration Examples - See complete workflows
- Dashboard Guide - Visualize your results
Ready to get automated insights from your AI evaluations? Integrate NovaPilot with your NovaEval workflow!