Accuracy ScorerRule-Based

MultiPatternAccuracyScorer

Evaluates prediction accuracy against multiple acceptable patterns or answers. Ideal for tasks where multiple correct answers exist, with regex support for flexible pattern matching.

Overview

Evaluates prediction accuracy against multiple acceptable patterns or answers. Ideal for tasks where multiple correct answers exist, with regex support for flexible pattern matching.

accuracyrule-basedbenchmarkpatternsregexflexible

Use Cases

  • Accuracy benchmarking and validation

How It Works

This scorer uses deterministic rule-based evaluation to validate outputs against specific criteria. It applies predefined rules and patterns to assess the response, providing consistent and reproducible results without requiring LLM inference.

Input Schema

ParameterTypeRequiredDescription
predictionstrYesGenerated output
ground_truthstrYesPrimary expected output
context.patternslist[str]NoAdditional acceptable patterns (regex supported)

Output Schema

FieldTypeDescription
scorefloat10.0 if any pattern matches, 0.0 otherwise
passedboolTrue if any pattern matched
reasoningstrPattern match details
metadatadictWhich patterns matched

Score Interpretation

Default threshold: 7/10

10Perfect MatchOutput exactly matches expected format/value
0No MatchOutput does not match expected format/value

Frequently Asked Questions

When should I use this scorer?

Use MultiPatternAccuracyScorer when you need to evaluate accuracy and rule-based aspects of your AI outputs. It's particularly useful for accuracy benchmarking and validation.

Why does this scorer need expected output?

This scorer compares the generated output against a known expected result to calculate accuracy metrics.

Can I customize the threshold?

Yes, the default threshold of 7 can be customized when configuring the scorer.

Quick Info

CategoryAccuracy
Evaluation TypeRule-Based
Requires Expected OutputYes
Default Threshold7/10

Ready to try MultiPatternAccuracyScorer?

Start evaluating your AI agents with Noveum.ai's comprehensive scorer library.

Explore More Scorers

Discover 68+ LLM-as-Judge scorers for comprehensive AI evaluation