Supported Models and Use Cases
A comprehensive evaluation framework that spans the entire AI development lifecycle

Machine Learning
- Data Drift
- Classification Rates
- Root Mean Square
- Precision & Recall
- Many More
Generative AI
- Hallucination Rates
- Data Security Controls
- Acceptable Use Policies
- Domain-specific Evals, inc. custom code
- Inference & hallucination count
- Pass & Fail rates for Toxicity, PII & Sensitive Data
- Tokens & Model cost





Agentic AI






See what Arthur can do for you.
