Inspect AI
Government-built open-source framework for rigorous LLM and agent evaluations, popular for safety benchmarks and sandboxed agentic tasks.
| Category | Evals & Testing |
| Open source | Yes |
| Self-hostable | Yes |
| Pricing model | Free |
| Pricing notes | MIT-licensed open source; free to use, no commercial tier |
| Framework integrations | openai-sdk, anthropic, ollama, huggingface |
| Funding / ownership | Built by the UK AI Security Institute (government) |
Pricing/feature source: https://github.com/UKGovernmentBEIS/inspect_ai