Inspect AI

Government-built open-source framework for rigorous LLM and agent evaluations, popular for safety benchmarks and sandboxed agentic tasks.

Category	Evals & Testing
Open source	Yes
Self-hostable	Yes
Pricing model	free
Pricing notes	MIT OSS, free; no commercial tier
Framework integrations	openai-sdk, anthropic, ollama, huggingface
Funding / ownership	Built by the UK AI Security Institute (government)