Promptfoo vs Confident AI (DeepEval)

Side-by-side comparison from the Agent Observability Index: licensing, self-hosting, pricing model and integrations — no vendor copy, primary sources linked.

PromptfooConfident AI (DeepEval)
One-linerConfig-file-driven open-source CLI for prompt evals, regression testing and LLM red-teaming that runs in CI.Pytest-style open-source LLM evaluation framework (DeepEval) with a hosted platform for benchmarking, regression testing and red-teaming (DeepTeam).
CategoryEvals & TestingEvals & Testing
Open sourceYesYes
Self-hostableYesYes
Pricing modelfreemiumfreemium
Pricing notesCLI/library free (MIT); paid enterprise for red-teaming at scaleDeepEval framework free OSS (Apache-2.0); Confident AI cloud has free + paid tiers
Frameworksopenai-sdk, anthropic, langchain, ollama, vercel-aiopenai-sdk, langchain, llamaindex, anthropic
Funding / ownership$5M seed (a16z, 2024)YC W25; $2.2M seed

How to choose

Sources: Promptfoo · Confident AI (DeepEval)