Evals & Testing

Google Stax

Experimental developer tool from Google Labs for LLM evaluation with human labeling and LLM-as-judge autoraters.

Website ↗
CategoryEvals & Testing
Open sourceNo
Self-hostableNo
Pricing modelfree
Pricing notesExperimental Google Labs tool (free as of launch)
Framework integrations
Funding / ownershipGoogle Labs / DeepMind experimental product (launched 2025-08)

Pricing/feature source: https://developers.googleblog.com/en/streamline-llm-evaluation-with-stax/