Home / Best lists
Best self-hostable LLM observability tools
Quick answer
Top pick by our maturity signal: Grafana Cloud AI Observability. Below are all 30 self-hostable + in the observability, debugging or cost categories tools we track, ranked by the same objective GitHub-derived score. Maturity measures adoption and upkeep, not subjective quality — pick by your own constraints.
LLM observability and tracing tools you can run inside your own infrastructure, ranked by our public GitHub maturity signal. Ranking method is public — see methodology. Note: maturity reflects total GitHub adoption, so large general-purpose platforms (e.g. Grafana, Sentry, PostHog) can rank high on the strength of their parent project even where their LLM-specific features are newer — read the flags and pick by your constraints. Listings are free and editorially independent; sponsorship never changes facts or ranking.
| # | Tool | Maturity | Pricing | Flags |
|---|---|---|---|---|
| 1 | Grafana Cloud AI Observability | 100/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 2 | MLflow (Tracing & GenAI) | 100/100 (Mature) | free | OSS, self-host, OTel-native |
| 3 | Opik (Comet) | 100/100 (Mature) | freemium | OSS, self-host |
| 4 | Portkey | 100/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 5 | TensorZero | 100/100 (Mature) | free | OSS, self-host, OTel-native |
| 6 | Lago | 100/100 (Mature) | freemium | OSS, self-host |
| 7 | Traceloop (OpenLLMetry) | 98/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 8 | Helicone | 97/100 (Mature) | freemium | OSS, self-host |
| 9 | Pydantic Logfire | 95/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 10 | Laminar | 93/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 11 | OpenLIT | 92/100 (Mature) | free | OSS, self-host, OTel-native |
| 12 | LiteLLM | 90/100 (Mature) | freemium | OSS, self-host |
| 13 | Sentry AI Agent Monitoring | 90/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 14 | PostHog LLM Analytics | 90/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 15 | Langfuse | 90/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 16 | SigNoz | 90/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 17 | Arize Phoenix | 90/100 (Mature) | free | OSS, self-host, OTel-native |
| 18 | OpenMeter | 90/100 (Mature) | freemium | OSS, self-host |
| 19 | W&B Weave | 87/100 (Mature) | freemium | OSS, self-host, OTel-native |
| 20 | Tokencost | 61/100 (Established) | free | OSS, self-host |
| 21 | Invariant Labs | 60/100 (Established) | freemium | OSS, self-host |
| 22 | Langtrace | 58/100 (Growing) | freemium | OSS, self-host, OTel-native |
| 23 | WhyLabs (whylogs/LangKit) | 57/100 (Growing) | free | OSS, self-host |
| 24 | Phospho | 52/100 (Growing) | freemium | OSS, self-host |
| 25 | Arize AX | — | freemium | self-host, OTel-native |
| 26 | Fiddler AI | — | enterprise | self-host, OTel-native |
| 27 | HoneyHive | — | freemium | self-host, OTel-native |
| 28 | LangSmith | — | freemium | self-host, OTel-native |
| 29 | Lunary | — | freemium | OSS, self-host, OTel-native |
| 30 | TrueFoundry | — | freemium | self-host, OTel-native |
Frequently asked questions
What is the best self-hostable LLM observability tools?
By our public maturity signal (GitHub stars + recency + license), Grafana Cloud AI Observability ranks highest among the 30 self-hostable + in the observability, debugging or cost categories tools we track. Maturity reflects adoption and upkeep, not subjective quality.
How is this ranking decided?
Tools are ranked by a reproducible maturity score computed only from public GitHub signals (log of stars + last-commit recency + license). The formula is published on our methodology page; ranking is never sold.