AI Dev Skills
Tracking every LLM call, prompt, response, latency, cost and quality metric in production. Observability gives you full visibility into what your AI system is actually doing at runtime.
You can't improve what you can't measure. Cost overruns, quality regressions, and silent failures are completely invisible without observability. Every production AI incident investigation starts here.
Langfuse, Phoenix, and OpenLIT are the leading open source tools. OpenTelemetry is becoming the standard tracing protocol. The space has consolidated significantly in 2025.
Having 3+ observability repos signals a team that takes production AI seriously. They are monitoring costs, tracking prompt versions, and running LLM-as-judge evaluations on live traffic.
No repos in this skill area yet.