truera/trulens

trulens

Evaluation and Tracking for LLM Experiments and AI Agents

Builder

truera • individual

Stars

3,220

Using upstream star count

Forks

257

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Nov 2, 2020

Project creation date

README Summary

TruLens is an evaluation and tracking framework for Large Language Model (LLM) experiments and AI agents. It provides comprehensive tools for monitoring, evaluating, and improving LLM applications through feedback functions, performance tracking, and experiment management. The framework enables developers to assess the quality, safety, and effectiveness of their AI systems with built-in metrics and customizable evaluation criteria.

AI Dev Skills

Unmapped

Large Language Model Evaluation, AI Agent Monitoring, Retrieval-Augmented Generation Assessment, LLM Application Testing, Machine Learning Operations, AI System Observability, Natural Language Processing Evaluation, Prompt Engineering Optimization, AI Quality Assurance, LLM Performance Metrics

Tags

Large Language Model Evaluation, AI Agent Monitoring, Retrieval-Augmented Generation Assessment, LLM Application Testing, Machine Learning Operations, AI System Observability, Natural Language Processing Evaluation, Prompt Engineering Optimization, AI Quality Assurance, LLM Performance Metrics, Developer Tools, Cloud API, Enterprise AI, Self-hosted, AI/ML Platforms, Responsible AI, LLM Application Evaluation, Text, AI Application Testing, RAG System Assessment, AI System Debugging, Retrieval-Augmented Generation, LLM Deployment Validation, On-premise, Model Performance Benchmarking, AI Observability, AI Safety, AI Agent Performance Tracking, Agentic AI, LLM Response Quality Monitoring, AI Evaluation, LLMOps, Python

Taxonomy

Recent Activity

Updated 23 days ago

7 Days

0

30 Days

0

90 Days

0

Quality

Quality: high
Maturity: beta

Categories

RAG & Retrieval (Primary), Evals & Benchmarking, Observability & Monitoring, Inference & Serving, NLP & Text, ML Platform & Infrastructure, Safety & Alignment, Other AI / ML, Dev Tools & Automation, Foundation Models, AI Agents

PM Skills

Scale & Reliability

Languages

Python 100.0%

Timeline

Project created: Nov 2, 2020
Forked: Mar 22, 2026
Your last push: 23 days ago
Upstream last push: 7 days ago
Tracked since: Mar 21, 2026
