Library/giskard-oss
Library/giskard-ossForked

Giskard-AI/giskard-oss

giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

Builder

Giskard-AI

Giskard-AI

Giskard-AI • individual

Stars

5,215

Using upstream star count

Forks

423

Using upstream fork count

Open Issues

0

Activity Score

0/100

57 commits in 30d

Created

Mar 6, 2022

Project creation date

README Summary

Giskard is an open-source evaluation and testing framework specifically designed for LLM (Large Language Model) agents and AI applications. It provides comprehensive testing capabilities to identify vulnerabilities, biases, and performance issues in machine learning models before deployment. The library offers automated test suites, custom test creation, and integration with popular ML frameworks to ensure AI system reliability and safety.

AI Dev Skills

Unmapped

Large Language Model EvaluationAI Model TestingBias DetectionRobustness TestingPerformance MonitoringAdversarial TestingModel ValidationQuality Assurance for AIAutomated Testing FrameworksAI Safety Assessment

Tags

Large Language Model EvaluationAI Model TestingBias DetectionRobustness TestingPerformance MonitoringAdversarial TestingModel ValidationQuality Assurance for AIAutomated Testing FrameworksAI Safety AssessmentTextDeveloper ToolsLegal TechBias Detection in ML ModelsTabularHealthcareMultimodalHR TechResponsible AISelf-hostedAI Model Vulnerability ScanningModel Performance MonitoringCloud APIOn-premiseLLM Safety TestingCI/CD IntegrationAI Agent EvaluationAI SafetyAI GovernanceAutomated Quality AssuranceLLM TestingProduction Model ValidationModel EvaluationFinTechInsuranceRegulatory Compliance TestingPython

Taxonomy

Recent Activity

Updated 24 days ago

7 Days

1

30 Days

57

90 Days

114

Quality

production
Quality
high
Maturity
production

Categories

MLOps & InfrastructurePrimaryDev Tools & AutomationIndustry: FinTechEvals & BenchmarkingObservability & MonitoringSafety & AlignmentHealthcare & BiologyFinance & LegalMultimodal AIOther AI / MLFoundation ModelsAI Agents

PM Skills

Scale & ReliabilityDeveloper Platform

Languages

Python100.0%

Timeline

Project created
Mar 6, 2022
Forked
Mar 22, 2026
Your last push
24 days ago
Upstream last push
6 days ago
Tracked since
Mar 20, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…