Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/promptfoo
Library/promptfooForked

promptfoo/promptfoo

promptfoo

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

View on GitHub↗Upstream promptfoo/promptfoo↗

Builder

promptfoo

promptfoo

promptfoo • individual

Stars

21,716

Using upstream star count

Forks

1,914

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Apr 28, 2023

Project creation date

README Summary

<p align="center"> <a href="https://npmjs.com/package/promptfoo"><img src="https://img.shields.io/npm/v/promptfoo" alt="npm"></a> <a href="https://npmjs.com/package/promptfoo"><img src="https://img.shields.io/npm/dm/promptfoo" alt="npm"></a> <a href="https://github.com/promptfoo/promptfoo/actions/workflows/main.yml"><img src="https://img.shields.io/github/actions/workflow/status/promptfoo/promptfoo/main.yml" alt="GitHub Workflow Status"></a> <a href="https://github.com/promptfoo/promptfo

Community Evaluation

Loading…

AI Dev Skills

Unmapped

AI Red TeamingAI Security TestingAI System TestingAI Vulnerability AssessmentLanguage Model EvaluationMulti-model ComparisonPrompt EngineeringRetrieval-Augmented Generation

Tags

AI Red TeamingAI Security TestingAI System TestingAI Vulnerability AssessmentLanguage Model EvaluationMulti-model ComparisonPrompt EngineeringRetrieval-Augmented GenerationAWS BedrockAnthropic / ClaudeCLI ToolCachingClaudeEvalsForkedLarge Language ModelsNode.jsOllamaOpen SourceOpenAIPromptFooRed TeamingSecurityTutorial

Taxonomy

AI Trends

Agentic AIAI SafetyCompound AI SystemsRed TeamingAI Evaluation

category

Foundation ModelsEvals & BenchmarkingInference & ServingDev Tools & AutomationLearning ResourcesSecurity & Safety

Deployment Context

Command LineCI/CD PipelineSelf-hosted

Industries

Developer Tools

Modalities

Text

Skill Areas

Prompt EngineeringRetrieval-Augmented GenerationAI Red TeamingAI Security TestingLanguage Model EvaluationAI Vulnerability AssessmentMulti-model ComparisonAI System Testing

tag

AWS BedrockAnthropic / ClaudeCLI ToolCachingClaudeEvalsForkedLarge Language ModelsNode.jsOllamaOpen SourceOpenAIPromptFooRed TeamingSecurityTutorial

Use Cases

AI Prompt TestingLanguage Model BenchmarkingAI Security AssessmentRAG System EvaluationAI Agent TestingPrompt OptimizationAI Penetration TestingMulti-LLM Performance Comparison

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

docs(site): refresh red team report screenshots (#8125)

Ian Webster • Mar 13, 2026

6877ba3

fix(redteam): fix Rules of Hooks violation in RiskCategoryDrawer (#8072)

Evan Bonsignori • Mar 12, 2026

afb2732

feat(app): add manual filtering to DataTable and improve filter UX (#8122)

Faizan Minhas • Mar 12, 2026

46dfdf5

Quality

production
Quality
high
Maturity
production

Categories

Evals & BenchmarkingPrimaryInference & ServingDev Tools & AutomationLearning ResourcesSecurity & SafetyFoundation ModelsOther AI / ML

PM Skills

Cost & EfficiencySafety & AlignmentData & EvaluationDeveloper Platform

Languages

TypeScript100.0%

Timeline

Project created
Apr 28, 2023
Forked
Mar 13, 2026
Your last push
2 months ago
Upstream last push
15 days ago
Tracked since
Mar 13, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…