Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/presidio
Library/presidioForked

microsoft/presidio

presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

View on GitHub↗Upstream microsoft/presidio↗

Builder

Microsoft

Microsoft

microsoft • big-tech

Stars

8,366

Using upstream star count

Forks

1,079

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 4, 2018

Project creation date

README Summary

Presidio - Data Protection and De-identification SDK

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Computer Vision for PII DetectionData Privacy EngineeringInformation ExtractionMachine Learning Pipeline DesignNamed Entity RecognitionNatural Language ProcessingPattern MatchingPrivacy-Preserving MLText AnalyticsTransformer-based NER Models

Tags

Computer Vision for PII DetectionData Privacy EngineeringInformation ExtractionMachine Learning Pipeline DesignNamed Entity RecognitionNatural Language ProcessingPattern MatchingPrivacy-Preserving MLText AnalyticsTransformer-based NER ModelsDockerFinTechForkedHealthcare AIKubernetesOpen SourcePythonSecuritySparkTutorial

Taxonomy

AI Trends

AI SafetyPrivacy-Preserving AIResponsible AIData Governance

category

MLOps & InfrastructureDev Tools & AutomationLearning ResourcesIndustry: HealthcareIndustry: FinTechSecurity & Safety

Deployment Context

Self-hostedOn-premiseCloud APIServerless

Industries

HealthcareFinTechLegal TechGovernmentInsuranceHuman Resources

Modalities

TextImageTabular

Skill Areas

Named Entity RecognitionNatural Language ProcessingPattern MatchingData Privacy EngineeringText AnalyticsComputer Vision for PII DetectionMachine Learning Pipeline DesignPrivacy-Preserving MLInformation ExtractionTransformer-based NER Models

tag

DockerFinTechForkedHealthcare AIKubernetesOpen SourcePythonSecuritySparkTutorial

Use Cases

PII Detection in DocumentsData Anonymization for GDPR ComplianceSensitive Information RedactionPrivacy-Safe Data SharingLog File SanitizationDatabase AnonymizationDocument Privacy ScanningCompliance Automation

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

11

build(deps): bump azure/login from 2.3.0 to 3.0.0 (#1915)

dependabot[bot] • Mar 22, 2026

954d5f7

feat(analyzer): add German PII recognizers (DE_*) (#1909)

Michael van den Berg • Mar 19, 2026

98f79b9

Add clarity cookie consent to docs site (#1908)

Sharon Hart • Mar 16, 2026

2450561

Quality

production
Quality
high
Maturity
production

Categories

Healthcare & BiologyPrimaryFinance & LegalOther AI / MLMLOps & InfrastructureDev Tools & AutomationLearning ResourcesIndustry: HealthcareIndustry: FinTechSecurity & Safety

PM Skills

Scale & Reliability

Languages

Python100.0%

Timeline

Project created
May 4, 2018
Forked
Mar 22, 2026
Your last push
2 months ago
Upstream last push
17 days ago
Tracked since
Mar 22, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…