Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/PaddleOCR
Library/PaddleOCRForked

PaddlePaddle/PaddleOCR

PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

View on GitHub↗Upstream PaddlePaddle/PaddleOCR↗

Builder

PaddlePaddle

PaddlePaddle

PaddlePaddle • individual

Stars

79,000

Using upstream star count

Forks

10,521

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

May 8, 2020

Project creation date

README Summary

<div align="center"> <p> <img width="100%" src="./docs/images/Banner.png" alt="PaddleOCR Banner"> </p>

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Computer VisionConvolutional Neural NetworksDeep LearningDocument Layout AnalysisDocument ProcessingMobile AI DeploymentModel OptimizationMultilingual NLPOptical Character Recognition (OCR)Text Detection and Recognition

Tags

Computer VisionConvolutional Neural NetworksDeep LearningDocument Layout AnalysisDocument ProcessingMobile AI DeploymentModel OptimizationMultilingual NLPOptical Character Recognition (OCR)Text Detection and RecognitionAnthropic / ClaudeBenchmarkingC++ClaudeData ScienceDockerEmbeddingsEvalsForkedGPU / CUDAHuggingFaceLLM ServingLarge Language ModelsMCPMobileMultimodal AINode.jsNumPyONNXOllamaOpenAIPandasPythonReal-Time / StreamingResearch / PapersTensorRTTutorialvLLM

Taxonomy

AI Trends

Multimodal AIDocument AIEdge AIOn-device AIModel CompressionCross-lingual AI

category

Inference & ServingFoundation ModelsAI AgentsRAG & RetrievalEvals & BenchmarkingMLOps & InfrastructureDev Tools & AutomationLearning ResourcesData Science & Analytics

Deployment Context

Self-hostedCloud APIEdge/MobileOn-premiseDockerServerless

Industries

Document ManagementLegal TechFinTechHealthcareEducationPublishingGovernmentInsurance

Modalities

ImageTextMultimodal

Skill Areas

Optical Character Recognition (OCR)Computer VisionDeep LearningText Detection and RecognitionDocument ProcessingConvolutional Neural NetworksModel OptimizationMobile AI DeploymentMultilingual NLPDocument Layout Analysis

tag

Anthropic / ClaudeBenchmarkingC++ClaudeData ScienceDockerDocument ProcessingEmbeddingsEvalsForkedGPU / CUDAHuggingFaceLLM ServingLarge Language ModelsMCPMobileModel OptimizationMultimodal AINode.jsNumPyONNXOllamaOpenAIPandasPythonReal-Time / StreamingResearch / PapersTensorRTTutorialvLLM

Use Cases

Document DigitizationPDF Text ExtractionInvoice ProcessingForm RecognitionReceipt ScanningLicense Plate RecognitionBusiness Card ProcessingDocument Question AnsweringAutomated Data EntryContent Moderation

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

10

Remove support for .env for security reasons (#17809)

Lin Manhui • Mar 13, 2026

428bb33

Enhance skills (#17801)

Lin Manhui • Mar 13, 2026

b905b20

[Fix] Fix incorrect env var name and update for security (#17799)

Lin Manhui • Mar 12, 2026

fed730d

Quality

production
Quality
high
Maturity
production

Categories

Inference & ServingPrimaryRAG & RetrievalEvals & BenchmarkingMLOps & InfrastructureDev Tools & AutomationLearning ResourcesData Science & AnalyticsFoundation ModelsAI AgentsMultimodal AIEdge & Mobile AISearch & KnowledgeOther AI / ML

PM Skills

Cost & EfficiencyUser ExperienceScale & ReliabilityData & EvaluationProduct DiscoveryDeveloper Platform

Languages

Python100.0%

Timeline

Project created
May 8, 2020
Forked
Mar 16, 2026
Your last push
2 months ago
Upstream last push
16 days ago
Tracked since
Mar 16, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…