Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/docling
Library/doclingForked

docling-project/docling

docling

Get your documents ready for gen AI

View on GitHub↗Upstream docling-project/docling↗

Builder

docling-project

docling-project

docling-project • individual

Stars

60,628

Using upstream star count

Forks

4,223

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jul 9, 2024

Project creation date

README Summary

<p align="center"> <a href="https://github.com/docling-project/docling"> <img loading="lazy" alt="Docling" src="https://github.com/docling-project/docling/raw/main/docs/assets/docling_processing.png" width="100%"/> </a> </p>

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Computer Vision for Document AnalysisDocument Layout AnalysisDocument ProcessingImage-to-Text ConversionMulti-modal Content ExtractionOptical Character Recognition (OCR)PDF ProcessingRetrieval-Augmented Generation (RAG) PreprocessingTable Structure Recognition

Tags

Computer Vision for Document AnalysisDocument Layout AnalysisDocument ProcessingImage-to-Text ConversionMulti-modal Content ExtractionOptical Character Recognition (OCR)PDF ProcessingRetrieval-Augmented Generation (RAG) PreprocessingTable Structure RecognitionAI AgentsCrewAIFinTechForkedHaystackHuggingFaceLangChainLlamaIndexMCPOpen SourcePydanticPythonResearch / PapersSpeech to TextStructured OutputTutorial

Taxonomy

AI Trends

Retrieval-Augmented Generation (RAG)Compound AI SystemsDocument AIMultimodal AI

category

AI AgentsFoundation ModelsRAG & RetrievalGenerative MediaLearning ResourcesIndustry: FinTech

Deployment Context

Self-hostedCloud APIOn-premiseDocker Containers

Industries

Legal TechFinancial ServicesHealthcareEnterprise Document ManagementResearch and AcademiaCompliance and Audit

Modalities

TextImageMultimodalTabular

Skill Areas

Document ProcessingOptical Character Recognition (OCR)Computer Vision for Document AnalysisMulti-modal Content ExtractionRetrieval-Augmented Generation (RAG) PreprocessingDocument Layout AnalysisTable Structure RecognitionPDF ProcessingImage-to-Text Conversion

tag

AI AgentsCrewAIDocument ProcessingFinTechForkedHaystackHuggingFaceLangChainLlamaIndexMCPOpen SourcePydanticPythonResearch / PapersSpeech to TextStructured OutputTutorial

Use Cases

Document Question AnsweringRAG System Data PreparationEnterprise Knowledge Base CreationDocument Digitization and ArchivalAutomated Document Processing PipelinesMulti-format Document IngestionTable Data ExtractionDocument Content Search and Indexing

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

20

chore: bump version to 2.81.0 [skip ci]

github-actions[bot] • Mar 20, 2026

4e650af

fix(docx): Missing list items after numbered header (#2665) (#2678)

Emre Çalışır • Mar 20, 2026

2f7c09e

feat: route plain-text and Quarto/R Markdown files to the Markdown backend (#3161)

Peter W. J. Staar • Mar 20, 2026

96d7c7e

Quality

beta
Quality
high
Maturity
beta

Categories

RAG & RetrievalPrimaryLearning ResourcesIndustry: FinTechFoundation ModelsAI AgentsGenerative MediaFinance & LegalSearch & KnowledgeOther AI / ML

PM Skills

User ExperienceProduct DiscoveryDeveloper PlatformAI-Native Architecture

Languages

Python100.0%

Timeline

Project created
Jul 9, 2024
Forked
Mar 22, 2026
Your last push
2 months ago
Upstream last push
15 days ago
Tracked since
Mar 20, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…