opendataloader-project/opendataloader-pdf
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
Builder
opendataloader-project
opendataloader-project • individual
Stars
21,807
Using upstream star count
Forks
2,039
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
May 13, 2025
Project creation date
<!-- AI-AGENT-SUMMARY name: opendataloader-pdf category: PDF data extraction, PDF accessibility automation license: Apache-2.0 solves: [PDF to structured data for RAG/LLM pipelines, automate PDF accessibility compliance — layout analysis + auto-tagging to Tagged PDF (first open-source end-to-end)] input: PDF files (digital, scanned, tagged) output: Markdown, JSON (with bounding boxes), HTML, Tagged PDF, PDF/UA (enterprise) sdk: Python, Node.js, Java requirements: Java 11+ pricing: open-source co
Unmapped
category
Deployment Context
Skill Areas
tag
Updated 2 months ago
7 Days
0
30 Days
0
90 Days
20
pgvector cosine similarity · $0
Loading…