Library/pdfplumber
Library/pdfplumberForked

jsvine/pdfplumber

pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Builder

jsvine

jsvine

jsvine • individual

Stars

10,035

Using upstream star count

Forks

875

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Aug 24, 2015

Project creation date

README Summary

pdfplumber is a Python library that provides detailed access to PDF document elements including characters, rectangles, lines, and other visual components. It enables precise text extraction and table parsing from PDF files with fine-grained control over the extraction process. The library offers both programmatic access to PDF structure and convenient methods for extracting formatted content.

AI Dev Skills

Unmapped

Document ProcessingNatural Language Processing PipelineData Extraction and ETLText MiningDocument Intelligence

Tags

Document ProcessingNatural Language Processing PipelineData Extraction and ETLText MiningDocument IntelligenceServerlessInsuranceSelf-hostedEducationFinTechResearch Paper ProcessingLegal TechOn-premiseInformation ExtractionDocument DigitizationFinancial Report ProcessingGovernmentText Mining from Scanned DocumentsDocument AIMultimodal Data ProcessingTabularTextCloud APIHealthcareLegal Document AnalysisDocument Data ExtractionTable Extraction from PDFsPublishingPython

Taxonomy

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

MLOps & InfrastructurePrimaryLearning ResourcesIndustry: FinTechRAG & RetrievalNLP & TextML Platform & InfrastructureHealthcare & BiologyFinance & LegalMultimodal AISearch & KnowledgeOther AI / MLFoundation Models

PM Skills

Scale & ReliabilityDeveloper Platform

Languages

Python100.0%

Timeline

Project created
Aug 24, 2015
Forked
Mar 16, 2026
Your last push
2 months ago
Upstream last push
2 months ago
Tracked since
Jan 28, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…