Library/suryaForked

datalab-to/surya

surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Builder

datalab-to

datalab-to

datalab-to • individual

Stars

19,543

Using upstream star count

Forks

1,340

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jan 10, 2024

Project creation date

README Summary

Surya is a comprehensive OCR toolkit that provides text recognition, layout analysis, reading order detection, and table recognition capabilities across 90+ languages. The tool is designed to handle complex document structures and multilingual content with high accuracy. It offers both command-line interfaces and Python APIs for easy integration into various workflows.

AI Dev Skills

Unmapped

Computer VisionOptical Character RecognitionDocument Layout AnalysisMultilingual Text ProcessingDeep Learning for VisionTransformer ArchitectureText Detection and RecognitionDocument Structure UnderstandingTable RecognitionReading Order Prediction

Tags

Computer VisionOptical Character RecognitionDocument Layout AnalysisMultilingual Text ProcessingDeep Learning for VisionTransformer ArchitectureText Detection and RecognitionDocument Structure UnderstandingTable RecognitionReading Order PredictionOn-device AILegal TechTextFinTechTable Data ExtractionMulti-language Document ProcessingGovernmentMultimodal ReasoningDocument Layout UnderstandingReceipt AnalysisCloud APIInsuranceLegal Document AnalysisSelf-hostedInvoice ProcessingHealthcarePDF Text ExtractionForm ProcessingEducationOn-premiseSmall Language ModelsAcademic Paper ProcessingPublishingDocument DigitizationImageMultimodalPythonCLI

Taxonomy

Recent Activity

Updated 1 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

beta
Quality
high
Maturity
beta

Categories

RAG & RetrievalPrimaryMultimodal AIEdge & Mobile AIOther AI / MLIndustry: FinTechHealthcare & BiologyFinance & LegalFoundation ModelsComputer Vision

PM Skills

Developer Platform

Languages

Python100.0%

Timeline

Project created
Jan 10, 2024
Forked
Mar 22, 2026
Your last push
1 months ago
Upstream last push
10 days ago
Tracked since
Mar 1, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…