datalab-to/surya
surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Builder

datalab-to
datalab-to • individual
Stars
19,543
Using upstream star count
Forks
1,340
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Jan 10, 2024
Project creation date
README Summary
Surya is a comprehensive OCR toolkit that provides text recognition, layout analysis, reading order detection, and table recognition capabilities across 90+ languages. The tool is designed to handle complex document structures and multilingual content with high accuracy. It offers both command-line interfaces and Python APIs for easy integration into various workflows.
AI Dev Skills
Unmapped
Tags
Taxonomy
Deployment Context
Modalities
Skill Areas
Recent Activity
Updated 1 months ago
7 Days
0
30 Days
0
90 Days
0
Quality
beta- Quality
- high
- Maturity
- beta
Categories
PM Skills
Languages
Timeline
- Project created
- Jan 10, 2024
- Forked
- Mar 22, 2026
- Your last push
- 1 months ago
- Upstream last push
- 10 days ago
- Tracked since
- Mar 1, 2026
Similar Repos
pgvector cosine similarity · $0
Loading…