←Library/tesseract.js
Library/tesseract.jsForked

naptha/tesseract.js

tesseract.js

Pure Javascript OCR for more than 100 Languages πŸ“–πŸŽ‰πŸ–₯

Builder

naptha

naptha

naptha β€’ individual

Stars

37,980

Using upstream star count

Forks

2,364

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Jun 24, 2015

Project creation date

README Summary

Tesseract.js is a pure JavaScript port of the Tesseract OCR engine that runs in browsers and Node.js environments. It provides optical character recognition capabilities for over 100 languages without requiring any native dependencies or server-side processing. The library offers both simple and advanced APIs for extracting text from images with customizable recognition options.

AI Dev Skills

Unmapped

Optical Character RecognitionComputer VisionWebAssembly IntegrationImage PreprocessingMulti-language Text RecognitionBrowser-based Machine Learning

Tags

Optical Character RecognitionComputer VisionWebAssembly IntegrationImage PreprocessingMulti-language Text RecognitionBrowser-based Machine LearningDocument ManagementOn-device AIAccessibility Text ReadingForm Data ExtractionEducationImageReceipt ProcessingEdge/MobileDocument DigitizationBrowser-based MLLicense Plate RecognitionBrowser/WASMLegal TechE-commerceEdge ComputingTextHealthcareImage-to-Text ConversionPublishingClient-sideFinTechAutomated Data EntryJavaScript

Taxonomy

Recent Activity

Updated 1 months ago

7 Days

0

30 Days

0

90 Days

0

Quality

production
Quality
high
Maturity
production

Categories

Industry: FinTechPrimaryCoding & Dev ToolsHealthcare & BiologyFinance & LegalEdge & Mobile AIOther AI / MLComputer Vision

PM Skills

Developer Platform

Languages

JavaScript100.0%

Timeline

Project created
Jun 24, 2015
Forked
Mar 23, 2026
Your last push
1 months ago
Upstream last push
1 months ago
Tracked since
Feb 28, 2026

Similar Repos

pgvector cosine similarity Β· $0

Loading…