allenai/olmocr
Toolkit for linearizing PDFs for LLM datasets/training
Builder: Allen AI (allenai • ai-lab)
Stars: 17,360 (upstream count)
Forks: 1,392 (upstream count)
Open Issues: 0
Activity Score: 0/100 (0 commits in 30d)
Created: Sep 17, 2024
Updated 2 months ago
7 Days: 0
30 Days: 0
90 Days: 11