huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Builder
HuggingFace
huggingface • ai-lab
Stars
10,779
Using upstream star count
Forks
1,108
Using upstream fork count
Open Issues
0
Activity Score
0/100
0 commits in 30d
Created
Nov 1, 2019
Project creation date
Updated 2 months ago
7 Days
0
30 Days
0
90 Days
0
Fix multithreaded concurrency test to use shared tokenizer instance (#1950)
Shintaro Murakami • Feb 27, 2026
Bump minimatch from 3.1.2 to 3.1.3 in /bindings/node (#1955)
dependabot[bot] • Feb 25, 2026
Update to PyO3 0.28 to automatically disable GIL (#1948)
Nathan Goldbaum • Feb 25, 2026