Hugging Face
Models
250
Sort: Trending
Active filters: bpe
espnet/xeus • Automatic Speech Recognition • Updated Jun 17, 2025 • 36 • 145
almaghrabima/SARFTokenizer • Text Generation • Updated about 1 month ago • 1
gsar78/Greek_Tokenizer • Updated Aug 5, 2024 • 1
Acct2325449/asset-keywords-tokenizer • Updated Sep 18, 2024
charanhu/kannada-tokenizer • Text Generation • Updated Nov 12, 2024 • 2
aloobun/IN-Llama-3-Tokenizer • Updated Mar 7, 2025
Saiteja/telugu-bpe • Updated Jan 5, 2025 • 4
aayushraina/hindi-bpe-tokenizer • Text Generation • Updated Jan 11, 2025
aayushraina/bpe-hindi • Text Generation • Updated Jan 11, 2025
prithivMLmods/Bpe-vocab-n-OCR • Image-to-Text • Updated May 3, 2025 • 2 • 4
mradermacher/Tokenized-OCR-GGUF • 2B • Updated Jul 31, 2025 • 321 • 1
mradermacher/Tokenized-OCR-i1-GGUF • 2B • Updated Jul 11, 2025 • 115
muzaffercky/kurdish-kurmanji-tokenizer • Updated Jun 30, 2025 • 3
mradermacher/Bpe-vocab-n-OCR-GGUF • 2B • Updated Jul 31, 2025 • 540
mradermacher/Bpe-vocab-n-OCR-i1-GGUF • 2B • Updated Jul 11, 2025 • 79 • 1
Hailay/geez-tokenizer • Token Classification • Updated Jul 15, 2025
Hengzongshu/Chinse_BBPE_Vocab • Text Classification • Updated Jun 1, 2025
Vipplav/telugu-bpe-23k • Updated Jun 19, 2025
Max1798/my-tokenizer • Updated Jul 17, 2025 • 2
amirhofo/Persian-BPE-Tokenizer • Updated Jul 24, 2025
yakul259/english-bpe-tokenizer-60k • Updated Aug 14, 2025
yakul259/finance-bpe-tokenizer-30k • Updated Aug 15, 2025
toksuite/common-pile-comma-v0.1 • Text Generation • 2B • Updated Dec 25, 2025 • 5
toksuite/meta-llama-Llama-3.2-1B • Text Generation • 2B • Updated Dec 25, 2025 • 38
toksuite/microsoft-Phi-3-mini-4k-instruct • Text Generation • 1B • Updated Dec 25, 2025 • 1
toksuite/gpt2 • Text Generation • 1B • Updated Dec 25, 2025 • 88
toksuite/bigscience-bloom • Text Generation • 2B • Updated Dec 25, 2025 • 106
MustafaSeker/sugartrq-tokenizer-tr • Updated Sep 2, 2025
toksuite/mistralai-tekken • Text Generation • 2B • Updated Dec 25, 2025 • 16
toksuite/Qwen-Qwen3-8B • Text Generation • 2B • Updated Dec 25, 2025 • 79