Model checkpoints accompanying the paper "Scaling Behavior of Discrete Diffusion Language Models" (https://arxiv.org/abs/2512.10858).
Dimitri von Rütte
dvruette
AI & ML interests
None yet
Recent Activity
updated
a collection
about 4 hours ago
OpenWebText BPE
updated
a collection
about 4 hours ago
OpenWebText BPE
updated
a collection
about 4 hours ago
OpenWebText BPE
Organizations
models
39
dvruette/openwebtext-bpe-1k
Updated
dvruette/openwebtext-bpe-16k
Updated
dvruette/openwebtext-bpe-131k
Updated
dvruette/openwebtext-bpe-8k
Updated
dvruette/openwebtext-bpe-4k
Updated
dvruette/openwebtext-bpe-33k
Updated
dvruette/openwebtext-bpe-66k
Updated
dvruette/openwebtext-bpe-2k
Updated
dvruette/gidd-unif-3b-orbax
Updated
dvruette/gidd-unif-3b
Text Generation
•
3B
•
Updated
•
113
datasets
14
dvruette/openwebtext-tokenized-131k
Viewer
•
Updated
•
8.01M
•
3
dvruette/openwebtext-tokenized-66k
Viewer
•
Updated
•
8.01M
•
6
dvruette/openwebtext-tokenized-33k
Viewer
•
Updated
•
8.01M
•
9
dvruette/openwebtext-tokenized-16k
Viewer
•
Updated
•
8.01M
•
3
dvruette/openwebtext-tokenized-8k
Viewer
•
Updated
•
8.01M
•
2
dvruette/openwebtext-tokenized-4k
Viewer
•
Updated
•
8.01M
•
2
dvruette/openwebtext-tokenized-2k
Viewer
•
Updated
•
8.01M
•
2
dvruette/openwebtext-tokenized-1k
Viewer
•
Updated
•
8.01M
•
2
dvruette/openwebtext
Viewer
•
Updated
•
8.01M
•
7
dvruette/gidd-nemotron-cc-pretok
Preview
•
Updated
•
3