Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 24 items • Updated 4 days ago • 93
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10, 2025 • 215
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 27 days ago • 679
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models Paper • 2407.11062 • Published Jul 10, 2024 • 10