mmhamdy (Mohammed Hamdy)

upvoted an article 6 months ago

Continuous batching from first principles

Article • ror, ArthurZ, mcpotato +1 • Nov 25, 2025 • 389

upvoted 2 articles 7 months ago

Promoter-GPT: Writing DNA Instructions with Language Models

Article • hugging-science • Oct 22, 2025 • 25

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Article • nvidia • Oct 20, 2025 • 19

upvoted an article 10 months ago

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Article • davanstrien • Jul 8, 2025 • 35

upvoted an article 12 months ago

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Article • nvidia • Jun 4, 2025 • 23

upvoted a paper 12 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 159

upvoted an article 12 months ago

The 4 Things Qwen-3’s Chat Template Teaches Us

Article • cfahlgren1 • Apr 30, 2025 • 88

upvoted a paper 12 months ago

Text Generation Beyond Discrete Token Sampling

Paper • 2505.14827 • Published May 20, 2025 • 10

upvoted an article about 1 year ago

Tiny Agents: an MCP-powered agent in 50 lines of code

Article • julien-c • Apr 25, 2025 • 308

upvoted 2 papers about 1 year ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 207

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124

upvoted an article about 1 year ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Article • saurabhdash, olivernan, ArashAhmadian, johndang-cohere +2 • Mar 4, 2025 • 78

upvoted a collection about 1 year ago

Cohere Labs Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Jul 31, 2025 • 74

upvoted an article about 1 year ago

Common AI Model Formats

Article • ngxson • Feb 27, 2025 • 72

upvoted a collection about 1 year ago

CHASE

Collection

Generate challenging synthetic data to evaluate LLMs • 4 items • Updated Mar 2 • 4

upvoted a paper about 1 year ago

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20, 2025 • 18

upvoted a collection about 1 year ago

Reasoning Datasets

Collection

50 items • Updated Jun 8, 2025 • 11

upvoted 2 papers about 1 year ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 48

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Paper • 2502.13791 • Published Feb 19, 2025 • 6

upvoted a paper over 1 year ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 125
