Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted a paper about 8 hours ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

upvoted an article about 8 hours ago

ML Intern Takes Our Post-Training Internship Test

published an article about 8 hours ago

ML Intern Takes Our Post-Training Internship Test

View all activity

Organizations

lewtun 's models 293

lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-2

Updated Aug 27, 2024

lewtun/qwen2-7B-ultrafeedback-online-dpo

Updated Aug 27, 2024

lewtun/pythia-1b-deduped-tldr-online-dpo

1B • Updated Aug 27, 2024 • 2

lewtun/pythia-1b-tldr-online-dpo

Updated Aug 27, 2024

lewtun/qwen2-0.5B-lr-5e-7

Updated Aug 27, 2024

lewtun/qwen2-7B-lr-3e-6

Updated Aug 26, 2024

lewtun/qwen2-7B-lr-3e-6-tok-1024

Updated Aug 26, 2024

lewtun/qwen2-0.5B-lr-3e-6-tok-1024

Updated Aug 26, 2024

lewtun/qwen2-1.5B-lr-3e-6-tok-1024

Updated Aug 26, 2024

lewtun/qwen2-1.5B-lr-3e-6-tok-2048

Updated Aug 25, 2024

lewtun/qwen2-0.5B-lr-3e-6-tok-2048

Updated Aug 25, 2024

lewtun/qwen2-1.5B-lr-3e-6

2B • Updated Aug 25, 2024 • 2

lewtun/qwen2-0.5B-lr-3e-6

0.5B • Updated Aug 25, 2024

lewtun/qwen2-0.5B

0.5B • Updated Aug 24, 2024 • 1

lewtun/qwen2-1.5B

2B • Updated Aug 24, 2024 • 1

lewtun/smollm-1.7B

Updated Aug 23, 2024

lewtun/smollm-360m

Updated Aug 23, 2024

lewtun/smollm-135m

Updated Aug 23, 2024

lewtun/qwen2

Updated Aug 22, 2024

lewtun/EleutherAI_pythia-1b

1B • Updated Aug 21, 2024 • 1

lewtun/gkd-model

Updated Aug 20, 2024

lewtun/kto-aligned-model-lora

Updated Mar 27, 2024 • 5

lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.05

Text Generation • 9B • Updated Mar 1, 2024 • 5

lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.4

Text Generation • 9B • Updated Mar 1, 2024 • 3

lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.01

Text Generation • 9B • Updated Mar 1, 2024 • 2

lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.2

Text Generation • 9B • Updated Mar 1, 2024 • 3

lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.1

Text Generation • 9B • Updated Mar 1, 2024 • 5

lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-3

Text Generation • 9B • Updated Mar 1, 2024 • 5 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-2

Text Generation • 9B • Updated Mar 1, 2024 • 6

lewtun/gemma-7b-sft-full-openhermes-v0

Text Generation • 9B • Updated Mar 1, 2024 • 2