In a Training Loop 🔄
lewtun
·
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
Organizations
lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-2
Updated
lewtun/qwen2-7B-ultrafeedback-online-dpo
Updated
lewtun/pythia-1b-deduped-tldr-online-dpo
1B • Updated • 2
lewtun/pythia-1b-tldr-online-dpo
Updated
lewtun/qwen2-0.5B-lr-5e-7
Updated
lewtun/qwen2-7B-lr-3e-6-tok-1024
Updated
lewtun/qwen2-0.5B-lr-3e-6-tok-1024
Updated
lewtun/qwen2-1.5B-lr-3e-6-tok-1024
Updated
lewtun/qwen2-1.5B-lr-3e-6-tok-2048
Updated
lewtun/qwen2-0.5B-lr-3e-6-tok-2048
Updated
lewtun/qwen2-1.5B-lr-3e-6
2B • Updated • 2
lewtun/qwen2-0.5B-lr-3e-6
0.5B • Updated 0.5B • Updated • 1
2B • Updated • 1
lewtun/EleutherAI_pythia-1b
1B • Updated • 1
lewtun/kto-aligned-model-lora
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.05
Text Generation
• 9B • Updated • 5
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.4
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.01
Text Generation
• 9B • Updated • 2
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.2
Text Generation
• 9B • Updated • 3
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.1
Text Generation
• 9B • Updated • 5
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-3
Text Generation
• 9B • Updated • 5
• 1
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-2
Text Generation
• 9B • Updated • 6
lewtun/gemma-7b-sft-full-openhermes-v0
Text Generation
• 9B • Updated • 2