Quentin Gallouédec's picture

Hiring 💼

Quentin Gallouédec PRO

qgallouedec

huggingface

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Rethinking the Trust Region in LLM Reinforcement Learning

upvoted a paper 2 days ago

Defeating the Training-Inference Mismatch via FP16

upvoted a paper 2 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

View all activity

Organizations

qgallouedec 's models 784

qgallouedec/ddpg-CartpoleDMC-v0

Reinforcement Learning • Updated Jan 16, 2023 • 4

qgallouedec/ddpg-BallInCupDMC-v0

Reinforcement Learning • Updated Jan 16, 2023 • 2

qgallouedec/ddpg-AcrobotSwingupSparseDMC-v0

Reinforcement Learning • Updated Jan 16, 2023 • 2

qgallouedec/ddpg-AcrobotSwingupDMC-v0

Reinforcement Learning • Updated Jan 16, 2023 • 2