Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Hiring 💼
1218
146
113
Quentin Gallouédec
PRO
qgallouedec
Follow
maharshpatelx's profile picture
JaumePrats's profile picture
jaguuai's profile picture
578 followers
·
335 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Rethinking the Trust Region in LLM Reinforcement Learning
upvoted
a
paper
2 days ago
Defeating the Training-Inference Mismatch via FP16
upvoted
a
paper
2 days ago
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR
View all activity
Organizations
qgallouedec
's models
784
Sort: Recently updated
qgallouedec/ddpg-CartpoleDMC-v0
Reinforcement Learning
•
Updated
Jan 16, 2023
•
4
qgallouedec/ddpg-BallInCupDMC-v0
Reinforcement Learning
•
Updated
Jan 16, 2023
•
2
qgallouedec/ddpg-AcrobotSwingupSparseDMC-v0
Reinforcement Learning
•
Updated
Jan 16, 2023
•
2
qgallouedec/ddpg-AcrobotSwingupDMC-v0
Reinforcement Learning
•
Updated
Jan 16, 2023
•
2
Previous
1
...
25
26
27
Next