This collection includes KnowRL-Nemotron-1.5B, train data, test data from the KnowRL project.
Linhao Yu
HasuerYu
AI & ML interests
None yet
Recent Activity
commentedon a paper 3 days ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance upvoted a paper 11 days ago
Co-Evolving Policy Distillation liked a model 22 days ago
HasuerYu/KnowRL-Nemotron-1.5BOrganizations
None yet