Predict human preference to LLM responses.
Binfeng Xu
billxbf
AI & ML interests
evolving back to apes
Recent Activity
upvoted a paper 3 days ago
Polar: Agentic RL on Any Harness at Scale updated a model about 1 month ago
billxbf/qwen3.5-4b-pi-polar