TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Paper • 2510.06217 • Published Oct 7, 2025
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning Paper • 2312.09039 • Published Dec 14, 2023
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published 1 day ago
Open-AgentRL Collection RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios • 12 items • Updated about 24 hours ago