Princeton-AI

community

https://yangling0818.github.io/

LingYang_PU

Gen-Verse

AI & ML interests

LLM, Diffusion, and Beyond

Recent Activity

jiaruz2 authored a paper about 15 hours ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

jiaruz2 authored a paper about 15 hours ago

TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning

Lingaaaaaaa submitted a paper about 20 hours ago

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

View all activity

Papers

Latent Collaboration in Multi-Agent Systems

View all Papers

jiaruz2

authored 2 papers about 15 hours ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published Oct 7, 2025 • 65

TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning

Paper • 2312.09039 • Published Dec 14, 2023

Lingaaaaaaa

submitted a paper to Daily Papers about 20 hours ago

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Paper • 2602.02488 • Published 1 day ago • 26

yinjiewang

updated 6 models about 21 hours ago

Gen-Verse/RLAnything-Coder-7B

8B • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-UT-14B

841k • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-OS-8B

770k • Updated about 21 hours ago • 3 • 3

Gen-Verse/RLAnything-OS-Reward-8B

770k • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-Alf-Reward-14B

15B • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-Alf-7B

8B • Updated about 21 hours ago • 3 • 2

Lingaaaaaaa

published 6 models about 22 hours ago

Gen-Verse/RLAnything-Coder-7B

8B • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-UT-14B

841k • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-OS-8B

770k • Updated about 21 hours ago • 3 • 3

Gen-Verse/RLAnything-OS-Reward-8B

770k • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-Alf-Reward-14B

15B • Updated about 21 hours ago • 3 • 2

Gen-Verse/RLAnything-Alf-7B

8B • Updated about 21 hours ago • 3 • 2

yinjiewang

updated 3 models 2 days ago

Gen-Verse/TraDo-8B-Thinking

8B • Updated 2 days ago • 160 • 13

Gen-Verse/TraDo-4B-Instruct

4B • Updated 2 days ago • 953 • 10

Gen-Verse/TraDo-8B-Instruct

8B • Updated 2 days ago • 207 • 13

yinjiewang

updated a dataset 2 days ago

Gen-Verse/LiveCodeBench-ReasonFlux

Preview • Updated 2 days ago • 39 • 1

yinjiewang

updated a collection 2 days ago

Open-AgentRL

RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios • 12 items • Updated about 24 hours ago • 5