
Jonathan H. Parker

reply-guy

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a dataset about 10 hours ago

www0wwwjs1/SMDPlus

Updated about 9 hours ago • 1

liked a dataset 4 days ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 32.2k • 745

liked a dataset 7 days ago

SpartanEngineer24798/orthogonal_data

Viewer • Updated 6 days ago • 310k • 18.7k • 2

liked a model 12 days ago

Haku-2004/act_so101_test_c

Robotics • Updated 11 days ago • 36 • 1

liked a dataset 17 days ago

wegrthj/ubp66j-fcwh-data

Viewer • Updated 16 days ago • 720k • 4.29k • 2

upvoted 2 papers 25 days ago

Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models

Paper • 2604.21193 • Published 26 days ago • 2

Where does output diversity collapse in post-training?

Paper • 2604.16027 • Published Apr 17 • 22

upvoted 2 papers about 1 month ago

Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving

Paper • 2604.01483 • Published Apr 1 • 7

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628

liked a dataset about 1 month ago

Open-Orca/OpenOrca

Viewer • Updated Feb 19, 2025 • 2.94M • 50k • 1.54k

upvoted a paper about 1 month ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

liked a model about 2 months ago

Qwen/Qwen3-1.7B

Text Generation • 2B • Updated Jul 26, 2025 • 3.55M • 471

upvoted 2 papers 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

liked a model 2 months ago

ZJU-AI4H/Hulu-Med-4B

Image-Text-to-Text • 5B • Updated Nov 27, 2025 • 23.7k • 50

liked 2 models 3 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 213k • 1.11k

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated Mar 10 • 783k • 1.48k

upvoted 3 papers 3 months ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220
