64.4 TFLOPS
Jarrod Barnes
PRO
Jarrodbarnes
5 followers · 48 following
https://arc.computer
jarrodbarnes
jbarnes850
jarrodbarnes
AI & ML interests
Continual Learning, Reinforcement Learning
Recent Activity
liked a dataset about 13 hours ago: metr-evals/malt-transcripts-public
upvoted an article 1 day ago: Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL
published an article 1 day ago: Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL
Organizations
Articles
1
Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL
Papers
1
arxiv:2511.01093
spaces
2
Running • RL OpenSec-Env 🚀
Sleeping • Trackio 🚀 • Display tracking information
models
4
Jarrodbarnes/opensec-gdpo-4b • Text Generation • 4B • Updated 1 day ago • 21
Jarrodbarnes/Qwen3-4B-tau2-grpo-v1 • Text Generation • 4B • Updated 9 days ago • 67
Jarrodbarnes/Qwen3-4B-tau2-sft1 • 4B • Updated 9 days ago • 28
Jarrodbarnes/Cortex-1-mini • Text Generation • Updated Mar 13, 2025 • 5 • 2
datasets
6
Jarrodbarnes/osworld-reasoning-sft-v1 • Preview • Updated 10 days ago • 28
Jarrodbarnes/osworld-train-v1 • Viewer • Updated 11 days ago • 66 • 17
Jarrodbarnes/tau2-sft-seed-v3 • Updated Dec 19, 2025 • 16
Jarrodbarnes/tau2-sft-final • Updated Dec 15, 2025 • 44
Jarrodbarnes/tau2-sft-v4-dataset • Viewer • Updated Nov 29, 2025 • 219 • 78
Jarrodbarnes/cortex-1-market-analysis • Viewer • Updated Mar 9, 2025 • 521 • 65 • 2