Xiangyuan Xue
xxyQwQ
4 followers · 1 following
https://xxyqwq.cn/
AI & ML interests
LLM-Based Agents, Multi-Agent Systems, Reinforcement Learning
Recent Activity
updated a collection 2 days ago
StraTA (Miscellaneous)
updated a model 2 days ago
xxyQwQ/train-ppo-sciworld-text-qwen2.5-7b
published a model 2 days ago
xxyQwQ/train-ppo-sciworld-text-qwen2.5-7b
xxyQwQ's activity
updated a collection 2 days ago
StraTA (Miscellaneous)
Collection • 14 items • Updated 2 days ago
updated a model 2 days ago
xxyQwQ/train-ppo-sciworld-text-qwen2.5-7b
8B • Updated 2 days ago • 12
published a model 2 days ago
xxyQwQ/train-ppo-sciworld-text-qwen2.5-7b
8B • Updated 2 days ago • 12
updated a model 6 days ago
xxyQwQ/train-grpo-sciworld-text-qwen2.5-7b
8B • Updated 6 days ago • 15
published a model 6 days ago
xxyQwQ/train-grpo-sciworld-text-qwen2.5-7b
8B • Updated 6 days ago • 15
updated a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-ultimate-version
3B • Updated 11 days ago • 9
published a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-ultimate-version
3B • Updated 11 days ago • 9
updated a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-diverse-version
3B • Updated 11 days ago • 14
published a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-diverse-version
3B • Updated 11 days ago • 14
updated a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-judgment-version
3B • Updated 11 days ago • 18
published a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-judgment-version
3B • Updated 11 days ago • 18
updated a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-vanilla-version
3B • Updated 11 days ago • 17
published a model 11 days ago
xxyQwQ/train-strata-webshop-text-qwen2.5-3b-vanilla-version
3B • Updated 11 days ago • 17
updated 2 models 12 days ago
xxyQwQ/train-strata-alfworld-text-qwen2.5-3b-diverse-version
3B • Updated 12 days ago • 16
xxyQwQ/train-strata-alfworld-text-qwen2.5-3b-judgment-version
3B • Updated 12 days ago • 16