Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
UCLA-AGI
's Collections
zephyr-7b-sft-full-SPIN
datasets-SPIN
SPIN-Diffusion
SPPO
SPPO
updated
Jun 29, 2024
Self-Play Preference Optimization
Upvote
13
+3
UCLA-AGI/Mistral7B-PairRM-SPPO
Text Generation
•
7B
•
Updated
May 7, 2024
•
142
•
6
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1
Text Generation
•
7B
•
Updated
May 6, 2024
•
14
•
2
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2
Text Generation
•
7B
•
Updated
May 6, 2024
•
8.27k
•
•
1
UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3
Text Generation
•
7B
•
Updated
May 7, 2024
•
8.28k
•
•
5
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1
Text Generation
•
8B
•
Updated
Jun 25, 2024
•
10
•
•
1
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2
Text Generation
•
Updated
Jun 25, 2024
•
8.33k
•
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
•
8B
•
Updated
Jun 28, 2024
•
8.36k
•
•
83
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3
Text Generation
•
9B
•
Updated
Jul 1, 2024
•
2.21k
•
•
127
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2
Text Generation
•
9B
•
Updated
Jul 1, 2024
•
1.09k
•
•
4
UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1
Text Generation
•
9B
•
Updated
Jul 1, 2024
•
1.09k
•
•
4
Upvote
13
+9
Share collection
View history
Collection guide
Browse collections