Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
UCLA-AGI 's Collections
zephyr-7b-sft-full-SPIN
datasets-SPIN
SPIN-Diffusion
SPPO

SPPO

updated Jun 29, 2024

Self-Play Preference Optimization

Upvote
13

  • UCLA-AGI/Mistral7B-PairRM-SPPO

    Text Generation • 7B • Updated May 7, 2024 • 142 • 6

  • UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1

    Text Generation • 7B • Updated May 6, 2024 • 14 • 2

  • UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2

    Text Generation • 7B • Updated May 6, 2024 • 8.27k • • 1

  • UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3

    Text Generation • 7B • Updated May 7, 2024 • 8.28k • • 5

  • UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1

    Text Generation • 8B • Updated Jun 25, 2024 • 10 • • 1

  • UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2

    Text Generation • Updated Jun 25, 2024 • 8.33k •

  • UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3

    Text Generation • 8B • Updated Jun 28, 2024 • 8.36k • • 83

  • UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3

    Text Generation • 9B • Updated Jul 1, 2024 • 2.21k • • 127

  • UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2

    Text Generation • 9B • Updated Jul 1, 2024 • 1.09k • • 4

  • UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1

    Text Generation • 9B • Updated Jul 1, 2024 • 1.09k • • 4
Upvote
13
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs