Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Atotti 's Collections
Japanese Speech LLM Training Datasets
ALM Audio Encoders

ALM Audio Encoders

updated Dec 6, 2025

I'm currently in the process of preparing the inference code.

Upvote
3

  • Atotti/Google-USM

    Feature Extraction • 0.7B • Updated Aug 12, 2025 • 191k • • 20

  • Atotti/Qwen3-Omni-AudioTransformer

    0.6B • Updated Oct 4, 2025 • 4.5k • 33

  • Atotti/google-usm-bf16

    Feature Extraction • 0.7B • Updated Jul 9, 2025 • 1 • 1

  • Atotti/Qwen3-Omni-Captioner-AudioTransformer

    0.6B • Updated Dec 6, 2025 • 44

  • Atotti/AFClap

    Feature Extraction • Updated Dec 6, 2025

  • Atotti/qwen2-audio-encoder

    Feature Extraction • 0.6B • Updated Oct 18, 2025 • 57 • 2

  • Atotti/Kimi-Audio-Whisper-Encoder

    Feature Extraction • Updated Dec 6, 2025 • 52

  • Atotti/AFWhisper

    Feature Extraction • 0.6B • Updated Dec 6, 2025 • 5 • 1
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs