facebook/wav2vec2-base-10k-voxpopuli-ft-hu
Automatic Speech Recognition
Inference-time Physics Alignment of Video Generative Models with Latent World Models
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice