Beyond End-to-End Video Models: An LLM-Based Multi-Agent System for Educational Video Generation Paper • 2602.11790 • Published Feb 12
Running on Zero Agents 801 IndexTTS 2 Demo 🏢 801 Generate expressive speech from text and voice prompts