SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 1 day ago • 15
prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX Image-Text-to-Text • 4B • Updated 25 days ago • 1.47k • 17
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published 15 days ago • 139
The Ultra-Scale Playbook 🌌 Running • 3.74k • The ultimate guide to training LLM on large GPU Clusters
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 26 days ago • 30
GGML and llama.cpp join HF to ensure the long-term progress of Local AI Article • 27 days ago • 487
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 219
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17, 2025 • 50
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17, 2025 • 51
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 145
Lynx: Towards High-Fidelity Personalized Video Generation Paper • 2509.15496 • Published Sep 19, 2025 • 13
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper • 2506.23552 • Published Jun 30, 2025 • 10
Chain-of-Zoom 🚀 Running on Zero • MCP • Featured • 323 • Extreme Super-Resolution via Scale Autoregression
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published Jun 9, 2025 • 27