VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion Paper • 2605.30351 • Published 7 days ago • 25
view article Article Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action nvidia • 3 days ago • 62
view article Article MONET: Lowering the bar for World-Class Image Generation research. jasperai • 7 days ago • 10
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 6 days ago • 73
Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion Paper • 2605.26266 • Published 10 days ago • 1
One Pass Is Not Enough: Recursive Latent Refinement for Generative Models Paper • 2605.15309 • Published 21 days ago • 1
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 17 days ago • 112
SANA-WM Collection SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer • 2 items • Updated 17 days ago • 4
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 21 days ago • 85
WorldJen: An End-to-End Multi-Dimensional Benchmark for Generative Video Models Paper • 2605.03475 • Published about 1 month ago • 8
view article Article Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts NucleusAI • Apr 14 • 11
view article Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 52
view article Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 YiYiXu, OzzyGT, dn6, sayakpaul • Mar 5 • 51
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models Paper • 2603.18742 • Published Mar 19 • 11
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA Paper • 2603.10256 • Published Mar 10 • 23