UniScene: Unified Occupancy-centric Driving Scene Generation Paper • 2412.05435 • Published Dec 6, 2024
MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning Paper • 2509.07021 • Published Sep 7, 2025
MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction Paper • 2503.10604 • Published Mar 13, 2025
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning Paper • 2604.24300 • Published 6 days ago • 64
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 27 days ago • 203
Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models Paper • 2603.22782 • Published Mar 24, 2026 • 20
GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models Paper • 2603.23246 • Published Mar 24, 2026 • 8
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models Paper • 2603.22212 • Published Mar 23, 2026 • 126
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published Mar 26, 2026 • 155
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM Paper • 2603.23386 • Published Mar 24, 2026 • 40
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published Dec 15, 2025 • 76
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published Dec 2, 2025 • 65