How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning
Paper • 2605.27310 • Published • 18
Communicating about Space: Language-Mediated Spatial Integration Across Partial Views