InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published 1 day ago • 24
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper • 2603.09652 • Published 1 day ago • 7
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning Paper • 2603.08655 • Published 2 days ago • 3
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 5 days ago • 95
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 6 days ago • 25
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling Paper • 2603.06199 • Published 6 days ago • 9
UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data Paper • 2603.05312 • Published 6 days ago • 7
RealWonder: Real-Time Physical Action-Conditioned Video Generation Paper • 2603.05449 • Published 6 days ago • 11
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 7 days ago • 16
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration Paper • 2603.03823 • Published 8 days ago • 4
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions Paper • 2603.03447 • Published 8 days ago • 31