SafePred: A Predictive Guardrail for Computer-Using Agents via World Models Paper • 2602.01725 • Published Feb 2, 2026 • 1
From Data to Behavior: Predicting Unintended Model Behaviors Before Training Paper • 2602.04735 • Published Feb 4, 2026 • 15
VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration Paper • 2601.22674 • Published Jan 30, 2026 • 5
Learning to Extract Rational Evidence via Reinforcement Learning for Retrieval-Augmented Generation Paper • 2507.15586 • Published Jul 21, 2025 • 2
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey Paper • 2507.20783 • Published Jul 28, 2025 • 1
Aligning Agentic World Models via Knowledgeable Experience Learning Paper • 2601.13247 • Published Jan 19, 2026 • 15
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published Jan 9, 2026 • 20
LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator Paper • 2512.10605 • Published Dec 11, 2025 • 7
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control Paper • 2506.01943 • Published Jun 2, 2025 • 25
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Paper • 2503.21696 • Published Mar 27, 2025 • 23
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published Mar 14, 2025 • 148
CLIP-AD: A Language-Guided Staged Dual-Path Model for Zero-shot Anomaly Detection Paper • 2311.00453 • Published Nov 1, 2023
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models Paper • 2410.16236 • Published Oct 21, 2024
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network Paper • 2411.15941 • Published Nov 24, 2024 • 2
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection Paper • 2404.06564 • Published Apr 9, 2024