SceneAligner: 3D-Grounded Floorplan Localization in the Wild Paper • 2605.22581 • Published 2 days ago • 3
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published 10 days ago • 48
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning Paper • 2605.00425 • Published 15 days ago • 23
MNAFT: modality neuron-aware fine-tuning of multimodal large language models for image translation Paper • 2604.16943 • Published Apr 18 • 2
Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models Paper • 2604.00375 • Published Apr 1 • 5
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 629
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published Apr 8 • 187
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published Apr 9 • 101
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization Paper • 2604.05963 • Published Apr 7 • 8