Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 21 days ago • 63
Quantile Advantage Estimation for Entropy-Safe Reasoning Paper • 2509.22611 • Published Sep 26, 2025 • 118
We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning Paper • 2508.10433 • Published Aug 14, 2025 • 144
CoRT: Code-integrated Reasoning within Thinking Paper • 2506.09820 • Published Jun 11, 2025 • 18 • 2
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning Paper • 2505.16410 • Published May 22, 2025 • 58
Enabling Scalable Oversight via Self-Evolving Critic Paper • 2501.05727 • Published Jan 10, 2025 • 72
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 99
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 11 items • Updated 5 days ago • 88
Qwen2-Math Collection Math-specific model series based on Qwen2 • 8 items • Updated 5 days ago • 52