SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 15 days ago • 60
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Paper • 2601.14243 • Published 28 days ago • 22
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision Paper • 2601.19798 • Published 21 days ago • 42
view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 29 days ago • 39
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 27 days ago • 72
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published Jan 11 • 212
EnvScaler Collection The official datasets and models of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis" • 8 items • Updated Jan 13 • 3
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 288
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 16 days ago • 87
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published Jan 5 • 26
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published Dec 26, 2025 • 25
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 251