Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs Paper • 2603.22446 • Published 22 days ago • 9
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 25 days ago • 335
AION-1: Omnimodal Foundation Model for Astronomical Sciences Paper • 2510.17960 • Published Oct 20, 2025 • 30
Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Paper • 2504.19056 • Published Apr 27, 2025 • 18
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29, 2025 • 98
A 5-Point Minimal Solver for Event Camera Relative Motion Estimation Paper • 2309.17054 • Published Sep 29, 2023 • 2
AI-University: An LLM-based platform for instructional alignment to scientific classrooms Paper • 2504.08846 • Published Apr 11, 2025 • 9
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 188
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 247
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15, 2025 • 122
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper • 2501.02045 • Published Jan 3, 2025 • 22