Great paper
updated
Paper
• 2410.05258
• Published
• 180
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper
• 2412.03555
• Published
• 133
VisionZip: Longer is Better but Not Necessary in Vision Language Models
Paper
• 2412.04467
• Published
• 117
o1-Coder: an o1 Replication for Coding
Paper
• 2412.00154
• Published
• 44
SNOOPI: Supercharged One-step Diffusion Distillation with Proper
Guidance
Paper
• 2412.02687
• Published
• 113
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any
Point in Long Video
Paper
• 2411.18671
• Published
• 20
Fully Open Source Moxin-7B Technical Report
Paper
• 2412.06845
• Published
• 11
Small Language Models: Survey, Measurements, and Insights
Paper
• 2409.15790
• Published
• 2
Paper
• 2407.10671
• Published
• 168
Paper
• 2412.08905
• Published
• 122
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Paper
• 2412.10360
• Published
• 147
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published
• 108
Paper
• 2412.15115
• Published
• 377
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper
• 2501.05366
• Published
• 102
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Paper
• 2501.04519
• Published
• 288
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
• 2501.08313
• Published
• 300
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with
Large Language Models
Paper
• 2501.09686
• Published
• 41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
• 2501.12948
• Published
• 441
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Paper
• 2501.17161
• Published
• 124
Baichuan-Omni-1.5 Technical Report
Paper
• 2501.15368
• Published
• 60
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human
Animation Models
Paper
• 2502.01061
• Published
• 223
The Differences Between Direct Alignment Algorithms are a Blur
Paper
• 2502.01237
• Published
• 113
Hermes 3 Technical Report
Paper
• 2408.11857
• Published
• 56
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence
Generation up to 100K Tokens
Paper
• 2502.18890
• Published
• 30
SemViQA: A Semantic Question Answering System for Vietnamese Information
Fact-Checking
Paper
• 2503.00955
• Published
• 28
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Paper
• 2504.10479
• Published
• 306
Tina: Tiny Reasoning Models via LoRA
Paper
• 2504.15777
• Published
• 56
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper
• 2505.03335
• Published
• 189