Yuxin Zuo

yuxinzuo

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers about 22 hours ago

Attention Residuals

Paper • 2603.15031 • Published 2 days ago • 66

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published 3 days ago • 220

upvoted a paper 8 days ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published 9 days ago • 53

upvoted a paper 14 days ago

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published 18 days ago • 50

upvoted a paper 15 days ago

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published 21 days ago • 12

upvoted a collection 30 days ago

Qwen3.5

21 items • Updated 9 days ago • 1.22k

upvoted 6 papers about 1 month ago

Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments

Paper • 2602.11964 • Published Feb 12 • 12

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published Feb 10 • 36

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 58

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 201

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published Feb 5 • 27

upvoted a paper about 2 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

upvoted 2 papers 3 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 161

upvoted 2 papers 4 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

upvoted 3 papers 5 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 119

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 69