3 16 2

ChengpengLi

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper about 2 months ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

upvoted a paper 4 months ago

Qwen3-VL Technical Report

upvoted a paper 4 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers

Paper • 2602.06079 • Published Feb 4 • 18

upvoted 2 papers 4 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 161

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

upvoted 2 papers 6 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 107

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 120

upvoted 2 papers 8 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 146

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 160

commented a paper 10 months ago

CoRT: Code-integrated Reasoning within Thinking

Paper • 2506.09820 • Published Jun 11, 2025 • 18 •

upvoted a paper 11 months ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22, 2025 • 58

authored a paper about 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

commented a paper about 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113 •

upvoted a paper about 1 year ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

published a model about 1 year ago

ChengpengLi/START

Updated Feb 21, 2025

upvoted 2 papers about 1 year ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10, 2025 • 72

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 100

upvoted a paper over 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 377

liked a Space over 1 year ago

Qwen2.5 Math Demo

🧮

242

Answer math questions from uploaded images or sketches

upvoted 2 collections over 1 year ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 10 items • Updated Mar 2 • 91

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 7 items • Updated Mar 2 • 52

liked a model over 1 year ago

Qwen/Qwen2-Math-72B

Text Generation • 73B • Updated Aug 8, 2024 • 31 • 30

ChengpengLi

AI & ML interests

Recent Activity

Organizations

ChengpengLi's activity

Qwen2.5 Math Demo