XiongShao Pan's picture

5

XiongShao Pan

PanAndy

·

PanAndy

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

upvoted a paper 23 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

View all activity

Organizations

None yet

upvoted 2 papers 23 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 273

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 24 days ago • 93

upvoted 2 papers 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15 • 57

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

Paper • 2510.11345 • Published Oct 13 • 15

upvoted a paper 7 months ago

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Paper • 2506.06122 • Published Jun 6 • 7