Yulun Jiang

yljblues

3 7

https://yljblues.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

Connect the Dots: Training LLMs for Long-Lifecycle Agents with Cross-Domain Generalization Via Reinforcement Learning

upvoted a paper about 1 month ago

Self-Distilled RLVR

upvoted a paper 3 months ago

Reinforcement Learning for Self-Improving Agent with Skill Library

View all activity

Organizations

upvoted a paper about 12 hours ago

Connect the Dots: Training LLMs for Long-Lifecycle Agents with Cross-Domain Generalization Via Reinforcement Learning

Paper • 2606.20002 • Published 14 days ago • 9

upvoted a paper about 1 month ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

upvoted a paper 3 months ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 42

commented a paper 6 months ago

Meta-RL Induces Exploration in Language Agents

Paper • 2512.16848 • Published Dec 18, 2025 • 12 •

authored a paper 6 months ago

Meta-RL Induces Exploration in Language Agents

Paper • 2512.16848 • Published Dec 18, 2025 • 12

upvoted a paper 6 months ago

Meta-RL Induces Exploration in Language Agents

Paper • 2512.16848 • Published Dec 18, 2025 • 12

submitted a paper to Daily Papers 6 months ago

Meta-RL Induces Exploration in Language Agents

Paper • 2512.16848 • Published Dec 18, 2025 • 12

upvoted a paper 7 months ago

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published Sep 25, 2025 • 92

updated a dataset 9 months ago

mrble/MARBLE

Viewer • Updated Sep 23, 2025 • 3.22k • 135 • 2

upvoted a paper about 1 year ago

Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

Paper • 2506.11474 • Published Jun 13, 2025 • 18

New activity in mrble/MARBLE about 1 year ago

Update dataset card: task category and license

#2 opened about 1 year ago by

nielsr

commented a paper about 1 year ago

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28, 2025 • 12 •

authored a paper about 1 year ago

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28, 2025 • 12

commented a paper about 1 year ago

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28, 2025 • 12 •

upvoted a paper about 1 year ago

MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning

Paper • 2506.22992 • Published Jun 28, 2025 • 12

published a dataset about 1 year ago

mrble/MARBLE

Viewer • Updated Sep 23, 2025 • 3.22k • 135 • 2

Yulun Jiang

AI & ML interests

Recent Activity

Organizations

yljblues's activity

Update dataset card: task category and license