15 36 35

Daixuan Cheng

daixuancheng

https://cdxeve.github.io

DaixuanC45443

AI & ML interests

I study LLMs, from Pre-Training to Agent.

Recent Activity

liked a dataset about 12 hours ago

RUC-AIBOX/ClawGym-Bench

liked a dataset about 18 hours ago

RUC-AIBOX/ClawGym-Trajectory

liked a dataset 1 day ago

RUC-AIBOX/ClawGym-Task

View all activity

Organizations

None yet

liked a dataset about 12 hours ago

RUC-AIBOX/ClawGym-Bench

Viewer • Updated 1 day ago • 200 • 26 • 1

liked a dataset about 18 hours ago

RUC-AIBOX/ClawGym-Trajectory

Viewer • Updated 1 day ago • 24.5k • 8 • 1

liked a dataset 1 day ago

RUC-AIBOX/ClawGym-Task

Preview • Updated about 6 hours ago • 15 • 1

authored a paper 16 days ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

Paper • 2508.07534 • Published Aug 11, 2025 • 2

authored a paper 17 days ago

ClawGym: A Scalable Framework for Building Effective Claw Agents

Paper • 2604.26904 • Published 18 days ago • 50

commented a paper 17 days ago

ClawGym: A Scalable Framework for Building Effective Claw Agents

Paper • 2604.26904 • Published 18 days ago • 50 •

upvoted a paper 17 days ago

ClawGym: A Scalable Framework for Building Effective Claw Agents

Paper • 2604.26904 • Published 18 days ago • 50

updated a model 18 days ago

daixuancheng/4-26-debugAdv_docker_rbs8_temp0.7_model-30BA3B-Sft_step99

31B • Updated 18 days ago • 15

published a model 18 days ago

daixuancheng/4-26-debugAdv_docker_rbs8_temp0.7_model-30BA3B-Sft_step99

31B • Updated 18 days ago • 15

updated a model 19 days ago

daixuancheng/4-26-debugAdv_docker_rbs8_temp0.7_model-30BA3B-Sft_step39

31B • Updated 19 days ago • 18

published a model 19 days ago

daixuancheng/4-26-debugAdv_docker_rbs8_temp0.7_model-30BA3B-Sft_step39

31B • Updated 19 days ago • 18

updated a model 20 days ago

daixuancheng/4-26-debugAdv_docker_rbs8_temp0.7_model-30BA3B-Sft_step9

31B • Updated 20 days ago • 19

published a model 20 days ago

daixuancheng/4-26-debugAdv_docker_rbs8_temp0.7_model-30BA3B-Sft_step9

31B • Updated 20 days ago • 19

updated a model 22 days ago

daixuancheng/4-24-30BA3B-s233-final

31B • Updated 22 days ago • 18

published a model 22 days ago

daixuancheng/4-24-30BA3B-s233-final

31B • Updated 22 days ago • 18

updated a model 23 days ago

daixuancheng/4-23_docker_rbs32_model-Coder_step15

Text Generation • 31B • Updated 23 days ago • 588

published a model 23 days ago

daixuancheng/4-23_docker_rbs32_model-Coder_step15

Text Generation • 31B • Updated 23 days ago • 588

upvoted a paper 25 days ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 27 days ago • 84

updated a model 26 days ago

daixuancheng/0418_17K_filtered_10K_reward0.5_4b_inst

4B • Updated 26 days ago • 17

published a model 26 days ago

daixuancheng/0418_17K_filtered_10K_reward0.5_4b_inst

4B • Updated 26 days ago • 17

Daixuan Cheng

AI & ML interests

Recent Activity

Organizations

daixuancheng's activity