Dawei Li

wjldw

https://david-li0406.github.io/

AI & ML interests

LLM, NLP, Data Mining

Recent Activity

upvoted a paper 13 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

upvoted a paper about 1 month ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

authored a paper about 1 month ago

ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents

Papers 15

arxiv:2601.12294

arxiv:2509.25154

arxiv:2508.19570

arxiv:2508.01191

Models 18

wjldw/ToolPRM-GRPO-synthesis

4B • Updated Jan 4 • 4

wjldw/ToolPRM-GRPO-v4

4B • Updated Jan 3 • 2

wjldw/ToolPRM-Base-v4

Text Generation • 196k • Updated Jan 3 • 3

wjldw/ToolPRM-CoT-v4

Text Generation • 196k • Updated Jan 3 • 2

wjldw/ToolPRM-Base-synthesis

Text Generation • 196k • Updated Jan 3 • 2

wjldw/ToolPRM-GRPO-v3

4B • Updated Jan 1 • 1

wjldw/ToolPRM-Checklist-v3

Text Generation • 196k • Updated Jan 1 • 2

wjldw/ToolPRM-Base-v3

Text Generation • 196k • Updated Jan 1 • 1

wjldw/ToolPRM-CoT-v3

Text Generation • 196k • Updated Jan 1 • 2

wjldw/Qwen2.5-14B_gemini_sft_30000

Text Generation • 15B • Updated Jul 29, 2025 • 2

Datasets 1

wjldw/JD-Bench

Viewer • Updated Sep 29, 2025 • 42k • 15