Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
56
1
Dawei Li
wjldw
Follow
Ironieser's profile picture
HarryHe's profile picture
HowieHwong's profile picture
6 followers
·
22 following
https://david-li0406.github.io/
home
David-Li0406
dawei-li-29b334251
AI & ML interests
LLM, NLP, Data Mining
Recent Activity
upvoted
a
paper
13 days ago
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
upvoted
a
paper
about 1 month ago
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models
authored
a paper
about 1 month ago
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
View all activity
Organizations
Papers
15
arxiv:
2601.12294
arxiv:
2509.25154
arxiv:
2508.19570
arxiv:
2508.01191
Expand 15 papers
models
18
Sort: Recently updated
wjldw/ToolPRM-GRPO-synthesis
4B
•
Updated
Jan 4
•
4
wjldw/ToolPRM-GRPO-v4
4B
•
Updated
Jan 3
•
2
wjldw/ToolPRM-Base-v4
Text Generation
•
196k
•
Updated
Jan 3
•
3
wjldw/ToolPRM-CoT-v4
Text Generation
•
196k
•
Updated
Jan 3
•
2
wjldw/ToolPRM-Base-synthesis
Text Generation
•
196k
•
Updated
Jan 3
•
2
wjldw/ToolPRM-GRPO-v3
4B
•
Updated
Jan 1
•
1
wjldw/ToolPRM-Checklist-v3
Text Generation
•
196k
•
Updated
Jan 1
•
2
wjldw/ToolPRM-Base-v3
Text Generation
•
196k
•
Updated
Jan 1
•
1
wjldw/ToolPRM-CoT-v3
Text Generation
•
196k
•
Updated
Jan 1
•
2
wjldw/Qwen2.5-14B_gemini_sft_30000
Text Generation
•
15B
•
Updated
Jul 29, 2025
•
2
View 18 models
datasets
1
wjldw/JD-Bench
Viewer
•
Updated
Sep 29, 2025
•
42k
•
15