Hugging Face
shipeng luo
luoagent
0 followers · 6 following
AI & ML interests
ML AI
Recent Activity
upvoted an article about 1 hour ago
Fine-tune Llama 2 with DPO
upvoted a paper 1 day ago
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
upvoted a paper 1 day ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
Organizations
None yet
models
0
None public yet
datasets
0
None public yet