shipeng luo's picture

shipeng luo

luoagent

·

AI & ML interests

ML AI

Recent Activity

upvoted an article about 1 hour ago

使用 DPO 微调 Llama 2

upvoted a paper 1 day ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

upvoted a paper 1 day ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

View all activity

Organizations

None yet

models 0

None public yet

datasets 0

None public yet