arxiv:2602.10693
floyed shen
floyed
AI & ML interests
None yet
Recent Activity
submitted a paper 2 days ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
upvoted a paper 9 days ago
From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation
upvoted a paper 9 days ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information