DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams Paper • 2606.21337 • Published 7 days ago • 70
Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 10 days ago • 74
AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions Paper • 2605.25707 • Published May 25 • 6
stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-8B_strategy_trust_t1.1_g6_run1_metrics Viewer • Updated 26 days ago • 164 • 41 • 2
SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published May 8 • 25
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published about 1 month ago • 431
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Paper • 2605.22109 • Published May 21 • 171