3 13 1

Yao

Huaxiu

https://www.huaxiuyao.io/

HuaxiuYaoML

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

submitted a paper 7 days ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

upvoted a paper 7 days ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

View all activity

Organizations

upvoted a paper 3 days ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published 5 days ago • 29

submitted a paper to Daily Papers 7 days ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published 8 days ago • 29

upvoted a paper 7 days ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published 8 days ago • 29

upvoted a paper 10 days ago

PRBench: End-to-end Paper Reproduction in Physics Research

Paper • 2603.27646 • Published 12 days ago • 29

authored 7 papers 21 days ago

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Paper • 2602.08236 • Published Feb 9 • 9

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published Feb 10 • 52

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published Feb 25 • 17

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published 28 days ago • 64

SimpleOCR: Rendering Visualized Questions to Teach MLLMs to Read

Paper • 2602.22426 • Published Feb 25

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 23 days ago • 136

upvoted a paper 22 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 23 days ago • 136

submitted a paper to Daily Papers 22 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 23 days ago • 136

upvoted a paper 28 days ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published 28 days ago • 64

upvoted 2 papers about 2 months ago

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published Feb 10 • 52

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

upvoted a paper 2 months ago

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 21

upvoted an article 3 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

116

upvoted a paper 3 months ago

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37

upvoted a paper 4 months ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published Nov 25, 2025 • 49

Yao

AI & ML interests

Recent Activity

Organizations

Huaxiu's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment