Mashiro's picture

9

Mashiro

AlexMashiro

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

upvoted a paper 6 days ago

RM-R1: Reward Modeling as Reasoning

upvoted a paper 11 days ago

Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling

View all activity

Organizations

None yet

AlexMashiro 's models

None public yet