arxiv:2511.01093
Jarrod Barnes PRO
Jarrodbarnes
AI & ML interests
Continual Learning, Reinforcement Learning
Recent Activity
Updated a model about 6 hours ago: Jarrodbarnes/Qwen3-4B-tau2-grpo-v1
Published a model about 6 hours ago: Jarrodbarnes/Qwen3-4B-tau2-grpo-v1
Upvoted a paper about 7 hours ago: ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking