Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 11 days ago • 17
Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation 15 days ago • 23
ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional Activation Editing 7 days ago • 5
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 113
Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 11 days ago • 17
Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation 15 days ago • 23
ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional Activation Editing 7 days ago • 5
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 113