OpenDataArena/ODA-Fin-RL-8B
Reinforcement Learning • 8B
Data-centric AI, LLM, MLLM
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training