combines reinforcement learning (RL) and large language models (LLMs) to improve exploration using diverse tool generation during inference
Gabriel Bo PRO
gabrielbo
·
AI & ML interests
NLP, Scaling, Test-time Compute
Recent Activity
updated a dataset 2 days ago
gabrielbo/parser-bench published a dataset about 1 month ago
gabrielbo/parser-bench updated a dataset about 1 month ago
gabrielbo/parser-bench