NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents Paper โข 2512.12730 โข Published 11 days ago โข 43
Scaling Latent Reasoning via Looped Language Models Paper โข 2510.25741 โข Published Oct 29 โข 221