MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning Paper • 2506.22992 • Published Jun 28, 2025 • 12 • 4
Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task Paper • 2506.08872 • Published Jun 10, 2025 • 13 • 2
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards Paper • 2506.11474 • Published Jun 13, 2025 • 18 • 2