Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7, 2025 • 67
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 32
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 206
Winograd Schema Challenge Collection Datasets related to the original Winograd Schema Challenge (WSC) • 7 items • Updated Jan 21, 2024 • 1
LexSemBridge: Fine-Grained Dense Representation Enhancement through Token-Aware Embedding Augmentation Paper • 2508.17858 • Published Aug 25, 2025 • 10
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks Paper • 2411.01192 • Published Nov 2, 2024 • 5
Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets Paper • 2504.20119 • Published Apr 28, 2025 • 3
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2, 2025 • 64