Mitigating Catastrophic Forgetting in Language Transfer via Model Merging Paper • 2407.08699 • Published Jul 11, 2024 • 1
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 57
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published 17 days ago • 42
TRAVL: A Recipe for Making Video-Language Models Better Judges of Physics Implausibility Paper • 2510.07550 • Published Oct 8, 2025 • 3
PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face Inpainting Paper • 2304.06107 • Published Apr 12, 2023 • 2
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published Jan 14, 2025 • 34
Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM Article • Apr 23, 2025 • 63