view article Article Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation nvidia • 3 days ago • 15
view article Article Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation nvidia • 3 days ago • 15
view article Article NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI nvidia • Jan 5 • 64
view article Article Introducing NVIDIA Cosmos Policy for Advanced Robot Control nvidia • Jan 29 • 48
view article Article DeepMath: A lightweight math reasoning Agent with smolagents +1 danf, mber, moshew • Dec 4, 2025 • 40
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 13, 2025 • 29
view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Jiqing, MatrixYao, kding1, IlyasMoutawwakil • Oct 16, 2025 • 18
view article Article Fast LoRA inference for Flux with Diffusers and PEFT sayakpaul, BenjaminB • Jul 23, 2025 • 54
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k
Running 3.85k The Ultra-Scale Playbook 🌌 3.85k The ultimate guide to training LLM on large GPU Clusters
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 44
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 44
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131 • Apr 16, 2025 • 42
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131 • Apr 16, 2025 • 42
view article Article 🚀 Accelerating LLM Inference with TGI on Intel Gaudi +3 baptistecolle, regisss, IlyasMoutawwakil, echarlaix, kding1 • Mar 28, 2025 • 14
view article Article 🚀 Accelerating LLM Inference with TGI on Intel Gaudi +3 baptistecolle, regisss, IlyasMoutawwakil, echarlaix, kding1 • Mar 28, 2025 • 14
view article Article Benchmarking Language Model Performance on 5th Gen Xeon at GCP +1 MatrixYao, kding1, IlyasMoutawwakil • Dec 17, 2024 • 7
view article Article Accelerating Protein Language Model ProtST on Intel Gaudi 2 +6 juliensimon, Jiqing, smiret, katarinayuan, sywangyi, MatrixYao, ChrisAllenMing, kding1 • Jul 3, 2024 • 2
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 juliensimon, Haihao, antonyvance, MatrixYao, lianglv, gserochi, Debbh, kding1 • May 9, 2024 • 12
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 juliensimon, Haihao, antonyvance, MatrixYao, lianglv, gserochi, Debbh, kding1 • May 9, 2024 • 12