LMMs-Lab

community

https://www.lmms-lab.com/

lmmslab

EvolvingLMMs-Lab

AI & ML interests

Feeling and building the multimodal intelligence.

Recent Activity

yiyexy submitted a paper 1 day ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

xiangan authored a paper 6 days ago

ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder

xiangan authored a paper 6 days ago

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

View all activity

Papers

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

View all Papers

lmms-lab 's models 61

lmms-lab/LLaVA-OneVision-1.5-4B-Instruct

Image-Text-to-Text • 5B • Updated 11 days ago • 2.64k • 17

lmms-lab/BAGEL-7B-MoT-ver.LE

Text Generation • 15B • Updated Dec 8, 2025 • 3.88k • 1

lmms-lab/LLaVA-OneVision-1.5-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 21, 2025 • 39.9k • 62

lmms-lab/LLaVA-OneVision-1.5-4B-Base

Image-Text-to-Text • 5B • Updated Oct 5, 2025 • 2.05k • 1

lmms-lab/LLaVA-OneVision-1.5-8B-Base

Image-Text-to-Text • 9B • Updated Sep 30, 2025 • 52 • 1

lmms-lab/LLaVA-OneVision-1.5-4B-stage0

4B • Updated Sep 30, 2025 • 15 • 1

lmms-lab/LLaVA-OneVision-1.5-8B-stage0

9B • Updated Sep 30, 2025 • 4 • 2

lmms-lab/LLaVA-Critic-R1-7B-LLaMA32v

11B • Updated Aug 28, 2025

lmms-lab/LLaVA-Critic-R1-7B-Plus-Mimo

8B • Updated Aug 28, 2025 • 2 • 1

lmms-lab/MMSearch-R1-7B-0807

8B • Updated Aug 7, 2025 • 1

lmms-lab/MMSearch-R1-7B

8B • Updated Jul 30, 2025 • 61 • 9

lmms-lab/LLaVA-Critic-R1-7B-Plus-Qwen

8B • Updated Jul 26, 2025 • 54 • 5

lmms-lab/LLaVA-Critic-R1-7B

8B • Updated Jul 19, 2025 • 174

lmms-lab/Aero-1-Audio

Text Generation • 2B • Updated Jun 7, 2025 • 78 • 91

lmms-lab/EgoGPT-0.5b-Demo

2B • Updated Mar 7, 2025 • 3

lmms-lab/EgoGPT-7b-EgoIT

9B • Updated Mar 7, 2025 • 149

lmms-lab/EgoGPT-7b-EgoIT-EgoLife

9B • Updated Mar 7, 2025 • 56 • 2

lmms-lab/EgoGPT-7b-Demo

9B • Updated Mar 7, 2025 • 1

lmms-lab/LLaVA-NeXT-Video-7B-DPO

Video-Text-to-Text • 7B • Updated Feb 21, 2025 • 960 • 29

lmms-lab/LLaVA-NeXT-Video-7B

Video-Text-to-Text • 7B • Updated Feb 21, 2025 • 967 • 51

lmms-lab/Qwen2-VL-2B-GRPO-8k

2B • Updated Jan 28, 2025 • 16 • 17

lmms-lab/Qwen2-VL-7B-GRPO-8k

8B • Updated Jan 28, 2025 • 1 • 3

lmms-lab/llama3-llava-next-8b-hf-sae-131k

Updated Nov 26, 2024 • 8 • 7

lmms-lab/LLaVA-Video-72B-Qwen2

Text Generation • 73B • Updated Oct 25, 2024 • 284 • 20

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 102k • 124

lmms-lab/llava-onevision-qwen2-7b-ov-chat

Text Generation • 8B • Updated Oct 23, 2024 • 1.14k • 23

lmms-lab/llava-onevision-qwen2-72b-ov-chat

Image-Text-to-Text • 73B • Updated Oct 9, 2024 • 9 • 9

lmms-lab/llava-critic-72b

73B • Updated Oct 4, 2024 • 12 • 15

lmms-lab/llava-critic-7b

8B • Updated Oct 4, 2024 • 134 • 15

lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only

Text Generation • 8B • Updated Oct 4, 2024 • 361 • 6