ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper • 2506.09790 • Published Jun 11, 2025 • 53
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Paper • 2506.18898 • Published Jun 23, 2025 • 33
Align Your Flow: Scaling Continuous-Time Flow Map Distillation Paper • 2506.14603 • Published Jun 17, 2025 • 19
Qwen2.5-Omni Collection End-to-end omni model (text, audio, image, video, and natural speech interaction) based on Qwen2.5 • 7 items • Updated 6 days ago • 160
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated Sep 13, 2025 • 98
Flux.1-dev ControlNets Collection A collection of ControlNet models for Flux.1-dev by Jasper Research • 4 items • Updated Sep 24, 2024 • 26
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated 6 days ago • 227