Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 550
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! • 75 items • Updated Apr 20 • 92
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 97
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8, 2024 • 83