Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 5 days ago • 244
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 6 days ago • 41
Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers Article • 10 days ago • 63
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 6 days ago • 84
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence Paper • 2604.18292 • Published 6 days ago • 78
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 18 days ago • 34
khazarai/Qwen3-4B-Qwen3.6-plus-Reasoning-Distilled-GGUF Text Generation • 4B • Updated 6 days ago • 80.5k • 33
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 20 days ago • 200
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding Paper • 2604.00528 • Published 25 days ago • 12