AlpachinoNLP/LongCLIP-ViT-B-32 Zero-Shot Image Classification • 0.2B • Updated 11 days ago • 16
AlpachinoNLP/LongCLIP-ViT-B-32 Zero-Shot Image Classification • 0.2B • Updated 11 days ago • 16
Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension Paper • 2512.02791 • Published Dec 2, 2025 • 1
Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension Paper • 2512.02791 • Published Dec 2, 2025 • 1
QTSplus Collection Official models and datasets for paper(https://arxiv.org/abs/2511.11910) • 7 items • Updated Dec 2, 2025 • 1
QTSplus Collection Official models and datasets for paper(https://arxiv.org/abs/2511.11910) • 7 items • Updated Dec 2, 2025 • 1
QTSplus Collection Official models and datasets for paper(https://arxiv.org/abs/2511.11910) • 7 items • Updated Dec 2, 2025 • 1
QTSplus Collection Official models and datasets for paper(https://arxiv.org/abs/2511.11910) • 7 items • Updated Dec 2, 2025 • 1