arxiv:2507.16815
Fu-En Yang
FuEnYang
AI & ML interests
Computer Vision, Deep Learning, Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), Reasoning Models, Embodied AI
Recent Activity
upvoted
a
paper
2 days ago
Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos
upvoted
a
paper
2 days ago
LongVie 2: Multimodal Controllable Ultra-Long Video World Model