Shilong Zhang

shilongz
https://jshilong.github.io/
  • jshilong

AI & ML interests

His research interests focus primarily on large vision-language models and large vision generation models.

Recent Activity

commented on a paper 12 days ago
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
upvoted a paper 13 days ago
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
submitted a paper 13 days ago
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Organizations

FoundationVision

commented on a paper 12 days ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published 16 days ago • 36 • 6
New activity in FoundationVision/FlashVideo 11 months ago

Add library name, pipeline tag

#1 opened 11 months ago by nielsr
commented on a paper 11 months ago

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published Feb 7, 2025 • 24 • 3