Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shilong Zhang's picture
5 7 12

Shilong Zhang

shilongz
AdinaY's profile picture Meka-1018's profile picture pjlab-l0240's profile picture
·
https://jshilong.github.io/
  • jshilong

AI & ML interests

His research interests are primarily focused on Large Vision-Language Model and Large Vision Generation Model

Organizations

FoundationVision's profile picture

upvoted a paper 2 months ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published Dec 19, 2025 • 37
upvoted 2 papers about 1 year ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7, 2025 • 106

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published Feb 7, 2025 • 24
upvoted an article over 1 year ago
view article
Article

Diffusers welcomes Stable Diffusion 3

  • +4
Jun 12, 2024
•
99
upvoted 2 papers over 1 year ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

Zero-shot Image Editing with Reference Imitation

Paper • 2406.07547 • Published Jun 11, 2024 • 33
upvoted a paper almost 2 years ago

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Paper • 2403.17008 • Published Mar 25, 2024 • 22
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs