arsenalhuang 's Collections image edit
updated
CoLLM: A Large Language Model for Composed Image Retrieval
Paper
• 2503.19910
• Published
• 15
LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized
Text-Guided Image Editing
Paper
• 2503.21541
• Published
• 1
HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction
via Gaussian Restoration
Paper
• 2504.03536
• Published
• 13
FantasyTalking: Realistic Talking Portrait Generation via Coherent
Motion Synthesis
Paper
• 2504.04842
• Published
• 35
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based
Spatiotemporal Diffusion for Audio-driven Talking Portrait
Paper
• 2503.12963
• Published
• 7
RASA: Replace Anyone, Say Anything -- A Training-Free Framework for
Audio-Driven and Universal Portrait Video Editing
Paper
• 2503.11571
• Published
• 1
VisualCloze: A Universal Image Generation Framework via Visual
In-Context Learning
Paper
• 2504.07960
• Published
• 50
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based
Image Editing
Paper
• 2505.02370
• Published
• 14
In-Context Edit: Enabling Instructional Image Editing with In-Context
Generation in Large Scale Diffusion Transformer
Paper
• 2504.20690
• Published
• 19
Emerging Properties in Unified Multimodal Pretraining
Paper
• 2505.14683
• Published
• 133
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video
Diffusion Models
Paper
• 2503.18033
• Published
• 30
UniWorld: High-Resolution Semantic Encoders for Unified Visual
Understanding and Generation
Paper
• 2506.03147
• Published
• 58
In-Context Brush: Zero-shot Customized Subject Insertion with
Context-Aware Latent Space Manipulation
Paper
• 2505.20271
• Published
• 1
CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic
Design Generation
Paper
• 2506.10890
• Published
• 9