Tolga Cangöz's picture

Open to Work

Tolga Cangöz

tolgacangoz

·

standard_ai

AI & ML interests

AIGC

Recent Activity

upvoted a paper about 22 hours ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

liked a model 1 day ago

Wan-AI/Wan2.1-T2V-1.3B

upvoted an article 2 days ago

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

View all activity

Organizations

upvoted a paper about 22 hours ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Paper • 2605.30351 • Published 7 days ago • 25

upvoted an article 2 days ago

Article

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

nvidia

•

3 days ago

• 62

upvoted 2 articles 3 days ago

Article

MONET: Lowering the bar for World-Class Image Generation research.

jasperai

•

7 days ago

• 10

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

+3

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

6 days ago

• 73

upvoted a collection 4 days ago

LingBot-World

4 items • Updated 6 days ago • 38

upvoted a paper 5 days ago

Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion

Paper • 2605.26266 • Published 10 days ago • 1

upvoted a paper 14 days ago

One Pass Is Not Enough: Recursive Latent Refinement for Generative Models

Paper • 2605.15309 • Published 21 days ago • 1

upvoted a paper 15 days ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published 17 days ago • 112

upvoted a collection 17 days ago

SANA-WM

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer • 2 items • Updated 17 days ago • 4

upvoted a paper 20 days ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 21 days ago • 85

upvoted a paper 27 days ago

WorldJen: An End-to-End Multi-Dimensional Benchmark for Generative Video Models

Paper • 2605.03475 • Published about 1 month ago • 8

upvoted an article about 1 month ago

Article

Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts

NucleusAI

•

Apr 14

• 11

upvoted a paper about 1 month ago

Speculative Decoding for Autoregressive Video Generation

Paper • 2604.17397 • Published Apr 19 • 11

upvoted an article about 2 months ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 52

upvoted a collection about 2 months ago

Modular Pipelines

Diffusers Modular Pipeline repositories • 7 items • Updated Feb 20 • 2

upvoted an article 2 months ago

Article

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

+2

YiYiXu, OzzyGT, dn6, sayakpaul

•

Mar 5

• 51

upvoted 2 papers 2 months ago

6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models

Paper • 2603.18742 • Published Mar 19 • 11

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

upvoted a collection 2 months ago

AutoGaze

7 items • Updated Mar 19 • 9

upvoted a paper 2 months ago

ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA

Paper • 2603.10256 • Published Mar 10 • 23