Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2508.19205

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1, 2025 • 512k • 2.14k
microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 25 days ago • 327k • 1.03k
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Generative-Voice

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17, 2025 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 267
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

Bugai's Collection

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26, 2025 • 43
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Voice cloning & TTS

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

bezzam/VibeVoice-1.5B

Text-to-Speech • 3B • Updated 7 days ago • 49
bezzam/VibeVoice-7B

Text-to-Speech • 9B • Updated Nov 20, 2025 • 668
bezzam/VibeVoice-AcousticTokenizer

Feature Extraction • 0.7B • Updated 7 days ago • 334
bezzam/VibeVoice-SemanticTokenizer

Feature Extraction • 0.3B • Updated Dec 3, 2025 • 11

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139
Running on Zero

675

IndexTTS 2 Demo

🏢

675

Generate expressive voice from text using audio reference

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1, 2025 • 512k • 2.14k
microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 25 days ago • 327k • 1.03k
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Voice cloning & TTS

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Generative-Voice

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

bezzam/VibeVoice-1.5B

Text-to-Speech • 3B • Updated 7 days ago • 49
bezzam/VibeVoice-7B

Text-to-Speech • 9B • Updated Nov 20, 2025 • 668
bezzam/VibeVoice-AcousticTokenizer

Feature Extraction • 0.7B • Updated 7 days ago • 334
bezzam/VibeVoice-SemanticTokenizer

Feature Extraction • 0.3B • Updated Dec 3, 2025 • 11

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17, 2025 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 267
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

Bugai's Collection

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space

Paper • 2508.19247 • Published Aug 26, 2025 • 43
VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 139
Running on Zero

675

IndexTTS 2 Demo

🏢

675

Generate expressive voice from text using audio reference

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs