🎙️ MamreVoice: State-of-the-Art Hebrew TTS

Project Site | GitHub

MamreVoice is a high-performance, end-to-end Text-to-Speech (TTS) and Voice Conversion system specifically optimized for the Hebrew language. Built on the innovative DiffMamba architecture, it delivers expressive, clear, and lifelike synthetic speech.

✨ Features

Expressive Hebrew Synthesis: Specialized optimization for Hebrew prosody, rhythm, and naturalness.
Voice Conversion: High-quality voice cloning and identity preservation.
Phonetics-Aware: Seamless integration with Phonikud for accurate Hebrew diacritization (Niqqud) and vowelization.
Efficiency: Designed for low computational overhead, enabling real-time performance on modern Nvidia GPU hardware.

🛠️ Technical Overview

Architecture

MamreVoice utilizes DiffMamba, which addresses the computational heavy-lifting of traditional diffusion models by using Mamba blocks. This allows for long-context handling and faster inference without sacrificing audio quality.

Components

Text Processing: Advanced normalization and diacritization via Phonikud.
BackBone Model: DiffMamba-based Backbone
Rest Of The Model: is based on Zonos

Downloads last month: 80

Model tree for notmax123/MamreTTS

Base model

notmax123/Zonos-Hebrew

Finetuned

(1)

this model

notmax123
/

MamreTTS