liumaolin committed
Commit d41c6db
Parent(s): b446464
Add `silero-vad` as a dependency in `pyproject.toml` and update `uv.lock`
- pyproject.toml +1 -0
- uv.lock +34 -0
pyproject.toml CHANGED
@@ -29,6 +29,7 @@ dependencies = [
     "pypinyin>=0.54.0",
     "pytorch-lightning==2.3.1",
     "pywhispercpp",
+    "silero-vad==5.1.2",
     "soundfile==0.13.1",
     "torch==2.3.1",
     "transformers==4.41.2",
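As a reading aid for the hunk above, the following is a minimal sketch of how the requirement strings break down into a name, an operator, and a version. It is not part of the commit: the helper names `split_requirement` and `exact_pins` are hypothetical, and the regular expression is a deliberate simplification that only handles the single-clause forms visible in this hunk (no extras, markers, or multiple clauses).

```python
import re

# Requirement strings copied from the pyproject.toml hunk above.
DEPENDENCIES = [
    "pypinyin>=0.54.0",
    "pytorch-lightning==2.3.1",
    "pywhispercpp",
    "silero-vad==5.1.2",
    "soundfile==0.13.1",
    "torch==2.3.1",
    "transformers==4.41.2",
]

# Simplified pattern (an assumption for this sketch): a package name,
# then an optional single comparison operator followed by a version.
_REQ = re.compile(r"^([A-Za-z0-9._-]+)\s*(==|>=|<=|~=|!=|<|>)?\s*(.*)$")


def split_requirement(req):
    """Split 'name==1.2.3' into (name, operator, version).

    Operator and version are None when the requirement is unpinned.
    """
    match = _REQ.match(req.strip())
    if match is None:
        raise ValueError(f"unrecognised requirement: {req!r}")
    name, op, version = match.groups()
    return name, op, (version or None)


def exact_pins(requirements):
    """Return {name: version} for requirements pinned with '=='."""
    pins = {}
    for req in requirements:
        name, op, version = split_requirement(req)
        if op == "==":
            pins[name] = version
    return pins


if __name__ == "__main__":
    for name, version in exact_pins(DEPENDENCIES).items():
        print(f"{name} is pinned to {version}")
```

Under these assumptions, `silero-vad` falls in the same group as `torch` and `transformers` (exact `==` pins), while `pypinyin` carries a lower bound and `pywhispercpp` has no version constraint in this list.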
uv.lock CHANGED
@@ -2856,6 +2856,20 @@ wheels = [
     { url = "https://files.pythonhosted.org/packages/e0/f9/0595336914c5619e5f28a1fb793285925a8cd4b432c9da0a987836c7f822/shellingham-1.5.4-py2.py3-none-any.whl", hash = "sha256:7ecfff8f2fd72616f7481040475a65b2bf8af90a56c89140852d1120324e8686", size = 9755, upload-time = "2023-10-24T04:13:38.866Z" },
 ]
 
+[[package]]
+name = "silero-vad"
+version = "5.1.2"
+source = { registry = "https://pypi.org/simple" }
+dependencies = [
+    { name = "onnxruntime" },
+    { name = "torch" },
+    { name = "torchaudio" },
+]
+sdist = { url = "https://files.pythonhosted.org/packages/b1/b4/d0311b2e6220a11f8f4699f4a278cb088131573286cdfe804c87c7eb5123/silero_vad-5.1.2.tar.gz", hash = "sha256:c442971160026d2d7aa0ad83f0c7ee86c89797a65289fe625c8ea59fc6fb828d", size = 5098526, upload-time = "2024-10-09T09:50:47.019Z" }
+wheels = [
+    { url = "https://files.pythonhosted.org/packages/98/f7/5ae11d13fbb733cd3bfd7ff1c3a3902e6f55437df4b72307c1f168146268/silero_vad-5.1.2-py3-none-any.whl", hash = "sha256:93b41953d7774b165407fda6b533c119c5803864e367d5034dc626c82cfdf661", size = 5026737, upload-time = "2024-10-09T09:50:44.355Z" },
+]
+
 [[package]]
 name = "six"
 version = "1.17.0"
@@ -3265,6 +3279,24 @@ wheels = [
     { url = "https://files.pythonhosted.org/packages/49/b6/1a2e3d43d4bc4ad7a4575b3745d707a68d5ed00ba263b205b6281bdd0921/torch-2.3.1-cp312-none-macosx_11_0_arm64.whl", hash = "sha256:3c333dc2ebc189561514eda06e81df22bf8fb64e2384746b2cb9f04f96d1d4c8", size = 60978559, upload-time = "2024-06-05T16:41:27.77Z" },
 ]
 
+[[package]]
+name = "torchaudio"
+version = "2.3.1"
+source = { registry = "https://pypi.org/simple" }
+dependencies = [
+    { name = "torch" },
+]
+wheels = [
+    { url = "https://files.pythonhosted.org/packages/41/c3/51482591d741c0c069f5825fe02d9ae387dc63eef2fe4cea1d1f3b07a623/torchaudio-2.3.1-cp311-cp311-macosx_11_0_arm64.whl", hash = "sha256:01984f38398ca5e98ecfbfeafb72ae5b2131d0bb8aa464b5777addb3e4826877", size = 1816843, upload-time = "2024-06-05T16:45:04.237Z" },
+    { url = "https://files.pythonhosted.org/packages/62/04/3acb3673dcc9f493e65798d752841137cfcc14220a8eb4ec8dc202382bcc/torchaudio-2.3.1-cp311-cp311-manylinux1_x86_64.whl", hash = "sha256:68815815e09105fe1171f0541681a7ebaf6d5d52b8e095ccde94b8064b107002", size = 3355690, upload-time = "2024-06-05T16:44:55.522Z" },
+    { url = "https://files.pythonhosted.org/packages/e7/8b/a883b4359c88d4e77d9301b4aaba6793a5de09a69010f2b56ba47655f49e/torchaudio-2.3.1-cp311-cp311-manylinux2014_aarch64.whl", hash = "sha256:c8c727c8341825bd18d91017c4c00f36b53b08f2176cdb9bdcb0def1c450b21d", size = 1651755, upload-time = "2024-06-05T16:44:59.953Z" },
+    { url = "https://files.pythonhosted.org/packages/7a/bf/5af72c1c4522bcf67df140427120b0e7898b2abc5afa5da917b722983a5b/torchaudio-2.3.1-cp311-cp311-win_amd64.whl", hash = "sha256:341e33450831146bc4c4cc8191d94484f1acc8bb566c2463a57c4133f792464e", size = 2368862, upload-time = "2024-06-05T16:44:41.152Z" },
+    { url = "https://files.pythonhosted.org/packages/45/97/e584276755305d3a0af0ff280ce7eeafc45855355ae0a6de38b13ac195ac/torchaudio-2.3.1-cp312-cp312-macosx_11_0_arm64.whl", hash = "sha256:5e36685420a07a176146e9d6e0fa8225198f126e167a00785538f853807e2d43", size = 1808180, upload-time = "2024-06-05T16:44:58.503Z" },
+    { url = "https://files.pythonhosted.org/packages/48/b7/04a96dabea4ffd1cc990b429c36f08316b442f2ea4964bd8b52e28aeb36f/torchaudio-2.3.1-cp312-cp312-manylinux1_x86_64.whl", hash = "sha256:07b72d76fa108ac0f3400a759456ba96bdaa2b8649fd9588cc93295a532b01d9", size = 3350075, upload-time = "2024-06-05T16:44:39.047Z" },
+    { url = "https://files.pythonhosted.org/packages/53/30/89de93b7082b58c29f2e368a737c613a07d0adbe87505b9592f4c88be718/torchaudio-2.3.1-cp312-cp312-manylinux2014_aarch64.whl", hash = "sha256:42af6c7a430e6268f2c028e06078d413912b5ec6efa28a097ebdd3c3c79659df", size = 1646768, upload-time = "2024-06-05T16:44:46.727Z" },
+    { url = "https://files.pythonhosted.org/packages/22/fa/23a6456de8b6fbac7026efb9c7163335c57b79437618686149daf2f9be39/torchaudio-2.3.1-cp312-cp312-win_amd64.whl", hash = "sha256:25bd1137e47de96b48ef0dc4865bc620a0b759e44c009c7e78e92d7bfdf257ba", size = 2362582, upload-time = "2024-06-05T16:44:34.565Z" },
+]
+
 [[package]]
 name = "torchmetrics"
 version = "1.7.2"
@@ -3446,6 +3478,7 @@ dependencies = [
     { name = "pypinyin" },
     { name = "pytorch-lightning" },
     { name = "pywhispercpp" },
+    { name = "silero-vad" },
     { name = "soundfile" },
     { name = "torch" },
     { name = "transformers" },
@@ -3485,6 +3518,7 @@ requires-dist = [
     { name = "pypinyin", specifier = ">=0.54.0" },
     { name = "pytorch-lightning", specifier = "==2.3.1" },
     { name = "pywhispercpp", git = "https://github.com/absadiki/pywhispercpp" },
+    { name = "silero-vad", specifier = "==5.1.2" },
     { name = "soundfile", specifier = "==0.13.1" },
     { name = "torch", specifier = "==2.3.1" },
     { name = "transformers", specifier = "==4.41.2" },
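To make the shape of the lockfile change easier to follow, here is a small sketch that walks the dependency edges shown in the `uv.lock` hunks and lists everything `silero-vad` pulls in. It is illustrative only and not part of the commit: `LOCK_GRAPH` and `transitive_deps` are hypothetical names, and the empty dependency lists for `onnxruntime` and `torch` are an assumption, since their package blocks are not visible in this diff.

```python
from collections import deque

# Dependency edges copied from the uv.lock hunks in this commit.
# onnxruntime and torch have no visible block in the diff, so their own
# dependencies are left empty here (an assumption for the illustration).
LOCK_GRAPH = {
    "silero-vad": ["onnxruntime", "torch", "torchaudio"],
    "torchaudio": ["torch"],
    "onnxruntime": [],
    "torch": [],
}


def transitive_deps(graph, root):
    """Return every package reachable from root, breadth-first, without root."""
    seen = []
    queue = deque(graph.get(root, []))
    while queue:
        name = queue.popleft()
        if name in seen:
            continue  # already visited via another path
        seen.append(name)
        queue.extend(graph.get(name, []))
    return seen


if __name__ == "__main__":
    print(transitive_deps(LOCK_GRAPH, "silero-vad"))
```

Of the three packages reached from `silero-vad`, only `torchaudio` gets a new `[[package]]` block in this diff, which suggests that `onnxruntime` and `torch` were already present in the lockfile. The lock resolves `torchaudio` to 2.3.1, the same version as the pinned `torch`.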