speech-to-speech

v2025 Feature

This release adds 4 notable features for engineering teams evaluating rollout.

Published 3mo AI Agents & Assistants

View tool

✓ No known CVEs patched

Read the diff → Tool health → What is this tool? →

✓ No known CVEs patched in this version

Topics

ai assistant language-model machine-learning python speech

+3 more

speech-synthesis speech-to-text speech-translation

Summary

AI summary

Add Chinese support with paraformer_zh ASR, ChatTTS, DeepFilterNet speech enhancement, and macOS multi-language.

Full changelog

What's Changed

Minor doc fix. by @Vaibhavs10 in https://github.com/huggingface/speech-to-speech/pull/2
Fix missing sounddevice module by @AlexHayton in https://github.com/huggingface/speech-to-speech/pull/7
Update README.md by @RodriMora in https://github.com/huggingface/speech-to-speech/pull/23
fix issue with ntlk by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/29
Dockerize by @codearranger in https://github.com/huggingface/speech-to-speech/pull/22
Add support to MPS by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/20
adding apache license by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/31
refactor arguments folder + run ruff by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/32
Allow LM selection and MLX Gemma by @RonanKMcGovern in https://github.com/huggingface/speech-to-speech/pull/40
Improvements mlx pipeline by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/41
refactor all the handlers - folder structure by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/43
add min new tokens by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/49
add warning to install flash attn by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/51
improve logging by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/52
feat:add paraformer_zh asr by @wuhongsheng in https://github.com/huggingface/speech-to-speech/pull/48
Assigning min new tokens to a compiled whisper graph on a thread brea… by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/58
Add paraformer - Chinese STT by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/53
feat:add chatTTS by @wuhongsheng in https://github.com/huggingface/speech-to-speech/pull/55
Add ChatTTS - Chinese support by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/59
feat:add DeepFilterNet for speech enhancement to obtain clear speech … by @wuhongsheng in https://github.com/huggingface/speech-to-speech/pull/61
improve documentation by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/77
Add support for multiple languages by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/60
Update module_arguments.py by @AgainstEntropy in https://github.com/huggingface/speech-to-speech/pull/78
Add language arg to lightning whisper handler by @rchan26 in https://github.com/huggingface/speech-to-speech/pull/84
Fix relative link in README by @rchan26 in https://github.com/huggingface/speech-to-speech/pull/85
fix by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/87
fix: Changed [True] to [False] in help text for audio_enhancement to align with actual default by @BrutalCoding in https://github.com/huggingface/speech-to-speech/pull/91
Update: Added multi-language support for macOS. by @ybm911 in https://github.com/huggingface/speech-to-speech/pull/93
Mac multi language by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/98
Refactor for inference by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/106
feat:Add rest call support similar to oepn-api style by @wuhongsheng in https://github.com/huggingface/speech-to-speech/pull/81
Improve auto language by @eustlb in https://github.com/huggingface/speech-to-speech/pull/112
Readme update + clarity improvements by @eustlb in https://github.com/huggingface/speech-to-speech/pull/113
updated readme for a small typo by @ankanpy in https://github.com/huggingface/speech-to-speech/pull/115
Fix hanging client on KeyboardInterrupt by @3manifold in https://github.com/huggingface/speech-to-speech/pull/121
made small fixes in arguments_classes and TTS folder by @ankanpy in https://github.com/huggingface/speech-to-speech/pull/116
Facebook mms merge by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/123
New new faster whisper by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/124
Add moonshine by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/127
set keras backend to torch. by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/129
Fixed typos in README.md by @sergiopaniego in https://github.com/huggingface/speech-to-speech/pull/137
Bugfix: can not concatenate str + GenerationResponse by @baldassarreFe in https://github.com/huggingface/speech-to-speech/pull/144
Update Parler-TTS base model and description by @ylacombe in https://github.com/huggingface/speech-to-speech/pull/147
adding more languages by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/148
multilingual improvements for parler by @andimarafioti in https://github.com/huggingface/speech-to-speech/pull/149
Improved Error Message for get_tts_handler by @Arslan-Mehmood1 in https://github.com/huggingface/speech-to-speech/pull/155

New Contributors

@Vaibhavs10 made their first contribution in https://github.com/huggingface/speech-to-speech/pull/2
@AlexHayton made their first contribution in https://github.com/huggingface/speech-to-speech/pull/7
@RodriMora made their first contribution in https://github.com/huggingface/speech-to-speech/pull/23
@andimarafioti made their first contribution in https://github.com/huggingface/speech-to-speech/pull/29
@codearranger made their first contribution in https://github.com/huggingface/speech-to-speech/pull/22
@RonanKMcGovern made their first contribution in https://github.com/huggingface/speech-to-speech/pull/40
@wuhongsheng made their first contribution in https://github.com/huggingface/speech-to-speech/pull/48
@rchan26 made their first contribution in https://github.com/huggingface/speech-to-speech/pull/84
@BrutalCoding made their first contribution in https://github.com/huggingface/speech-to-speech/pull/91
@ybm911 made their first contribution in https://github.com/huggingface/speech-to-speech/pull/93
@eustlb made their first contribution in https://github.com/huggingface/speech-to-speech/pull/112
@ankanpy made their first contribution in https://github.com/huggingface/speech-to-speech/pull/115
@3manifold made their first contribution in https://github.com/huggingface/speech-to-speech/pull/121
@sergiopaniego made their first contribution in https://github.com/huggingface/speech-to-speech/pull/137
@baldassarreFe made their first contribution in https://github.com/huggingface/speech-to-speech/pull/144
@ylacombe made their first contribution in https://github.com/huggingface/speech-to-speech/pull/147
@Arslan-Mehmood1 made their first contribution in https://github.com/huggingface/speech-to-speech/pull/155

Full Changelog: https://github.com/huggingface/speech-to-speech/commits/2025

View diff on GitHub

Weekly OSS security release digest.

The CVE patches and breaking changes that affected production tools this week. One email, every Sunday.

No spam, unsubscribe anytime.

Share this release

Share on X Share on Bluesky

Track speech-to-speech

Get notified when new releases ship.

About speech-to-speech

All releases →

speech-to-speech

Summary

What's Changed

New Contributors

Related context

Related tools