View synesthesiam's full-sized avatar

132 stars written in Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,404 5,937 Updated Aug 16, 2024

Faster Whisper transcription with CTranslate2

Python 20,659 1,711 Updated Nov 19, 2025

Manipulate audio with a simple and easy high level interface

Python 9,724 1,123 Updated Jul 26, 2025

Python library for audio and music analysis

Python 8,161 1,029 Updated Sep 16, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,034 721 Updated Dec 30, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,808 1,386 Updated Dec 6, 2023

Noise suppression using deep filtering

Python 3,770 396 Updated Oct 17, 2024

Rapid fuzzy string matching in Python using various string metrics

Python 3,696 150 Updated Jan 26, 2026

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,494 252 Updated Apr 28, 2025

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Python 3,407 669 Updated Dec 14, 2022

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,423 618 Updated Oct 20, 2021

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,222 210 Updated Dec 27, 2025

WaveRNN Vocoder + TTS

Python 2,176 693 Updated Jul 2, 2022

AI powered speech denoising and enhancement

Python 2,169 263 Updated Dec 3, 2024

CLI task management & automation tool

Python 2,008 187 Updated Jan 10, 2026

Command line utility for forced alignment using Kaldi

Python 1,733 280 Updated Jan 11, 2026

Simple text to phones converter for multiple languages

Python 1,504 196 Updated Sep 26, 2024

Easily serialize Data Classes to and from JSON

Python 1,477 163 Updated Aug 8, 2024

Persian NLP Toolkit

Python 1,368 204 Updated Dec 21, 2025

A fast local neural text to speech engine for Mycroft

Python 1,244 131 Updated Mar 25, 2025

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,222 101 Updated Oct 1, 2024

🔉 Play and Record Sound with Python 🐍

Python 1,215 155 Updated Jan 23, 2026

A Home Assistant integration & Model to control your smart home using a Local LLM

Python 1,207 122 Updated Jan 4, 2026

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,159 222 Updated May 3, 2024

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1,129 99 Updated Nov 24, 2025

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Python 1,091 340 Updated Jun 8, 2024

Modules to convert numbers to words. 42 --> forty-two

Python 935 533 Updated May 30, 2025

g2p: English Grapheme To Phoneme Conversion

Python 910 135 Updated Jan 5, 2023

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

Python 901 89 Updated Aug 20, 2024

Create highly reproducible python environments

Python 894 110 Updated May 20, 2024