Michael Hansen synesthesiam

Computer/cognitive science PhD, open source voice assistant enthusiast.

727 followers · 2 following

Open Home Foundation
United States
https://synesthesiam.com
@rhasspy

Achievements

x4 x2 x4 x3

Lists (1)

🔮 Future ideas

2 repositories

Stars

132 stars written in Python

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,404 5,937 Updated Aug 16, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 20,659 1,711 Updated Nov 19, 2025

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python 9,724 1,123 Updated Jul 26, 2025

librosa / librosa

Python library for audio and music analysis

Python 8,161 1,029 Updated Sep 16, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,034 721 Updated Dec 30, 2025

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,808 1,386 Updated Dec 6, 2023

Rikorose / DeepFilterNet

Noise suppression using deep filtering

Python 3,770 396 Updated Oct 17, 2024

rapidfuzz / RapidFuzz

Rapid fuzzy string matching in Python using various string metrics

Python 3,696 150 Updated Jan 26, 2026

kuprel / min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,494 252 Updated Apr 28, 2025

minimaxir / gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Python 3,407 669 Updated Dec 14, 2022

jameslyons / python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,423 618 Updated Oct 20, 2021

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,222 210 Updated Dec 27, 2025

fatchord / WaveRNN

WaveRNN Vocoder + TTS

Python 2,176 693 Updated Jul 2, 2022

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 2,169 263 Updated Dec 3, 2024

pydoit / doit

CLI task management & automation tool

Python 2,008 187 Updated Jan 10, 2026

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,733 280 Updated Jan 11, 2026

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,504 196 Updated Sep 26, 2024

lidatong / dataclasses-json

Easily serialize Data Classes to and from JSON

Python 1,477 163 Updated Aug 8, 2024

roshan-research / hazm

Persian NLP Toolkit

Python 1,368 204 Updated Dec 21, 2025

MycroftAI / mimic3

A fast local neural text to speech engine for Mycroft

Python 1,244 131 Updated Mar 25, 2025

bheinzerling / bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,222 101 Updated Oct 1, 2024

spatialaudio / python-sounddevice

🔉 Play and Record Sound with Python 🐍

Python 1,215 155 Updated Jan 23, 2026

acon96 / home-llm

A Home Assistant integration & Model to control your smart home using a Local LLM

Python 1,207 122 Updated Jan 4, 2026

spring-media / TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,159 222 Updated May 3, 2024

iver56 / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1,129 99 Updated Nov 24, 2025

alumae / kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Python 1,091 340 Updated Jun 8, 2024

savoirfairelinux / num2words

Modules to convert numbers to words. 42 --> forty-two

Python 935 533 Updated May 30, 2025

Kyubyong / g2p

g2p: English Grapheme To Phoneme Conversion

Python 910 135 Updated Jan 5, 2023

nipunsadvilkar / pySBD

🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detector that works out-of-the-box.

Python 901 89 Updated Aug 20, 2024

DavHau / mach-nix

Create highly reproducible python environments

Python 894 110 Updated May 20, 2024