Lists (1)
Sort Name ascending (A-Z)
Starred repositories
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
A helper package to get information of scholarly articles from DBLP using its public API
Easily create your own custom icons in just a few clicks in the browser
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
Pytorch implementation of the CREPE pitch tracker
netnet.studio is a hypermedia higherEd cyberspace for fully realizing the Web’s creative potential.
mlciv / ai-deadlines
Forked from paperswithcode/ai-deadlines⏰ AI conference deadline countdowns
real time face swap and one-click video deepfake with only a single image
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
A list of summer schools on Artificial Intelligence, Machine Learning, and Healthcare
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
The Project Gutenberg tool to generate EPUBs and other ebook formats.
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
Track emissions from Compute and recommend ways to reduce their impact on the environment.
An online wiki to support sustainable practices within and beyond NIME as outlined in the NIME Conference Environmental Statement.
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.
my collection of figlet / toilet ascii art fonts
Neat allows you to build awesome gradients for your website, using 3d shaders
kelseyicotton / voicebook
Forked from jim-schwoebel/voicebook🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
DiffSinger training colab notebook to make training easier hopefully

