Stars
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"
Unofficial implementation of the BRIA RMBG model for ComfyUI
A Python package that makes it easy to use the Kokoro voice synthesis library.
FastDiarize is a lightweight and efficient speaker diarization API powered by FastAPI and pyannote.audio. Alternative to Gladia, AssemblyAI, pyannote.ai...
Coglet animatronic voice assistant robot prototype