Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Python tool for converting files and office documents to Markdown.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Rembg is a tool to remove images background
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
A TTS model capable of generating ultra-realistic dialogue in one pass.
Awesome list of open-source startup alternatives to well-known SaaS products 🚀
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
A TTS that fits in your CPU (and pocket)
[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming
The collection of pre-trained, state-of-the-art AI models for ailia SDK
first base model for full-duplex conversational audio
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Ink/Stitch: an Inkscape extension for machine embroidery design
Soprano: Instant, Ultra-Realistic Text-to-Speech
The fastest and highest-quality deep learning powered Sora2 watermark cleaner.

