Skip to content
View mediarl's full-sized avatar

Block or report mediarl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
43 stars written in Python
Clear filter

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 183,425 46,220 Updated Apr 14, 2026

Python tool for converting files and office documents to Markdown.

Python 108,104 6,839 Updated Mar 30, 2026

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.

Python 60,615 10,815 Updated Apr 14, 2026

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,622 9,405 Updated Mar 9, 2026

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,451 4,800 Updated Jun 2, 2025

SoTA open-source TTS

Python 24,302 3,240 Updated Mar 26, 2026

Rembg is a tool to remove images background

Python 22,557 2,272 Updated Apr 8, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 20,013 2,460 Updated Mar 16, 2026

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,256 1,682 Updated Nov 19, 2025

Awesome list of open-source startup alternatives to well-known SaaS products 🚀

Python 18,995 1,041 Updated Sep 3, 2025

Build, Manage and Deploy AI/ML Systems

Python 10,033 1,253 Updated Apr 13, 2026

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,702 1,615 Updated Jun 26, 2024

More relighting!

Python 8,409 524 Updated Feb 20, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,188 819 Updated Mar 5, 2025

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,833 560 Updated Jul 11, 2024

Towards Human-Sounding Speech

Python 6,081 518 Updated Dec 5, 2025

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 5,516 472 Updated May 12, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,065 335 Updated Aug 14, 2025

A TTS that fits in your CPU (and pocket)

Python 3,958 446 Updated Apr 13, 2026

[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 2,554 339 Updated Mar 5, 2026

The collection of pre-trained, state-of-the-art AI models for ailia SDK

Python 2,340 359 Updated Apr 14, 2026

first base model for full-duplex conversational audio

Python 1,783 112 Updated Jan 5, 2025

Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

Python 1,776 138 Updated Dec 31, 2025

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,624 126 Updated Jan 26, 2026

Make text LLMs listen and speak

Python 1,271 219 Updated Mar 26, 2026

Ink/Stitch: an Inkscape extension for machine embroidery design

Python 1,240 225 Updated Apr 14, 2026

Soprano: Instant, Ultra-Realistic Text-to-Speech

Python 1,218 108 Updated Jan 15, 2026

The fastest and highest-quality deep learning powered Sora2 watermark cleaner.

Python 1,152 247 Updated Apr 14, 2026
Python 1,143 86 Updated Sep 26, 2023
Next