Stars
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Collection of JET's Vapoursynth packages for video filtering
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Industry leading face manipulation platform
guocuixia / lama-cleaner-
Forked from Sanster/IOPaint. Watermark removal. Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures. It generates the cleaned image or video files at the original resolution and runs locally, with no third-party API required.
A repository collecting image and video upscaling resources as well as my own super resolution models.
A node.js version management utility for Windows. Ironically written in Go.
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved. Supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/MCP/Docker/Zotero.
Generate any location from the real world in Minecraft with a high level of detail.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
A Python script to decrypt and demux Honkai: Star Rail cutscene videos
Office Tool Plus localization projects.
Split-screen video comparison tool using FFmpeg and SDL2
GUI for upscaling ONNX models with NVIDIA TensorRT and Vapoursynth
🧸 "One Last Image" Louvre generator: generates images in the style of the One Last Kiss cover
Mod loader for Hatsune Miku: Project DIVA Mega Mix+
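A 2023 collection of the latest audio/video learning materials: projects (debuggable), an FFmpeg command manual, articles, codec papers, video walkthroughs, and a full set of interview questions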
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
A super great audio/video source and FFmpeg wrapper
Subtitle source files from Nekomoe Kissaten. Should there be any issues, please create them in this main repository first.
The subtitle files in this repository are created and shared by Haruhana Funsub. If you find any errors, please feel free to report them via issues, forms, or email.
GUI for a Vocal Remover that uses Deep Neural Networks.


