🎯
Focusing
  • Tongji University
  • Shanghai

45 stars written in Python

Public repository for Agent Skills

Python 141,031 16,666 Updated May 19, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 78,564 10,489 Updated May 19, 2026

One minute of voice data is enough to train a good TTS model (few-shot voice cloning).

Python 57,782 6,294 Updated Apr 30, 2026

"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/

Python 40,500 3,829 Updated May 23, 2026

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,903 5,217 Updated Mar 3, 2026

Let us control diffusion models!

Python 33,897 3,014 Updated Feb 25, 2024

AI agent assistant and development framework that integrates many IM platforms, LLMs, plugins, and AI features, and can serve as your openclaw alternative. ✨

Python 33,137 2,277 Updated May 26, 2026

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 24,795 1,850 Updated Mar 13, 2025

A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent

Python 24,637 1,651 Updated May 26, 2026

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Python 8,834 775 Updated Dec 10, 2023

Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms

Python 7,857 1,028 Updated May 15, 2026

SD-Trainer. LoRA & Dreambooth training scripts and GUI for diffusion models, built on kohya-ss's trainer.

Python 6,033 694 Updated Sep 8, 2025

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 5,021 730 Updated Jan 21, 2025
Python 4,433 408 Updated Sep 27, 2024

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Python 4,324 395 Updated Jan 2, 2024

A singing voice timbre conversion model based on VITS and SoftVC

Python 3,786 27 Updated Oct 19, 2024

The PyTorch-based audio source separation toolkit for researchers

Python 2,567 447 Updated May 13, 2026

Graphormer is a general-purpose deep learning backbone for molecular modeling.

Python 2,450 375 Updated Jun 7, 2024

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Python 2,215 422 Updated Jul 11, 2024

A recreation of Neuro-Sama originally created in 7 days.

Python 1,948 218 Updated Jan 17, 2025

Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch

Python 1,740 346 Updated Mar 8, 2023

[SIGGRAPH 2025] One Model to Rig Them All: Diverse Skeleton Rigging with UniRig

Python 1,569 150 Updated Apr 22, 2026

[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"

Python 1,391 174 Updated Mar 14, 2026

A toolbox for skeleton-based action recognition.

Python 1,238 226 Updated Feb 19, 2026

Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Python 954 61 Updated Sep 25, 2025

Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)

Python 831 42 Updated May 20, 2024
Python 798 168 Updated Aug 16, 2023

Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer (NeurIPS 2019)

Python 665 104 Updated Jul 25, 2024