vivo (Hangzhou, Zhejiang, China, UTC +08:00)
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Making large AI models cheaper, faster and more accessible
SGLang is a high-performance serving framework for large language models and multimodal models.
Code samples for my book "Neural Networks and Deep Learning"
Hackable and optimized Transformers building blocks, supporting composable construction.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
An open-source AI video workspace driven by AI Agents: novel → character/scene/prop design → script → storyboard → video, with cross-shot character and scene consistency. Powered by Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI.
https://wavespeed.ai/ An inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
A minimalist SOTA LaTeX OCR model with only 20M parameters, running in the browser. Full training pipeline available for self-reproduction.
A low-latency & high-throughput serving engine for LLMs
[CVPR 2025] FaithDiff for Classic Film Rejuvenation, Old Photo Revival, Social Media Restoration, Image Enhancement and AIGC Enhancement.
A memory efficient DLRM training solution using ColossalAI

