-
University of Science and Technology of China
Highlights
- Pro
Popular repositories Loading
-
-
nnfusion
nnfusion PublicForked from zheng-ningxin/nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
C++
-
sputnik
sputnik PublicForked from google-research/sputnik
A library of GPU kernels for sparse matrix operations.
C++
-
Structure-LTH
Structure-LTH PublicForked from VITA-Group/Structure-LTH
[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wang, Zhangyang Wang.
Cuda
-
-
TensorRT-LLM
TensorRT-LLM PublicForked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++
If the problem persists, check the GitHub status page or contact support.