Qlin00

Follow

Junqing Lin Qlin00

Follow

University of Science and Technology of China

1 follower · 1 following

University of Science and Technology of China

Highlights

Pro

Popular repositories Loading

ppsdnn ppsdnn Public

Performance Prediction of Sparse DNN Inference

Python
nnfusion nnfusion Public

Forked from zheng-ningxin/nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++
sputnik sputnik Public

Forked from google-research/sputnik

A library of GPU kernels for sparse matrix operations.

C++
Structure-LTH Structure-LTH Public

Forked from VITA-Group/Structure-LTH

[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wang, Zhangyang Wang.

Cuda
llama.cpp llama.cpp Public

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++
TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++