Mini-SGLang

A simple version of sglang project. For study purpose.

Roadmap

Basic Architecture

Tokenizer & Detokenizer & Scheduler procs
Model Runner forward
Zmq IPC for worker/control reqs
Server & APIs

Memory Management

Scheduler

FSFS/Random
Cache Aware(LPM)
Aggressive max_new_tokens prediction & Retracting
Chunked Prefill

Backend

Torch Native kernels
FA3 support

Distributed Support

Models

Llama 3
Mixtral MoE

Optimizations

Stream Output
CUDA graph forward
Overlap Scheduling
Unit tests

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
3rdparaty		3rdparaty
docs		docs
minisglang		minisglang
scripts		scripts
test		test
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mini-SGLang

Roadmap

Basic Architecture

Memory Management

Scheduler

Backend

Distributed Support

Models

Optimizations

About

Uh oh!

Releases

Packages

Languages

CrazyDave999/Mini-SGLang

Folders and files

Latest commit

History

Repository files navigation

Mini-SGLang

Roadmap

Basic Architecture

Memory Management

Scheduler

Backend

Distributed Support

Models

Optimizations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages