Skip to content
Merged
Changes from 1 commit
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
a8f79c5
optimization
ishandhanani Nov 24, 2025
dec9f2b
releaes and fix
ishandhanani Nov 24, 2025
de194ec
Merge branch 'main' into ishan/dockerfile-opt
ishandhanani Nov 27, 2025
cf8e30e
go
ishandhanani Nov 27, 2025
9e7fd89
go
ishandhanani Nov 27, 2025
922d1fc
go
ishandhanani Nov 27, 2025
4369be5
bump
ishandhanani Nov 27, 2025
659b00e
rip
ishandhanani Nov 27, 2025
a9368ce
bump
ishandhanani Dec 1, 2025
0ed8083
cmts
ishandhanani Dec 1, 2025
beabe27
Revert
ishandhanani Dec 1, 2025
84d3054
go
ishandhanani Dec 1, 2025
eea0f3d
lint
ishandhanani Dec 1, 2025
8c6fd4b
Merge branch 'main' into ishan/dockerfile-opt
ishandhanani Dec 1, 2025
631d2ed
dockerfile
ishandhanani Dec 2, 2025
8c70df1
raise
ishandhanani Dec 2, 2025
805fef8
Redundant
ishandhanani Dec 2, 2025
4131b85
go
ishandhanani Dec 2, 2025
94b5057
Merge branch 'main' into ishan/dockerfile-opt
ishandhanani Dec 2, 2025
3c08e2d
Merge branch 'main' into ishan/dockerfile-opt
ishandhanani Dec 4, 2025
cccb654
rebase
ishandhanani Dec 4, 2025
6d739b6
go
ishandhanani Dec 4, 2025
ebd574f
fix
ishandhanani Dec 4, 2025
8d41efe
go
ishandhanani Dec 4, 2025
1018cb0
lint
ishandhanani Dec 4, 2025
7e54a12
docker: upgrade docker to 24.04 (#14273)
ishandhanani Dec 4, 2025
e742f29
hopper sbo
ishandhanani Dec 4, 2025
c5cae36
Merge branch 'ishan/dockerfile-opt' of github.com:sgl-project/sglang …
ishandhanani Dec 4, 2025
89262a2
git
ishandhanani Dec 4, 2025
f257f80
path and jit
ishandhanani Dec 4, 2025
471f527
fix triton
ishandhanani Dec 4, 2025
b6f76c1
nccl
ishandhanani Dec 4, 2025
ee26736
release cu13 runtime so users can start testing
ishandhanani Dec 4, 2025
e3446c0
Merge branch 'main' into ishan/dockerfile-opt
ishandhanani Dec 4, 2025
59d0b21
go
ishandhanani Dec 5, 2025
d2247d9
Merge branch 'ishan/dockerfile-opt' of github.com:sgl-project/sglang …
ishandhanani Dec 5, 2025
459af4a
have to use this one for nvcc
ishandhanani Dec 5, 2025
350cd25
nvshmem
ishandhanani Dec 5, 2025
f766b2b
nvshmem
ishandhanani Dec 5, 2025
66dc1cc
small
ishandhanani Dec 5, 2025
6207ca0
comment
ishandhanani Dec 5, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
go
  • Loading branch information
ishandhanani committed Dec 2, 2025
commit 4131b8536c5c0dc361c804055c1862b9e3b9c2a2
29 changes: 1 addition & 28 deletions docker/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -148,20 +148,6 @@ ENV LANG=en_US.UTF-8 \
########################################################
########## Framework Development Image ################
########################################################
#
# PURPOSE: Framework development and SGLang compilation
#
# This stage builds and compiles framework dependencies including:
# - SGLang inference engine with CUDA support
# - DeepEP for distributed inference
# - All necessary build tools and compilation dependencies
# - Framework-level Python packages and extensions
#
# Use this stage when you need to:
# - Build SGLang from source with custom modifications
# - Develop or debug framework-level components
# - Create custom builds with specific optimization flags
#

# Copy local source if building from local
FROM scratch AS local_src
Expand Down Expand Up @@ -436,20 +422,7 @@ WORKDIR /sgl-workspace/sglang
########################################################
########## Runtime Image ##############################
########################################################
#
# PURPOSE: Lightweight production runtime environment
#
# This stage creates a production-ready image containing:
# - Pre-compiled SGLang and DeepEP components
# - Essential runtime dependencies only (no build tools)
# - Optimized for inference workloads and deployment
# - Significantly smaller than the framework image
#
# Use this stage when you need:
# - Production deployment of SGLang
# - Minimal runtime footprint without build tools
# - Ready-to-run inference server environment
#

FROM nvidia/cuda:${CUDA_VERSION}-cudnn-runtime-ubuntu22.04 AS runtime

ARG CUDA_VERSION
Expand Down
Loading