Skip to content

Conversation

@utsumi-fj
Copy link
Contributor

Fixes #10262.

This PR fixes the documentation bug described in #10262, which was closed automatically by the stale bot.

The current Docker example:

docker run -v /path/to/models:/models \
  -p 8000:8000 \
  ghcr.io/ggml-org/llama.cpp:server \
  -m /models/7B/ggml-model-q4_0.gguf \
  --port 8000 \
  --host 0.0.0.0 \
  -n 512

causes the container health check to fail and the container status to become unhealthy.

The reason is that the example configures the server to listen on port 8000 (--port 8000), while the container image’s HEALTHCHECK is hard-coded to probe port 8080. This PR updates the documentation to use port 8080 so that the health check works correctly.

@github-actions github-actions bot added the documentation Improvements or additions to documentation label Dec 10, 2025
@taronaeo taronaeo merged commit e4ae383 into ggml-org:master Dec 11, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Bug: docker sample usage will always trigger unhealty container status

2 participants