Use OpenAI-compatible `/v1/models` endpoint by default #17689

allozaur · 2025-12-02T12:36:13Z

This PR introduces an update for WebUI to use the OpenAI-compatible endpoint for models by default. Additionally introduces small refactor improvements for services/stores usage.

…models

* refactor: Data fetching via stores * chore: update webui build output * refactor: Use OpenAI compat `/v1/models` endpoint by default to list models * chore: update webui build output * chore: update webui build output

* origin/master: server: strip content-length header on proxy (ggml-org#17734) server: move msg diffs tracking to HTTP thread (ggml-org#17740) examples : add missing code block end marker [no ci] (ggml-org#17756) common : skip model validation when --help is requested (ggml-org#17755) ggml-cpu : remove asserts always evaluating to false (ggml-org#17728) convert: use existing local chat_template if mistral-format model has one. (ggml-org#17749) cmake : simplify build info detection using standard variables (ggml-org#17423) ci : disable ggml-ci-x64-amd-* (ggml-org#17753) common: use native MultiByteToWideChar (ggml-org#17738) metal : use params per pipeline instance (ggml-org#17739) llama : fix sanity checks during quantization (ggml-org#17721) build : move _WIN32_WINNT definition to headers (ggml-org#17736) build: enable parallel builds in msbuild using MTT (ggml-org#17708) ggml-cpu: remove duplicate conditional check 'iid' (ggml-org#17650) Add a couple of file types to the text section (ggml-org#17670) convert : support latest mistral-common (fix conversion with --mistral-format) (ggml-org#17712) Use OpenAI-compatible `/v1/models` endpoint by default (ggml-org#17689) webui: Fix zero pasteLongTextToFileLen to disable conversion being overridden (ggml-org#17445)

allozaur requested review from ggerganov and ngxson December 2, 2025 12:36

ggerganov approved these changes Dec 2, 2025

View reviewed changes

github-actions bot added examples server labels Dec 2, 2025

ngxson approved these changes Dec 2, 2025

View reviewed changes

allozaur added 5 commits December 3, 2025 20:46

refactor: Data fetching via stores

83958cf

chore: update webui build output

c70d86f

refactor: Use OpenAI compat /v1/models endpoint by default to list …

6717731

…models

chore: update webui build output

8cb5839

chore: update webui build output

5e56b5b

allozaur force-pushed the allozaur/models-endpoint-openai-compat branch from a1c81a0 to 5e56b5b Compare December 3, 2025 19:48

allozaur merged commit e9f9483 into ggml-org:master Dec 3, 2025
7 checks passed

allozaur deleted the allozaur/models-endpoint-openai-compat branch December 5, 2025 22:04

gabe-l-hart mentioned this pull request Dec 10, 2025

feat: llama.cpp bump (17f7f4) for SSM performance improvements ollama/ollama#13408

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use OpenAI-compatible `/v1/models` endpoint by default #17689

Use OpenAI-compatible `/v1/models` endpoint by default #17689

Uh oh!

allozaur commented Dec 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Use OpenAI-compatible /v1/models endpoint by default #17689

Use OpenAI-compatible /v1/models endpoint by default #17689

Uh oh!

Conversation

allozaur commented Dec 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Use OpenAI-compatible `/v1/models` endpoint by default #17689

Use OpenAI-compatible `/v1/models` endpoint by default #17689