Tags: ggml-org/LlamaBarn

0.30.0

Rename FitParams to MemProfile

The struct, runner, cache, and on-disk cache directory are now named after
what they hold (a per-model memory profile) rather than the tool that
produces them today (llama-fit-params). Also drops the 'fit' prefix from
CatalogEntry.fitResidentBytes, leaving residentBytes.

0.29.1

Fix Swift concurrency warnings in ModelManager and FitParams

0.29.0

Update llama.cpp to b8797

0.28.0

Fix ctxBytesPer1kTokens for Gemma 4 models (account for SWA pattern)

0.27.0

Suppress llama-server's automatic model discovery from HF cache

b8609's router mode auto-scans the HF cache and lists every GGUF it finds, not just models in models.ini. This causes the web UI to show models that don't appear in the app's own UI.

0.26.0

Switch Qwen 3.5 quantized builds from UD-Q4_K_XL to Q4_K_M for consistency with other model families

0.25.0

Improve models folder settings row layout

Show the path inline next to the label with truncation for long paths. Replace the text "Reset" button with a compact restore (↺) button that only appears when using a non-default directory. Both buttons use controlSize(.small) for a tighter fit.

0.24.0

Improve context length info text to explain memory relationship

0.23.0

Update llama.cpp to b7772

0.22.0

Update memory budget formula to totalRAM × 0.75 − 2GB and align with --fit-target
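
As a rough illustration of the 0.22.0 budget formula (totalRAM × 0.75 − 2GB), here is a minimal Swift sketch. It is not LlamaBarn's actual code: the function name `memoryBudgetBytes` is hypothetical, and it assumes "GB" means GiB (2^30 bytes) and that the budget is clamped at zero on very small machines. Neither assumption is stated in the commit message.

```swift
import Foundation

// Assumption: "2GB" in the commit title is read as 2 GiB (2^30 bytes each).
let bytesPerGiB: UInt64 = 1_073_741_824

/// Hypothetical helper: memory budget = totalRAM × 0.75 − 2 GB.
/// Returns 0 when the formula would go negative (an assumed clamp).
func memoryBudgetBytes(totalRAMBytes: UInt64) -> UInt64 {
    // Integer form of × 0.75; truncates by at most 3 bytes.
    let threeQuarters = totalRAMBytes / 4 * 3
    let reserve = 2 * bytesPerGiB
    return threeQuarters > reserve ? threeQuarters - reserve : 0
}

// Example: a machine with 16 GiB of RAM gets a 10 GiB budget.
let budget = memoryBudgetBytes(totalRAMBytes: 16 * bytesPerGiB)
print(budget / bytesPerGiB) // prints "10"
```

On a real machine the input could come from `ProcessInfo.processInfo.physicalMemory`, which reports installed RAM in bytes. How the resulting value is passed to --fit-target is not described here, so that step is left out.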