-
Notifications
You must be signed in to change notification settings - Fork 14k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
server: Windows 7 compatibility
build
Compilation issues
examples
Review Complexity : Low
Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
server
#8208
opened Jun 29, 2024 by
Zor-X-L
Loading…
2 of 4 tasks
vulkan : add dynamic VRAM heuristic for low-VRAM GPUs
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17485
opened Nov 25, 2025 by
cafeTechne
Loading…
chat: add defensive IBM Granite Jinja compatibility (<tool_call> and <|tool_call|> support)
#16537
opened Oct 12, 2025 by
ServeurpersoCom
•
Draft
--numa mirror: mirror model weights to every Numa node in the system
Apple Metal
sgemm: reuse loaded vector in AVX dot product calculation
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17648
opened Dec 1, 2025 by
GermanAizek
Loading…
tests(test-backend-ops): Test backend ops verbosity
testing
Everything test related
#17029
opened Nov 5, 2025 by
gabe-l-hart
Loading…
server: add support for local image path loading for server
examples
python
python script changes
server
#16874
opened Oct 30, 2025 by
cchadowitz
Loading…
webui: save model name with conversation history (#13570)
examples
server
#14192
opened Jun 15, 2025 by
deepanshu2015
Loading…
CUDA & CPU: support F32 kernel type for changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
CONV_TRANSPOSE_2D
ggml
#17094
opened Nov 8, 2025 by
AgainstEntropy
Loading…
ggml : enhance rel-pos and window ops with CUDA support
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17383
opened Nov 19, 2025 by
bluebread
Loading…
Add CUDA non-contiguous Unary Ops support
build
Compilation issues
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14639
opened Jul 11, 2025 by
YavorGIvanov
Loading…
logit_bias: apply configurable escalating EOG bias at low n_remain
examples
server
testing
Everything test related
#14229
opened Jun 16, 2025 by
graehl
Loading…
Metal TQ2_0
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12485
opened Mar 20, 2025 by
dmahurin
Loading…
Fix convert script for non-hf GLM4 checkpoints
python
python script changes
#12992
opened Apr 17, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
Introduce Graph Profiler
ggml
changes relating to the ggml tensor library for machine learning
#9659
opened Sep 26, 2024 by
max-krasnyansky
Loading…
model : Fix marker placement for LFM2-VL in single turn llama-mtmd-cli
examples
#17616
opened Nov 30, 2025 by
tdakhran
Loading…
Add PagedAttention support (experimental, CUDA only)
examples
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
Nvidia GPU
Issues specific to Nvidia GPUs
server
#17579
opened Nov 28, 2025 by
ericcurtin
•
Draft
mtmd : Support jinja in libmtmd (Only for QwenVL and Qwen Omni)
examples
#14730
opened Jul 17, 2025 by
alielmorsy
Loading…
GGML: Fix leak of backend buffer memory address in RPC
ggml
changes relating to the ggml tensor library for machine learning
#14882
opened Jul 26, 2025 by
struct
Loading…
feat(batched): Add functionality to upload benchmark test results
examples
#14811
opened Jul 22, 2025 by
MengAiDev
Loading…
cmake : set Compilation issues
RPATH to $ORIGIN on Linux (#13740)
build
#13741
opened May 24, 2025 by
sunhaitao
Loading…
Move page cache via mbind to prevent cross-NUMA access
build
Compilation issues
#13731
opened May 23, 2025 by
vishalc-ibm
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.