Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

server: Windows 7 compatibility build Compilation issues examples Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix server
#8208 opened Jun 29, 2024 by Zor-X-L Loading…
2 of 4 tasks
llama : store non-RoPEd K cache demo Demonstrate some concept or idea, not intended to be merged
#3234 opened Sep 17, 2023 by ggerganov Draft
vulkan : add dynamic VRAM heuristic for low-VRAM GPUs documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#17485 opened Nov 25, 2025 by cafeTechne Loading…
--numa mirror: mirror model weights to every Numa node in the system Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Ascend NPU issues specific to Ascend NPUs devops improvements to build systems and github actions examples ggml changes relating to the ggml tensor library for machine learning IBM zDNN issues specific to IBM zDNN Accelerator Nvidia GPU Issues specific to Nvidia GPUs OpenCL Issues specific to the OpenCL backend python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#16000 opened Sep 15, 2025 by dbsanfte Draft
sgemm: reuse loaded vector in AVX dot product calculation ggml changes relating to the ggml tensor library for machine learning vibe-coded Created with heavy use of LLM assistants, requires human verification
#17648 opened Dec 1, 2025 by GermanAizek Loading…
tests(test-backend-ops): Test backend ops verbosity testing Everything test related
#17029 opened Nov 5, 2025 by gabe-l-hart Loading…
CUDA & CPU: support F32 kernel type for CONV_TRANSPOSE_2D ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17094 opened Nov 8, 2025 by AgainstEntropy Loading…
ggml : enhance rel-pos and window ops with CUDA support ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17383 opened Nov 19, 2025 by bluebread Loading…
Add CUDA non-contiguous Unary Ops support build Compilation issues documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#14639 opened Jul 11, 2025 by YavorGIvanov Loading…
Metal TQ2_0 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#12485 opened Mar 20, 2025 by dmahurin Loading…
Fix convert script for non-hf GLM4 checkpoints python python script changes
#12992 opened Apr 17, 2025 by Tianyue-Zhao Loading…
2 of 4 tasks
Introduce Graph Profiler ggml changes relating to the ggml tensor library for machine learning
#9659 opened Sep 26, 2024 by max-krasnyansky Loading…
Add PagedAttention support (experimental, CUDA only) examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs server
#17579 opened Nov 28, 2025 by ericcurtin Draft
GGML: Fix leak of backend buffer memory address in RPC ggml changes relating to the ggml tensor library for machine learning
#14882 opened Jul 26, 2025 by struct Loading…
cmake : set RPATH to $ORIGIN on Linux (#13740) build Compilation issues
#13741 opened May 24, 2025 by sunhaitao Loading…
Move page cache via mbind to prevent cross-NUMA access build Compilation issues
#13731 opened May 23, 2025 by vishalc-ibm Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.