Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Modern Bert Support model Model specific python python script changes
#15641 opened Aug 28, 2025 by ryan-mangeno Loading…
llama : add llama_batch_ext android Issues specific to Android examples python python script changes server
#11875 opened Feb 14, 2025 by ngxson Loading…
llama: Attempt to add ModernBert model Model specific python python script changes
#14014 opened Jun 4, 2025 by huydt84 Loading…
add FP8 support to gguf/llama: build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning script Script related Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes testing Everything test related
#10055 opened Oct 26, 2024 by Djip007 Draft
1 of 3 tasks
Implement SparseK Attention mechanism — new GGML operator with CPU backend (GPU planned next) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#16817 opened Oct 28, 2025 by yael-works Loading…
tool: add convertation of text/parquet to custom format build Compilation issues examples
#14622 opened Jul 10, 2025 by lexasub Loading…
model : add LLADA 2.0 diffusion support examples model Model specific python python script changes
#17454 opened Nov 23, 2025 by wsbagnsv1 Draft
Implementation of a sequence repetition penalty sampler enhancement New feature or request generation quality Quality of model output need feedback Testing and feedback with results are needed
#2593 opened Aug 12, 2023 by KerfuffleV2 Draft
llama : second attempt to refactor vision API examples python python script changes server
#11292 opened Jan 18, 2025 by ngxson Draft
1 of 5 tasks
mtmd: Add DeepSeekOCR Support examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17400 opened Nov 20, 2025 by sfallah Loading…
llama-cli: add support for reasoning examples
#16603 opened Oct 16, 2025 by bandoti Loading…
sampling : add support for backend sampling Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server testing Everything test related
#17004 opened Nov 4, 2025 by danbev Loading…
17 of 25 tasks
WIP: Add model merge example demo Demonstrate some concept or idea, not intended to be merged help wanted Needs help from the community
#5741 opened Feb 26, 2024 by ngxson Draft
cuda : Add conv2d Implicit GEMM ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15805 opened Sep 4, 2025 by bssrdf Loading…
[MPI] Add support for per-node options, thread counts, and layer allocations build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning server
#3334 opened Sep 26, 2023 by AutonomicPerfectionist Draft
2 of 5 tasks
Implement llama-pull tool examples
#16423 opened Oct 4, 2025 by ericcurtin Loading…
CANN: add support for partial RoPE and Vision mode Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17543 opened Nov 27, 2025 by noemotiovon Loading…
support MiniCPM-V-2 demo Demonstrate some concept or idea, not intended to be merged enhancement New feature or request examples python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#6919 opened Apr 26, 2024 by Achazwl Loading…
Feature/kimi linear support ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17592 opened Nov 29, 2025 by cacaview Loading…
Layer skipping/self-speculation demo demo Demonstrate some concept or idea, not intended to be merged research 🔬
#3565 opened Oct 10, 2023 by KerfuffleV2 Draft
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.