
Tags: JamePeng/llama-cpp-python

v0.3.16-cu128-AVX2-win-20250913: Simplify the code structure of test.yaml
v0.3.16-cu128-AVX2-linux-20250913: Simplify the code structure of test.yaml
v0.3.16-cu126-AVX2-win-20250913: Simplify the code structure of test.yaml
v0.3.16-cu126-AVX2-linux-20250913: Simplify the code structure of test.yaml
v0.3.16-cu124-AVX2-win-20250913: Simplify the code structure of test.yaml
v0.3.16-cu124-AVX2-linux-20250913: Simplify the code structure of test.yaml
v0.3.16-cu128-AVX2-win-20250831: Sync llama: use FA + max. GPU layers by default
v0.3.16-cu128-AVX2-linux-20250831: Sync llama: use FA + max. GPU layers by default
v0.3.16-cu126-AVX2-win-20250831: Sync llama: use FA + max. GPU layers by default
v0.3.16-cu126-AVX2-linux-20250831: Sync llama: use FA + max. GPU layers by default