Skip to content
Merged
Changes from 1 commit
Commits
Show all changes
53 commits
Select commit Hold shift + click to select a range
8498429
`main`/`server`: rename to `llama` / `llama-server` for consistency w…
Jun 6, 2024
f298cc6
server: update refs -> llama-server
Jun 6, 2024
f5f19a2
server: simplify nix package
Jun 6, 2024
8b7c734
main: update refs -> llama
Jun 6, 2024
9a03341
main/server: fix targets
Jun 6, 2024
8695bae
update more names
Jun 6, 2024
a0a7f2b
Update build.yml
Jun 6, 2024
fbd8313
Merge remote-tracking branch 'origin/master' into bins
Jun 6, 2024
99df4cc
rm accidentally checked in bins
Jun 7, 2024
7fbe600
update straggling refs
Jun 7, 2024
af8f016
Update .gitignore
Jun 7, 2024
0dba582
Update server-llm.sh
Jun 7, 2024
fe93cc9
Merge remote-tracking branch 'origin/master' into bins
Jun 8, 2024
23d0df5
main: target name -> llama-cli
Jun 8, 2024
ab5efbb
Prefix all example bins w/ llama-
Jun 8, 2024
78bca8c
fix main refs
Jun 8, 2024
10650b6
rename {main->llama}-cmake-pkg binary
Jun 8, 2024
81222f0
prefix more cmake targets w/ llama-
Jun 8, 2024
b648243
add/fix gbnf-validator subfolder to cmake
Jun 8, 2024
eef922e
sort cmake example subdirs
Jun 8, 2024
b0eb3b8
rm bin files
Jun 8, 2024
efaa441
fix llama-lookup-* Makefile rules
Jun 8, 2024
78eae7f
gitignore /llama-*
Jun 8, 2024
347f308
rename Dockerfiles
Jun 8, 2024
5265c15
rename llama|main -> llama-cli; consistent RPM bin prefixes
Jun 10, 2024
daeaeb1
Merge remote-tracking branch 'origin/master' into bins
Jun 10, 2024
0bb2a3f
fix some missing -cli suffixes
Jun 10, 2024
0fcf2c3
rename dockerfile w/ llama-cli
Jun 10, 2024
1cc6514
rename(make): llama-baby-llama
Jun 10, 2024
051633e
update dockerfile refs
Jun 10, 2024
b8cb44e
more llama-cli(.exe)
Jun 10, 2024
4881a94
fix test-eval-callback
Jun 10, 2024
b843639
rename: llama-cli-cmake-pkg(.exe)
Jun 10, 2024
f9cfd04
address gbnf-validator unused fread warning (switched to C++ / ifstream)
Jun 10, 2024
0be5f39
add two missing llama- prefixes
Jun 10, 2024
e7e0373
Updating docs for eval-callback binary to use new `llama-` prefix.
HanClinto Jun 10, 2024
2fd66b2
Updating a few lingering doc references for rename of main to llama-cli
HanClinto Jun 10, 2024
72660c3
Updating `run-with-preset.py` to use new binary names.
HanClinto Jun 10, 2024
70de0de
Updating documentation references for lookup-merge and export-lora
HanClinto Jun 10, 2024
82df7f9
Merge pull request #1 from HanClinto/bins-rename-nits
ochafik Jun 10, 2024
1f5ec2c
Updating two small `main` references missed earlier in the finetune d…
HanClinto Jun 10, 2024
8cf8c12
Update apps.nix
Jun 10, 2024
2a9c4cd
Merge remote-tracking branch 'origin/master' into bins
Jun 11, 2024
166397f
update grammar/README.md w/ new llama-* names
Jun 11, 2024
ee3a086
Merge pull request #2 from HanClinto/bins-nits-2
ochafik Jun 11, 2024
e474ef1
update llama-rpc-server bin name + doc
Jun 11, 2024
be66f9e
Revert "update llama-rpc-server bin name + doc"
Jun 12, 2024
ceb2859
Merge remote-tracking branch 'origin/master' into bins
Jun 12, 2024
08da184
add hot topic notice to README.md
Jun 12, 2024
ecdde74
Update README.md
ochafik Jun 12, 2024
1910241
Update README.md
ochafik Jun 12, 2024
48e5009
rename gguf-split & quantize bins refs in **/tests.sh
Jun 12, 2024
73d4a4a
Merge branch 'bins' of https://github.com/ochafik/llama.cpp into bins
Jun 12, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Update README.md
  • Loading branch information
ochafik authored Jun 12, 2024
commit ecdde745ba6f6a6b25e20e683bb13f3b1efb1bdc
4 changes: 3 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,9 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)

### Hot topics

- > [!IMPORTANT] Binaries have been renamed w/ a `llama-` prefix. `main` is now `llama-cli`, `server` is `llama-server`, etc (https://github.com/ggerganov/llama.cpp/pull/7809)
> [!IMPORTANT]
Binaries have been renamed w/ a `llama-` prefix. `main` is now `llama-cli`, `server` is `llama-server`, etc (https://github.com/ggerganov/llama.cpp/pull/7809)

- **`convert.py` has been deprecated and moved to `examples/convert-legacy-llama.py`, please use `convert-hf-to-gguf.py`** https://github.com/ggerganov/llama.cpp/pull/7430
- Initial Flash-Attention support: https://github.com/ggerganov/llama.cpp/pull/5021
- BPE pre-tokenization support has been added: https://github.com/ggerganov/llama.cpp/pull/6920
Expand Down