Tags · edude03/llama.cpp

b3190

common: fix warning (ggml-org#8036)

* common: fix warning

* Update common/common.cpp

Co-authored-by: slaren <[email protected]>

---------

Co-authored-by: slaren <[email protected]>

Jun 20, 2024
abd894a
zip
tar.gz

b3189

[SYCL] Fix windows build and inference (ggml-org#8003)

* add sycl preset

* fix debug link error. fix windows crash

* update README

Jun 20, 2024
de391e4
zip
tar.gz

b3188

CUDA: stream-k decomposition for MMQ (ggml-org#8018)

* CUDA: stream-k decomposition for MMQ

* fix undefined memory reads for small matrices

Jun 20, 2024
d50f889
zip
tar.gz

b3187

metal : fix `ggml_metal_supports_op` for BF16 (ggml-org#8021)

Currently the Metal backend does not support BF16. `ggml_metal_supports_op` was returning true in these cases, leading to a crash with models converted with `--leave-output-tensor`. This commit checks whether the first few source types are BF16 and returns false if that's the case.
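A minimal sketch of the kind of check described above, not the actual patch. The `Tensor` struct, the `TensorType` enum, the `kMaxSrc` slot count, and the `supports_op_sketch` function are hypothetical stand-ins for illustration, and the number of sources inspected (three here) is an assumption, since the commit message only says "the first few".

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-ins for illustration; these are NOT the real ggml types.
enum class TensorType { F32, F16, BF16 };

constexpr std::size_t kMaxSrc = 4; // assumed number of source slots per op

struct Tensor {
    TensorType type = TensorType::F32;
    std::array<const Tensor *, kMaxSrc> src{}; // unused slots stay nullptr
};

// Reports an op as unsupported when any of its first `n_check` sources is BF16.
// `n_check = 3` is an assumption; the commit only says "the first few".
bool supports_op_sketch(const Tensor & op, std::size_t n_check = 3) {
    for (std::size_t i = 0; i < n_check && i < kMaxSrc; ++i) {
        const Tensor * s = op.src[i];
        if (s != nullptr && s->type == TensorType::BF16) {
            return false; // the backend has no BF16 kernels, so decline the op
        }
    }
    return true; // a real backend would continue with per-op checks here
}
```

Returning false early lets a scheduler route the op to another backend instead of crashing inside an unsupported kernel.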

Jun 20, 2024
2075a66
zip
tar.gz

b3186

server : fix smart slot selection (ggml-org#8020)

Jun 19, 2024
ba58993
zip
tar.gz

b3184

ggml : synchronize threads using barriers (ggml-org#7993)

Jun 19, 2024
9c77ec1
zip
tar.gz

b3183

codecov : remove (ggml-org#8004)

Jun 19, 2024
a04a953
zip
tar.gz

b3182

[SYCL] refactor (ggml-org#6408)

* separate lower precision GEMM from the main files

* fix workgroup size hardcode

Jun 19, 2024
623494a
zip
tar.gz

b3181

tokenizer : BPE fixes (ggml-org#7530)

* Random test: add_bos_token, add_eos_token
* Random test: add BPE models for testing
* Custom regex split fails with codepoint 0
* Fix falcon punctuation regex
* Refactor llm_tokenizer_bpe: move code to constructor
* Move 'add_special_bos/eos' logic to llm_tokenizer_bpe
* Move tokenizer flags to vocab structure.
* Default values for special_add_bos/eos
* Build vocab.special_tokens_cache using vocab token types
* Generalize 'jina-v2' per token attributes
* Fix unicode whitespaces (deepseek-coder, deepseek-llm)
* Skip missing byte tokens (falcon)
* Better unicode data generation
* Replace char32_t with uint32_t

Jun 18, 2024
37bef89
zip
tar.gz

b3180

Only use FIM middle token if it exists (ggml-org#7648)

* Only use FIM middle if it exists

Jun 18, 2024
91c188d
zip
tar.gz
