diff --git a/README.md b/README.md index 580593ed5..9183157af 100644 --- a/README.md +++ b/README.md @@ -68,7 +68,7 @@ pip3 install -e ".[model_worker,webui]" ### Vicuna Weights [Vicuna](https://lmsys.org/blog/2023-03-30-vicuna/) is based on Llama 2 and should be used under Llama's [model license](https://github.com/facebookresearch/llama/blob/main/LICENSE). -You can use the commands below to start chatting. It will automatically download the weights from Hugging Face repos. +You can use the commands below to start chatting. It will automatically download the weights from Hugging Face repos. Downloaded models are stored in a .cache folder in user's home folder ~. For example `~/.cache/huggingface/hub/`. See more command options and how to handle out-of-memory in the "Inference with Command Line Interface" section below. **NOTE: `transformers>=4.31` is required for 16K versions.**