Skip to content

Commit cea8926

Browse files
authored
docs: add instruction to download Qwen2-1.5B model (#75)
* docs: add instruction to download Qwen2-1.5B model
1 parent f1f4f98 commit cea8926

File tree

2 files changed

+11
-0
lines changed

2 files changed

+11
-0
lines changed

book/src/week1-05-qwen2-model.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@ Before we start, please make sure you have downloaded the models:
66

77
```bash
88
huggingface-cli download Qwen/Qwen2-0.5B-Instruct-MLX
9+
huggingface-cli download Qwen/Qwen2-1.5B-Instruct-MLX
910
huggingface-cli download Qwen/Qwen2-7B-Instruct-MLX
1011
```
1112

@@ -47,6 +48,7 @@ You should pass all tests for this task by running:
4748
```bash
4849
# Download the models if you haven't done so
4950
huggingface-cli download Qwen/Qwen2-0.5B-Instruct-MLX
51+
huggingface-cli download Qwen/Qwen2-1.5B-Instruct-MLX
5052
huggingface-cli download Qwen/Qwen2-7B-Instruct-MLX
5153
# Run the tests
5254
pdm run test --week 1 --day 5 -- -k task_1
@@ -88,6 +90,7 @@ You should pass all tests for this task by running:
8890
```bash
8991
# Download the models if you haven't done so; we need to tokenizers
9092
huggingface-cli download Qwen/Qwen2-0.5B-Instruct-MLX
93+
huggingface-cli download Qwen/Qwen2-1.5B-Instruct-MLX
9194
huggingface-cli download Qwen/Qwen2-7B-Instruct-MLX
9295
# Run the tests
9396
pdm run test --week 1 --day 5 -- -k task_2
@@ -152,6 +155,7 @@ You should pass all tests for this task by running:
152155
```bash
153156
# Download the models if you haven't done so
154157
huggingface-cli download Qwen/Qwen2-0.5B-Instruct-MLX
158+
huggingface-cli download Qwen/Qwen2-1.5B-Instruct-MLX
155159
huggingface-cli download Qwen/Qwen2-7B-Instruct-MLX
156160
# Run the tests
157161
pdm run test --week 1 --day 5 -- -k task_3

book/src/week1-06-generate-response.md

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -58,8 +58,15 @@ We will optimize the `decode` process to use key-value cache to speed up the gen
5858
You can test your implementation by running the following command:
5959

6060
```bash
61+
# Download the models if you haven't done so
62+
huggingface-cli download Qwen/Qwen2-0.5B-Instruct-MLX
63+
huggingface-cli download Qwen/Qwen2-1.5B-Instruct-MLX
64+
huggingface-cli download Qwen/Qwen2-7B-Instruct-MLX
65+
# Run the tests
6166
pdm run main --solution tiny_llm --loader week1 --model qwen2-0.5b \
6267
--prompt "Give me a short introduction to large language model"
68+
pdm run main --solution tiny_llm --loader week1 --model qwen2-1.5b \
69+
--prompt "Give me a short introduction to large language model"
6370
pdm run main --solution tiny_llm --loader week1 --model qwen2-7b \
6471
--prompt "Give me a short introduction to large language model"
6572
```

0 commit comments

Comments
 (0)