Skip to content

Commit 039c061

Browse files
authored
fix: Update eagle_one configs with speculative_model_dir field (#2283)
1 parent 58ad4a2 commit 039c061

File tree

3 files changed

+3
-3
lines changed

3 files changed

+3
-3
lines changed

components/backends/trtllm/engine_configs/llama4/eagle_one_model/eagle_agg.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ disable_overlap_scheduler: true # disable_overlap_scheduler is having acc issue
2424
speculative_config:
2525
decoding_type: Eagle
2626
max_draft_len: 3
27-
pytorch_weights_path: nvidia/Llama-4-Maverick-17B-128E-Eagle3
27+
speculative_model_dir: nvidia/Llama-4-Maverick-17B-128E-Eagle3
2828
eagle3_one_model: true
2929

3030
kv_cache_config:

components/backends/trtllm/engine_configs/llama4/eagle_one_model/eagle_decode.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ disable_overlap_scheduler: true
2626
speculative_config:
2727
decoding_type: Eagle
2828
max_draft_len: 3
29-
pytorch_weights_path: nvidia/Llama-4-Maverick-17B-128E-Eagle3
29+
speculative_model_dir: nvidia/Llama-4-Maverick-17B-128E-Eagle3
3030
eagle3_one_model: True
3131

3232
kv_cache_config:

components/backends/trtllm/engine_configs/llama4/eagle_one_model/eagle_prefill.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ disable_overlap_scheduler: true
2626
speculative_config:
2727
decoding_type: Eagle
2828
max_draft_len: 3
29-
pytorch_weights_path: nvidia/Llama-4-Maverick-17B-128E-Eagle3
29+
speculative_model_dir: nvidia/Llama-4-Maverick-17B-128E-Eagle3
3030
eagle3_one_model: True
3131

3232
kv_cache_config:

0 commit comments

Comments
 (0)