Fix for Llama4 models #329
Closed
Command
Error
Similar issue
Closest issue, regarding a boolean value being returned: huggingface/transformers#35037
Proposed fix
Based on the HF documentation, the Llama4 AutoTokenizer should work for text-only Llama4; in multimodal cases, we need to use AutoProcessor.
I noticed that omitting `use_fast=False`, or setting `use_fast=True`, in `AutoTokenizer.from_pretrained()` helped get past the error. One option (this PR) is to raise an error so that we switch to `use_fast=True`; another option is to omit `use_fast=False` in the default case.
I don't have enough context to gauge whether the second option is a good idea.
Please advise if this PR makes sense, or whether some other enabling is needed for Llama4 models to fix the issue.
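To illustrate the first option, here is a minimal sketch of the kind of guard this PR proposes. The function name `check_tokenizer_args` and the substring check are hypothetical, not the actual patch; the idea is simply to fail fast with a clear message when `use_fast=False` is passed for a Llama4 checkpoint, since the slow-tokenizer path is what triggers the error.

```python
def check_tokenizer_args(model_name: str, use_fast: bool) -> None:
    """Hypothetical guard mirroring this PR's approach.

    Llama4 checkpoints only work with the fast tokenizer, so
    use_fast=False should be rejected up front with an actionable
    message instead of failing later inside AutoTokenizer.
    """
    if "llama-4" in model_name.lower() and not use_fast:
        raise ValueError(
            f"{model_name} requires use_fast=True in "
            "AutoTokenizer.from_pretrained(); the slow tokenizer "
            "path is not supported for Llama4 models."
        )


# Example: rejects the failing configuration, accepts the working one.
check_tokenizer_args("meta-llama/Llama-4-Scout-17B-16E-Instruct", use_fast=True)
```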
Testing
Llama-4-Scout-17B-16E-Instruct
Llama-4-Maverick-17B-128E-Instruct