Load Compressed Tensors Models with run_compressed=False #923
Conversation
| "create_fake_dataloader", | ||
| "POSSIBLE_TOKENIZER_FILES", | ||
| "download_repo_from_huggingface_hub", | ||
| "detect_last_checkpoint", |
alphabetical order
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.
kylesayrs
left a comment
Beautiful
    config.quantization_config["run_compressed"] = False

    return AutoModelForCausalLM.from_pretrained(
        pretrained_model_name_or_path, config=config, **kwargs
    )
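The override in the snippet above can be sketched as a small standalone helper. The helper name `force_decompress` is hypothetical (the PR patches the config inline); it is only meant to illustrate flipping `run_compressed` before the config reaches `from_pretrained`:

```python
# Hedged sketch, not the PR's actual API: override run_compressed in a
# config's quantization_config so the HFQuantizer decompresses weights at
# load time instead of keeping CompressedLinear modules.
def force_decompress(config_dict: dict) -> dict:
    quant_config = config_dict.get("quantization_config")
    if quant_config is not None:
        # Flag read on the HFQuantizer side when loading the checkpoint.
        quant_config["run_compressed"] = False
    return config_dict
```

In the PR itself the same flag is set on the `AutoConfig` object directly before it is passed to `AutoModelForCausalLM.from_pretrained`.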
Won't we get the error you were seeing unless we set the status to Frozen in HFQuantizer? In which case, won't the test below error out?
Yes, good point. Setting the module status to frozen will be triggered from the HFQuantizer side.
If this will error until then, we should wait until your change lands in HF and keep this in draft form.
Yes, wait until it's merged.
Co-authored-by: Kyle Sayers <[email protected]>
dsikka
left a comment
Can you set this to draft, since it's blocked on HF?
@dsikka
Don't need this anymore, closing this PR.
SUMMARY:
Load Compressed Tensors Models with run_compressed=False
TEST PLAN:
Added tests to check that models loaded with run_compressed=False (the new pathway) do not contain any CompressedLinear modules.
Added tests to check that models loaded with run_compressed=True (the AutoModelForCausalLM pathway) do contain CompressedLinear modules.
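The test plan above boils down to walking the loaded model's module tree and asserting on the presence of CompressedLinear layers. A rough sketch of that idea, with `FakeModule` as a toy stand-in for `torch.nn.Module` so the example is self-contained (the PR's real tests presumably iterate a real model's `named_modules()`):

```python
# Hedged sketch of the test idea, not the PR's actual test code.
class FakeModule:
    """Toy stand-in for torch.nn.Module with a named_modules() walk."""

    def __init__(self, **children):
        self._children = children

    def named_modules(self, prefix=""):
        yield prefix, self
        for name, child in self._children.items():
            full = f"{prefix}.{name}" if prefix else name
            yield from child.named_modules(full)


class CompressedLinear(FakeModule):
    """Stand-in for the compressed-tensors CompressedLinear layer."""


def has_compressed_linear(model) -> bool:
    # True if any module in the tree is a CompressedLinear.
    return any(type(m).__name__ == "CompressedLinear"
               for _, m in model.named_modules())
```

With run_compressed=False the test would assert `not has_compressed_linear(model)`; with run_compressed=True it would assert the opposite.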