early exit #34244
Conversation
ArthurZucker left a comment
Only missing docs and test, super super nice otherwise!
        inputs_tensor: Optional[torch.Tensor] = None,
        logits_processor: "LogitsProcessorList" = None,
    ):
        # TODO(joao): somehow check whether the model supports early exit
Should we add a `_supports_early_exit` flag? Or check `hasattr(model, "active_layers")`?
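A minimal sketch of what either check floated above could look like; `_supports_early_exit` and `active_layers` are names proposed in this thread, not existing `transformers` attributes:

```python
def assistant_supports_early_exit(model) -> bool:
    """Illustrative only: both attributes below are names floated in the
    review comment above, not existing transformers attributes."""
    # Option A: an explicit class-level opt-in flag on the model.
    if getattr(model, "_supports_early_exit", False):
        return True
    # Option B: infer support from a structural attribute on the model.
    return hasattr(model, "active_layers")
```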
IMO it depends on how the model is structured/trained 👀
- If the model is expected to support early exit at ANY layer, because the lm head is compatible with all layers -> there is no way to detect that unless we manually add an argument to the config, which is... brittle. I would probably suggest not doing any check for now?
- If the model is expected to support early exit only at specific layers, store those layers in the config and check that attribute here (rough sketch below).
WDYT?
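A rough sketch of the second option, assuming a hypothetical `early_exit_layers` attribute on the model config; the attribute name is illustrative and not part of this PR:

```python
def check_early_exit_layer(model, exit_layer: int) -> None:
    # Hypothetical config attribute: the layers whose hidden states the
    # lm head is known to be compatible with.
    supported = getattr(model.config, "early_exit_layers", None)
    if supported is None:
        # No metadata in the config -> nothing reliable to check against.
        return
    if exit_layer not in supported:
        raise ValueError(
            f"Early exit at layer {exit_layer} is not supported by this model "
            f"(supported layers: {supported})."
        )
```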
Commits:
* 😅
* early exit (#34244)
* mvp
* docs and tests
* a few fixes
* no shared cache
* Apply suggestions from code review (Co-authored-by: Mostafa Elhoushi <[email protected]>)
* docs
* make fix-copies
* cohere fix
* [test all]
* [test all] consistent model code copies
* [test all] make fix-copies :D
* Apply suggestions from code review (Co-authored-by: Pedro Cuenca <[email protected]>, Mostafa Elhoushi <[email protected]>)
* Update src/transformers/generation/candidate_generator.py
* Update src/transformers/generation/configuration_utils.py (Co-authored-by: Pedro Cuenca <[email protected]>)
* [test all] don't use a stand-alone attribute; fix test

Co-authored-by: Joao Gante <[email protected]>, Joao Gante <[email protected]>, Mostafa Elhoushi <[email protected]>, Pedro Cuenca <[email protected]>
What does this PR do?