
[NPU] Custom fusion operator unification #8431

Merged

wawltor merged 34 commits into PaddlePaddle:develop from Galaxy1458:develop on May 14, 2024

Conversation

@Galaxy1458 (Contributor) commented on May 13, 2024

PR types

Others

PR changes

Models

Description

Custom fusion operator unification: consolidate the custom-device (NPU) fusion operators into a single shared fusion_ops module.
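For context, "unification" here means exposing one helper per fused operation and dispatching to the device-specific kernel inside it. Below is a minimal sketch of that dispatch pattern, assuming PaddleNLP's get_env_device helper and Paddle's fused_rotary_position_embedding API; the custom-op name "fused_rope" and the exact signature are illustrative assumptions, not taken from this PR's diff:

# Sketch of the device-dispatch pattern behind a unified fusion op.
# Assumptions: get_env_device() reports the active device, and an NPU
# custom operator named "fused_rope" has been registered (see the
# CUSTOM_DEVICE_ROOT loading loop later in this conversation).
from paddle.framework import core
from paddle.incubate.nn.functional import fused_rotary_position_embedding

from paddlenlp.utils.tools import get_env_device


def fusion_rope_sketch(query_states, key_states, cos, sin, position_ids):
    if get_env_device() == "npu":
        # Custom-device path: invoke the registered custom kernel directly.
        query_states = core.eager._run_custom_op("fused_rope", query_states, cos, sin)[0]
        key_states = core.eager._run_custom_op("fused_rope", key_states, cos, sin)[0]
    else:
        # Built-in path: Paddle's fused rotary position embedding kernel.
        query_states, key_states, _ = fused_rotary_position_embedding(
            query_states,
            key_states,
            v=None,
            sin=sin,
            cos=cos,
            position_ids=position_ids,
            use_neox_rotary_style=False,
        )
    return query_states, key_states

With a helper like this, model code such as modeling.py calls one function and stays device-agnostic.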

paddle-bot (Bot) commented on May 13, 2024

Thanks for your contribution!

codecov (Bot) commented on May 13, 2024

Codecov Report

Attention: Patch coverage is 23.52941%, with 65 lines in your changes missing coverage. Please review.

Project coverage is 55.42%. Comparing base (17fb497) to head (0a6d6b8).
Report is 1 commit behind head on develop.

Files                                        Patch %   Lines
paddlenlp/transformers/llama/fusion_ops.py   22.50%    62 Missing ⚠️
paddlenlp/transformers/llama/modeling.py     40.00%     3 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8431      +/-   ##
===========================================
- Coverage    55.43%   55.42%   -0.01%     
===========================================
  Files          616      617       +1     
  Lines        96243    96281      +38     
===========================================
+ Hits         53348    53366      +18     
- Misses       42895    42915      +20     


@Galaxy1458 changed the title from "fix" to "[NPU] Custom fusion operator unification" on May 13, 2024
@SylarTiaNII (Contributor) left a comment

LGTM

wawltor previously approved these changes on May 14, 2024

@wawltor (Contributor) left a comment

LGTM

flash_attention = None


def fusion_rope(query_states, key_states, value_states, hidden_states, position_ids, past_key_value, rotary_emb):
Contributor
For functions as long as fusion_rope and fusion_flash_attention, I would not recommend extracting them.

Contributor Author

paddlenlp/transformers/fusion_ops.py has been moved to paddlenlp/transformers/llama/fusion_ops.py.
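After the move, model code would pick up the shared helpers from the llama subpackage; a hedged sketch, where the import form is an assumption based on the new path given above:

# Hypothetical usage after the relocation described in this reply.
from paddlenlp.transformers.llama import fusion_ops

# e.g. fusion_ops.fusion_rope(...) or fusion_ops.fusion_flash_attention(...),
# instead of each model file keeping its own copy of the fused kernels.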


for lib in os.listdir(os.getenv("CUSTOM_DEVICE_ROOT")):
    if lib.endswith(".so"):
        paddle.utils.cpp_extension.extension_utils.load_op_meta_info_and_register_op(lib)
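For reference, a slightly more defensive variant of this loop; the env-var guard and the full-path join are assumptions added here for illustration and are not part of the diff:

import os

import paddle

custom_device_root = os.getenv("CUSTOM_DEVICE_ROOT")
if custom_device_root:  # skip when no custom device package is installed
    for lib in os.listdir(custom_device_root):
        if lib.endswith(".so"):
            # Register every custom operator exported by the shared library.
            paddle.utils.cpp_extension.extension_utils.load_op_meta_info_and_register_op(
                os.path.join(custom_device_root, lib)
            )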
Contributor
Check carefully whether there is any unneeded code, and make sure to delete it.

@wawltor merged commit 05acad5 into PaddlePaddle:develop on May 14, 2024