[LLM INFER] not use gemm_dequant default and fix bug #9498

Merged: ZHUI merged 2 commits into PaddlePaddle:develop from yuanlehome:llm_infer_bugs on Nov 26, 2024

@yuanlehome (Collaborator) commented Nov 26, 2024

PR types

Bug fixes

PR changes

Others

Description

Do not use gemm_dequant by default, and fix a bug.

Error raised by the bug:

line 265, in get_block_shape_and_split_kv_block
    outs = _C_ops._run_custom_op("get_block_shape_and_split_kv_block", seq_lens_encoder,seq_lens_decoder,max_enc_len_this_time,seq_lens_this_time,cum_offsets,encoder_block_shape_q,decoder_block_shape_q,group_size,block_size,decoder_step_token_num)
TypeError: (InvalidType) argument (position 11) must be int, but got tuple (at ../paddle/fluid/pybind/eager_utils.cc:185)
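
The error above means an attribute that the custom op expects as a plain int arrived as a tuple. The PR text does not say how the tuple was produced, so the following is only a minimal sketch of one common Python cause (a stray trailing comma) and of a defensive check before the call. The helper `as_int_attr` is hypothetical and is not part of PaddleNLP or Paddle.

```python
def as_int_attr(name, value):
    """Return `value` as an int, unwrapping a one-element tuple or list.

    Hypothetical helper for illustration only; not a PaddleNLP or Paddle API.
    """
    if isinstance(value, (tuple, list)):
        if len(value) != 1:
            raise TypeError(
                f"{name} must be int, but got {type(value).__name__} "
                f"of length {len(value)}"
            )
        value = value[0]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError(f"{name} must be int, but got {type(value).__name__}")
    return value


# Assumed cause for illustration: a trailing comma turns an int into a tuple.
decoder_step_token_num = 5,
assert isinstance(decoder_step_token_num, tuple)

# Normalizing the value before passing it on yields a plain int again.
decoder_step_token_num = as_int_attr("decoder_step_token_num", decoder_step_token_num)
assert decoder_step_token_num == 5
```

Checking the type on the Python side makes the failure point at the offending variable by name rather than at an argument position inside the op call.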

@paddle-bot (Bot) commented Nov 26, 2024

Thanks for your contribution!

@CLAassistant commented Nov 26, 2024

CLA assistant check
All committers have signed the CLA.

@codecov (Bot) commented Nov 26, 2024

Codecov Report

Attention: Patch coverage is 0% with 7 lines in your changes missing coverage. Please review.

Project coverage is 52.95%. Comparing base (0b4b810) to head (8fad937).
Report is 215 commits behind head on develop.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...erimental/transformers/fused_transformer_layers.py | 0.00% | 4 Missing ⚠️ |
| ...lp/experimental/transformers/qwen2_moe/modeling.py | 0.00% | 3 Missing ⚠️ |

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9498      +/-   ##
===========================================
+ Coverage    52.84%   52.95%   +0.11%     
===========================================
  Files          688      688              
  Lines       109378   109381       +3     
===========================================
+ Hits         57797    57922     +125     
+ Misses       51581    51459     -122     


@ZHUI ZHUI merged commit f5ca96e into PaddlePaddle:develop Nov 26, 2024