Skip to content

[LLM Inference] Qwen2_Moe Support wint4#9129

Merged
qingqing01 merged 2 commits into
PaddlePaddle:developfrom
CJ77Qi:qwen2_moe_wint4
Sep 14, 2024
Merged

[LLM Inference] Qwen2_Moe Support wint4#9129
qingqing01 merged 2 commits into
PaddlePaddle:developfrom
CJ77Qi:qwen2_moe_wint4

Conversation

@CJ77Qi
Copy link
Copy Markdown
Contributor

@CJ77Qi CJ77Qi commented Sep 12, 2024

PR types

New features

PR changes

Models

Description

Qwen-Moe支持Wint4推理

@paddle-bot
Copy link
Copy Markdown

paddle-bot Bot commented Sep 12, 2024

Thanks for your contribution!

@codecov
Copy link
Copy Markdown

codecov Bot commented Sep 12, 2024

Codecov Report

Attention: Patch coverage is 0% with 8 lines in your changes missing coverage. Please review.

Project coverage is 53.32%. Comparing base (d3302c5) to head (d778c5d).
Report is 239 commits behind head on develop.

Files with missing lines Patch % Lines
...erimental/transformers/fused_transformer_layers.py 0.00% 8 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9129      +/-   ##
===========================================
- Coverage    53.32%   53.32%   -0.01%     
===========================================
  Files          652      652              
  Lines       105436   105442       +6     
===========================================
  Hits         56222    56222              
- Misses       49214    49220       +6     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@qingqing01 qingqing01 merged commit e340457 into PaddlePaddle:develop Sep 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants