[Models] Add Llama-3.2 by DrownFish19 · Pull Request #9199 · PaddlePaddle/PaddleNLP

DrownFish19 · 2024-09-26T02:15:24Z

PR types

New features

PR changes

Models

Description

Add Llama-3.2.

meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instruct
meta-llama/Llama-Guard-3-1B

DrownFish19 · 2024-09-26T02:16:56Z

 |  [Qwen2-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)   | Qwen/Qwen2-Math-1.5B, Qwen/Qwen2-Math-1.5B-Instruct, Qwen/Qwen2-Math-7B, Qwen/Qwen2-Math-7B-Instruct, Qwen/Qwen2-Math-72B, Qwen/Qwen2-Math-72B-Instruct, Qwen/Qwen2-Math-RM-72B                                                                                                                                                                                                               |
 |    [Qwen2.5](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)    | Qwen/Qwen2.5-0.5B, Qwen/Qwen2.5-0.5B-Instruct, Qwen/Qwen2.5-1.5B, Qwen/Qwen2.5-1.5B-Instruct, Qwen/Qwen2.5-3B, Qwen/Qwen2.5-3B-Instruct, Qwen/Qwen2.5-7B, Qwen/Qwen2.5-7B-Instruct, Qwen/Qwen2.5-14B, Qwen/Qwen2.5-14B-Instruct, Qwen/Qwen2.5-32B, Qwen/Qwen2.5-32B-Instruct, Qwen/Qwen2.5-72B, Qwen/Qwen2.5-72B-Instruct                                                                     |
-| [Qwen2.5-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)  | Qwen/Qwen2.5-Math-1.5B, Qwen/Qwen2.5-Math-1.5B-Instruct, Qwen/Qwen2.5-Math-7B, Qwen/Qwen2.5-Math-7B-Instruct, Qwen/Qwen2.5-Math-72B, Qwen/Qwen2.5-Math-72B-Instruct, Qwen/Qwen2.5-Math-RM-72B                                                                                                                                                                                                                                |
+| [Qwen2.5-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)  | Qwen/Qwen2.5-Math-1.5B, Qwen/Qwen2.5-Math-1.5B-Instruct, Qwen/Qwen2.5-Math-7B, Qwen/Qwen2.5-Math-7B-Instruct, Qwen/Qwen2.5-Math-72B, Qwen/Qwen2.5-Math-72B-Instruct, Qwen/Qwen2.5-Math-RM-72B                                                                                                                                                                                                 |


Whitespace change only; the content is unchanged.

DrownFish19 · 2024-09-26T02:17:33Z

@@ -96,9 +97,6 @@ Unified Checkpoint 大模型存储格式在模型参数分布上支持动态扩
 |:---------------------:|:--------:|:------------:|:--------:|:------------:|:------:|:------:|:----------:|
 |                       |          | Basic capability | Sequence parallel |    stage1    | stage2 | stage3 |            |
 |         Llama         |    ✅     |      ✅       |    ✅     |      ✅       |   ✅    |   ✅    |     ✅      |


Unify the different LLaMA and Llama versions.

codecov · 2024-09-26T02:48:30Z

Codecov Report

Attention: Patch coverage is 47.72727% with 23 lines in your changes missing coverage. Please review.

Project coverage is 53.02%. Comparing base (cd4e816) to head (68a5cb1).
Report is 256 commits behind head on develop.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/llama/modeling_pp.py	40.00%	12 Missing ⚠️
paddlenlp/transformers/llama/modeling.py	54.16%	11 Missing ⚠️
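
The per-file figures above are consistent with the overall 47.72727% patch coverage. A minimal sketch of that arithmetic, assuming Codecov defines patch % as covered lines divided by changed lines and that the per-file percentages shown are rounded or truncated (the helper name `changed_lines` is hypothetical, not part of any Codecov API):

```python
# Figures copied from the Codecov table: file -> (patch %, missing lines).
reported = {
    "paddlenlp/transformers/llama/modeling_pp.py": (40.00, 12),
    "paddlenlp/transformers/llama/modeling.py": (54.16, 11),
}


def changed_lines(patch_pct, missing):
    """Infer how many changed lines a file has from its patch % and missing count.

    Assumption: patch % = covered / changed, so missing = changed * (1 - pct / 100).
    The result is rounded because the reported percentage is not exact.
    """
    return round(missing / (1 - patch_pct / 100))


total_changed = sum(changed_lines(pct, miss) for pct, miss in reported.values())
total_missing = sum(miss for _, miss in reported.values())
overall_patch_pct = 100 * (total_changed - total_missing) / total_changed

print(total_changed, total_missing, f"{overall_patch_pct:.5f}")
```

Under these assumptions the two files account for 20 and 24 changed lines, and the 23 missing lines out of 44 reproduce the headline patch coverage.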

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9199      +/-   ##
===========================================
- Coverage    53.06%   53.02%   -0.05%     
===========================================
  Files          656      656              
  Lines       106147   106181      +34     
===========================================
- Hits         56324    56299      -25     
- Misses       49823    49882      +59
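
The project-level numbers in the coverage diff can be cross-checked the same way. This is a sketch only, assuming coverage is hits divided by lines; the dictionary layout and the name `coverage_pct` are illustrative and not taken from Codecov:

```python
# Values copied from the "Coverage Diff" block (base = develop, head = #9199).
base = {"lines": 106147, "hits": 56324, "misses": 49823}
head = {"lines": 106181, "hits": 56299, "misses": 49882}


def coverage_pct(side):
    """Coverage as a percentage, assuming coverage = hits / lines."""
    return 100 * side["hits"] / side["lines"]


# Every tracked line is either a hit or a miss on both sides.
for side in (base, head):
    assert side["hits"] + side["misses"] == side["lines"]

# Differences between head and base, matching the "+/-" column.
deltas = {key: head[key] - base[key] for key in base}

print(round(coverage_pct(base), 2), round(coverage_pct(head), 2), deltas)
```

The rounded percentages match the 53.06% and 53.02% shown. The unrounded difference is about -0.04 points, so the displayed -0.05 presumably comes from Codecov's own rounding.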


add llama3.2

6619c7a


ZHUI previously approved these changes Sep 26, 2024

update for llama3.2

4c30595

DrownFish19 dismissed ZHUI’s stale review via 4c30595 September 26, 2024 03:21

fix jamba

68a5cb1

ZHUI approved these changes Sep 27, 2024

ZHUI merged commit db80bdd into PaddlePaddle:develop Sep 27, 2024

DrownFish19 deleted the dev_20240926_add_llama3.2 branch September 27, 2024 10:00

ZHUI merged 3 commits into PaddlePaddle:develop from DrownFish19:dev_20240926_add_llama3.2