Skip to content

[Models] Add Llama-3.2#9199

Merged
ZHUI merged 3 commits into
PaddlePaddle:developfrom
DrownFish19:dev_20240926_add_llama3.2
Sep 27, 2024
Merged

[Models] Add Llama-3.2#9199
ZHUI merged 3 commits into
PaddlePaddle:developfrom
DrownFish19:dev_20240926_add_llama3.2

Conversation

@DrownFish19
Copy link
Copy Markdown
Collaborator

PR types

New features

PR changes

Models

Description

Add Llama-3.2.

  • meta-llama/Llama-3.2-1B
  • meta-llama/Llama-3.2-1B-Instruct
  • meta-llama/Llama-3.2-3B
  • meta-llama/Llama-3.2-3B-Instruct
  • meta-llama/Llama-Guard-3-1B

Comment thread README.md
| [Qwen2-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/) | Qwen/Qwen2-Math-1.5B, Qwen/Qwen2-Math-1.5B-Instruct, Qwen/Qwen2-Math-7B, Qwen/Qwen2-Math-7B-Instruct, Qwen/Qwen2-Math-72B, Qwen/Qwen2-Math-72B-Instruct, Qwen/Qwen2-Math-RM-72B |
| [Qwen2.5](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/) | Qwen/Qwen2.5-0.5B, Qwen/Qwen2.5-0.5B-Instruct, Qwen/Qwen2.5-1.5B, Qwen/Qwen2.5-1.5B-Instruct, Qwen/Qwen2.5-3B, Qwen/Qwen2.5-3B-Instruct, Qwen/Qwen2.5-7B, Qwen/Qwen2.5-7B-Instruct, Qwen/Qwen2.5-14B, Qwen/Qwen2.5-14B-Instruct, Qwen/Qwen2.5-32B, Qwen/Qwen2.5-32B-Instruct, Qwen/Qwen2.5-72B, Qwen/Qwen2.5-72B-Instruct |
| [Qwen2.5-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/) | Qwen/Qwen2.5-Math-1.5B, Qwen/Qwen2.5-Math-1.5B-Instruct, Qwen/Qwen2.5-Math-7B, Qwen/Qwen2.5-Math-7B-Instruct, Qwen/Qwen2.5-Math-72B, Qwen/Qwen2.5-Math-72B-Instruct, Qwen/Qwen2.5-Math-RM-72B |
| [Qwen2.5-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/) | Qwen/Qwen2.5-Math-1.5B, Qwen/Qwen2.5-Math-1.5B-Instruct, Qwen/Qwen2.5-Math-7B, Qwen/Qwen2.5-Math-7B-Instruct, Qwen/Qwen2.5-Math-72B, Qwen/Qwen2.5-Math-72B-Instruct, Qwen/Qwen2.5-Math-RM-72B |
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

空格变化,内容没有变化

Comment thread README.md
@@ -96,9 +97,6 @@ Unified Checkpoint 大模型存储格式在模型参数分布上支持动态扩
|:---------------------:|:--------:|:------------:|:--------:|:------------:|:------:|:------:|:----------:|
| | | 基础能力 | 序列并行 | stage1 | stage2 | stage3 | |
| Llama | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

统一LLaMA和Llama不同版本

ZHUI
ZHUI previously approved these changes Sep 26, 2024
@codecov
Copy link
Copy Markdown

codecov Bot commented Sep 26, 2024

Codecov Report

Attention: Patch coverage is 47.72727% with 23 lines in your changes missing coverage. Please review.

Project coverage is 53.02%. Comparing base (cd4e816) to head (68a5cb1).
Report is 256 commits behind head on develop.

Files with missing lines Patch % Lines
paddlenlp/transformers/llama/modeling_pp.py 40.00% 12 Missing ⚠️
paddlenlp/transformers/llama/modeling.py 54.16% 11 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9199      +/-   ##
===========================================
- Coverage    53.06%   53.02%   -0.05%     
===========================================
  Files          656      656              
  Lines       106147   106181      +34     
===========================================
- Hits         56324    56299      -25     
- Misses       49823    49882      +59     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@ZHUI ZHUI merged commit db80bdd into PaddlePaddle:develop Sep 27, 2024
@DrownFish19 DrownFish19 deleted the dev_20240926_add_llama3.2 branch September 27, 2024 10:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants