Skip to content

Cherry pick Bump to pytorch 25.05 container along with TE update (13899) into r2.4.0#14145

Merged
chtruong814 merged 2 commits intor2.4.0from
cherry-pick-13899-r2.4.0
Jul 7, 2025
Merged

Cherry pick Bump to pytorch 25.05 container along with TE update (13899) into r2.4.0#14145
chtruong814 merged 2 commits intor2.4.0from
cherry-pick-13899-r2.4.0

Conversation

@ko3n1g
Copy link
Contributor

@ko3n1g ko3n1g commented Jul 6, 2025

beep boop [🤖]: Hi @chtruong814 👋,

we've cherry picked #13899 into  for you! 🚀

Please review and approve this cherry pick by your convenience!

* Update base container to be pytorch:25.05-py3

Signed-off-by: Charlie Truong <[email protected]>

* Update TE to 2.4

Signed-off-by: Charlie Truong <[email protected]>

* Remove torch accelerator patch

Signed-off-by: Charlie Truong <[email protected]>

* Update triton patch

Signed-off-by: Charlie Truong <[email protected]>

* Bump TE and Mcore commits

Signed-off-by: Charlie Truong <[email protected]>

* Fix triton patch

Signed-off-by: Charlie Truong <[email protected]>

* Fix triton patch

Signed-off-by: Charlie Truong <[email protected]>

* No fail fast

Signed-off-by: Charlie Truong <[email protected]>

* Update trt-llm to 0.20.0

Signed-off-by: Charlie Truong <[email protected]>

* Fix test_sched_config_parse_reduce_on_plateau

Signed-off-by: Charlie Truong <[email protected]>

* Add no build isolation to TE

Signed-off-by: Charlie Truong <[email protected]>

* Update trt-llm dependencies

Signed-off-by: Charlie Truong <[email protected]>

* Update manifest

Signed-off-by: Charlie Truong <[email protected]>

* Revert "Enable LoRA for TELinear layers (#13929)"

This reverts commit 7d9f40f.

* update mcore with wd_mult key fix

Signed-off-by: oliver könig <[email protected]>

* Revert "Revert "Enable LoRA for TELinear layers (#13929)""

This reverts commit 5a1da6c.

Signed-off-by: Charlie Truong <[email protected]>

* Fix nemo install

Signed-off-by: Charlie Truong <[email protected]>

* Fix nemo install

Signed-off-by: Charlie Truong <[email protected]>

* Fix export image build

Signed-off-by: Charlie Truong <[email protected]>

* Remove unnecessary sed for torch_tensorrt

Signed-off-by: Charlie Truong <[email protected]>

* Update TE and Mcore commits

Signed-off-by: Charlie Truong <[email protected]>

* Add optional tests

Signed-off-by: Charlie Truong <[email protected]>

* Fix install

Signed-off-by: Charlie Truong <[email protected]>

* Ensure test script arg types are correct for top_p and top_k

Signed-off-by: Charlie Truong <[email protected]>

* Increase export deploy timeouts

Signed-off-by: Charlie Truong <[email protected]>

* Skip failing test_rnnt_logprobs_random after pytorch bump

Signed-off-by: Charlie Truong <[email protected]>

* Skip coverage artifact config-3.12.py

Signed-off-by: Charlie Truong <[email protected]>

* Include more config files ot exclude during coverage

Signed-off-by: Charlie Truong <[email protected]>

* Update dependencies

Signed-off-by: Charlie Truong <[email protected]>

* Ensure top_p is float in nemo_export test script

Signed-off-by: Charlie Truong <[email protected]>

* Set Optional_L2_Speech_Batch_Size_OOMptimizer_Canary to truly be optional

Signed-off-by: Charlie Truong <[email protected]>

* Fix top_k and top_p types in megatronllm_deployable

Signed-off-by: Charlie Truong <[email protected]>

* Revert "Skip failing test_rnnt_logprobs_random after pytorch bump"

This reverts commit c6c3a76.

Signed-off-by: Charlie Truong <[email protected]>

* Fix optional export test

Signed-off-by: Charlie Truong <[email protected]>

* Revert unnecessary changes

Signed-off-by: Charlie Truong <[email protected]>

---------

Signed-off-by: Charlie Truong <[email protected]>
Signed-off-by: oliver könig <[email protected]>
Co-authored-by: Alexandros Koumparoulis <[email protected]>
Co-authored-by: oliver könig <[email protected]>
@chtruong814 chtruong814 merged commit 2afaf47 into r2.4.0 Jul 7, 2025
31 of 33 checks passed
@chtruong814 chtruong814 deleted the cherry-pick-13899-r2.4.0 branch July 7, 2025 13:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants