
Conversation


@vfdev-5 vfdev-5 commented Jul 16, 2025

What does this PR do?

  • Added a training script for the gemma example based on the lm1b_nnx example:
    • Added training code from lm1b_nnx
    • Added support for distributed training via sharding, tested on 2 GPUs and on a TPU VM with 4 devices.
    • Added a mixed precision config
    • Added support for multiple samples per sequence, as in lm1b (nothing to modify in the attention layer; just use an appropriate attention mask and provide shifted data)
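The last bullet can be sketched with a small framework-agnostic helper (a hypothetical `packed_causal_mask`, not code from this PR): packing several samples into one sequence only needs a mask that is causal within each sample and blocks attention across samples, plus the usual next-token shift of the data.

```python
import numpy as np

def packed_causal_mask(segment_ids):
    """Boolean [seq, seq] mask for packed sequences: position i may attend
    to position j iff j <= i (causal) and both tokens belong to the same
    packed sample (same segment id)."""
    seg = np.asarray(segment_ids)
    causal = np.tril(np.ones((seg.size, seg.size), dtype=bool))
    same_sample = seg[:, None] == seg[None, :]
    return causal & same_sample

def shift_for_next_token(tokens):
    """Standard next-token setup: inputs are tokens[:-1], targets tokens[1:]."""
    tokens = np.asarray(tokens)
    return tokens[:-1], tokens[1:]
```

For example, `packed_causal_mask([1, 1, 2, 2])` lets position 1 attend to position 0 (same sample, causal) but blocks positions 2 and 3 from attending to the first sample's tokens.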

Addresses #4740

  • 2 GPUs training logs, gemma3-1b model config: link (1000 iters)
  • TPU v4-8 training logs, gemma3-1b config: link (40000 iters)

@vfdev-5 vfdev-5 force-pushed the add-train-script-gemma-example branch from 0c16694 to 2eb7baa Compare July 17, 2025 00:12
@vfdev-5 vfdev-5 marked this pull request as ready for review July 17, 2025 08:04
@vfdev-5 vfdev-5 requested a review from IvyZX July 17, 2025 08:04
@copybara-service copybara-service bot merged commit 97f4a49 into main Jul 22, 2025
19 of 20 checks passed
@copybara-service copybara-service bot deleted the add-train-script-gemma-example branch July 22, 2025 20:01
