pass additional info for fix untrained tokens when using distributed + offloading #2388

winglian · 2025-03-06T16:53:51Z

Description

Fix untrained tokens doesn't quite work properly when using distributed offloading as the embeddings need to be gathered, but there isn't enough information in the model that the function can determine this, so we need to pass this to the function.

https://github.com/axolotl-ai-cloud/axolotl-contribs-lgpl/pulls

…+ offloading

winglian mentioned this pull request Mar 6, 2025

handle distributed embeddings axolotl-ai-cloud/axolotl-contribs-lgpl#4

Merged

pass additional info for fix untrained tokens when using distributed …

310c273

…+ offloading

winglian force-pushed the fix-untrained-w-zero3 branch from 6305227 to 310c273 Compare March 7, 2025 14:00

winglian added 6 commits March 7, 2025 11:15

use latest version of vendored lib

c31fe9b

use v0.0.5 of contribs lgpl

5f7fe93

fix for no bad tokens and add tests

d70e0fa

use release

814604b

add multigpu test too

7873c05

make sure the multigpu zero3 test actually uses zero3

3900ce8

winglian merged commit 59899b9 into main Mar 11, 2025
16 checks passed

winglian deleted the fix-untrained-w-zero3 branch March 11, 2025 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

pass additional info for fix untrained tokens when using distributed + offloading #2388

pass additional info for fix untrained tokens when using distributed + offloading #2388

Uh oh!

winglian commented Mar 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

pass additional info for fix untrained tokens when using distributed + offloading #2388

pass additional info for fix untrained tokens when using distributed + offloading #2388

Uh oh!

Conversation

winglian commented Mar 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

winglian commented Mar 6, 2025 •

edited

Loading