@winglian (Collaborator) commented Mar 6, 2025

Description

The fix for untrained tokens doesn't work properly with distributed offloading (ZeRO-3, per the branch name), because the embeddings need to be gathered before they can be inspected or modified. The model doesn't carry enough information for the function to detect this on its own, so the caller needs to pass that information to the function explicitly.

https://github.com/axolotl-ai-cloud/axolotl-contribs-lgpl/pulls
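
To illustrate the problem described above, here is a minimal, dependency-free sketch. Everything in it is hypothetical: `ShardedEmbedding` is a toy stand-in for a partitioned parameter, `fix_untrained_rows` and its `needs_gather` flag are illustrative names rather than the actual function signature, and filling untrained rows with the mean of the trained rows is an assumed rule, not necessarily what the library does. Under DeepSpeed ZeRO-3, the gather step would presumably be something like `deepspeed.zero.GatheredParameters` rather than the toy context manager used here.

```python
from contextlib import contextmanager, nullcontext


class ShardedEmbedding:
    """Toy stand-in (hypothetical) for a partitioned embedding parameter.

    Outside of gathered(), only an empty placeholder is visible,
    which mimics what a function sees when weights are sharded.
    """

    def __init__(self, rows):
        self._full = rows
        self.rows = []  # placeholder seen while the weights are sharded

    @contextmanager
    def gathered(self):
        # Expose the full weights only for the duration of the block.
        self.rows = self._full
        try:
            yield self
        finally:
            self.rows = []


def fix_untrained_rows(embedding, needs_gather=False, eps=1e-16):
    """Replace all-zero rows with the mean of the non-zero rows.

    The caller states via needs_gather whether the weights must be
    gathered first; the function cannot infer this from the object.
    Returns the number of rows that were replaced.
    """
    ctx = embedding.gathered() if needs_gather else nullcontext()
    with ctx:
        rows = embedding.rows
        trained = [r for r in rows if any(abs(v) > eps for v in r)]
        if not trained:
            return 0  # nothing visible or nothing trained: no-op
        dim = len(trained[0])
        mean = [sum(r[i] for r in trained) / len(trained) for i in range(dim)]
        fixed = 0
        for r in rows:
            if all(abs(v) <= eps for v in r):
                r[:] = mean  # in-place update of the gathered weights
                fixed += 1
    return fixed


emb = ShardedEmbedding([[1.0, 3.0], [0.0, 0.0], [3.0, 5.0]])
without_gather = fix_untrained_rows(emb, needs_gather=False)
with_gather = fix_untrained_rows(emb, needs_gather=True)
```

Without the flag, the function only sees the empty placeholder and silently does nothing; with it, the zero row is found and replaced. That silent no-op is the failure mode the description refers to, and it is why the information has to come from the caller.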

@winglian force-pushed the fix-untrained-w-zero3 branch from 6305227 to 310c273 on March 7, 2025 14:00
@winglian merged commit 59899b9 into main on Mar 11, 2025
16 checks passed
@winglian deleted the fix-untrained-w-zero3 branch on March 11, 2025 16:02
2 participants