-
-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Move inputs to right devices. #2919
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+337
−216
Merged
Changes from 2 commits
Commits
Show all changes
19 commits
Select commit
Hold shift + click to select a range
2b95cc7
Move tensors to right devices
Datta0 2ca7875
fix multi gpu for non mistral models
Datta0 03c57c1
multi GPU RoPE for gemma2
Datta0 a937baa
Finish up multi GPU inference
Datta0 44955c3
Make multiGPU rope a list
Datta0 e5158da
Remove unnecessary transfer to CPU
Datta0 de38ece
Remove unnecessary move to CPU
Datta0 324b392
Donot move inputs to device yet
Datta0 6c62402
Move inputs to appropriate decoder device
Datta0 ac75b22
Merge remote-tracking branch 'origin/main' into multigpu_inputs
Datta0 88a31ae
Make device count global variable
Datta0 f1c6bd6
Cleanup RoPE device code
Datta0 704f8ec
Fixup num_gpu to device count
Datta0 21b65b0
Cleanup device counts
Datta0 53759c8
Use device index for RoPE get_cache
Datta0 e7a2220
Merge remote-tracking branch 'origin/main' into multigpu_inputs
Datta0 dac2ae8
Donot typecast
Datta0 464df7c
Use tuple instead of list for tensors. Use device index directly
Datta0 da2bf84
fixup move to device logic
Datta0 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.