fix(mig): fallback gpu_memory_total value #3353

Closed
tomheno wants to merge 1 commit into sgl-project:main from tomheno:main

tomheno commented Feb 6, 2025

Motivation

It is currently not possible to run on MIG-partitioned GPUs, because nvidia-smi lacks sufficient permissions on them to report the available memory.
Tested on H100 and H200.

See #2933

Modifications

Added an SGLANG_GPU_MEMORY_TOTAL_FALLBACK environment variable to set the available memory manually when it cannot be read from nvidia-smi.
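
For readers who have not seen the diff, the fallback described above could look roughly like the sketch below. This is not the code from this PR: the function names, the choice of MiB as the unit, and the use of the smallest per-GPU value are assumptions made for illustration. Only the environment variable name comes from the description.

```python
import os
import subprocess

# Name taken from the PR description; unit (MiB) is an assumption of this sketch.
FALLBACK_ENV = "SGLANG_GPU_MEMORY_TOTAL_FALLBACK"


def query_nvidia_smi_total_mib():
    """Hypothetical helper: smallest per-GPU total memory in MiB, or None."""
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=memory.total",
             "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=True,
        )
    except (OSError, subprocess.CalledProcessError):
        return None  # nvidia-smi is missing or exited with an error
    values = []
    for line in result.stdout.splitlines():
        try:
            values.append(float(line.strip()))
        except ValueError:
            continue  # skip non-numeric output, e.g. a permissions message
    return min(values) if values else None


def get_gpu_memory_total_mib(query=query_nvidia_smi_total_mib, environ=os.environ):
    """Prefer the nvidia-smi value; otherwise use the environment fallback."""
    total = query()
    if total is not None:
        return total
    raw = environ.get(FALLBACK_ENV)
    if raw is None:
        raise RuntimeError(
            f"nvidia-smi returned no usable memory value and {FALLBACK_ENV} is not set"
        )
    return float(raw)
```

Under these assumptions, a user on a MIG slice would export SGLANG_GPU_MEMORY_TOTAL_FALLBACK with the slice's memory size before launching the server, and the lookup would only consult that value when the nvidia-smi query yields nothing usable.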

@zhyncs
Collaborator

zhyncs commented Feb 7, 2025

@dsingal0 what do you think?

@merrymercy
Contributor

Can you resolve the conflicts?

@tomheno
Author

tomheno commented Jun 1, 2025

Hi @merrymercy, I have resolved the conflicts.
Thanks.

@Garrybest
Contributor

Fixed in #8167.

zhyncs closed this on Jul 20, 2025
@zhyncs
Collaborator

zhyncs commented Jul 20, 2025

Thanks for the contribution!


4 participants