Skip to content

Conversation

@Crispig
Copy link
Collaborator

@Crispig Crispig commented Aug 7, 2025

What does this PR do?

Fix local rank binding issue when setting RAY_EXPERIMENTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES

Checklist Before Starting

[done] Search for similar PR(s).

Design & Code Changes

change verl/verl/workers/rollout/vllm_rollout/vllm_rollout_spmd.py

Checklist Before Submitting

[ done ] Read the Contribute Guide.
[ done ] Apply pre-commit checks.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request addresses a local rank binding issue for Ascend devices when the RAY_EXPERIMENTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES environment variable is set. The logic correctly assigns the local rank based on RAY_LOCAL_RANK when this setting is active, and otherwise defaults to 0. While the fix itself is correct, I have raised a high-severity maintainability concern about the implementation. The new code introduces logic that is conceptually duplicated elsewhere in the codebase, creating a risk of future inconsistencies and bugs.

@wuxibin89 wuxibin89 merged commit 68598bd into volcengine:main Aug 7, 2025
48 checks passed
girishbarca pushed a commit to cgftinc/verl that referenced this pull request Aug 10, 2025
…NTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES (volcengine#2967)

### What does this PR do?

Fix local rank binding issue when setting
RAY_EXPERIMENTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES

### Checklist Before Starting

[done] Search for similar PR(s).

### Design & Code Changes

change verl/verl/workers/rollout/vllm_rollout/vllm_rollout_spmd.py

### Checklist Before Submitting

[ done ] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
[ done ] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).

---------

Co-authored-by: liaochangyue <[email protected]>
yellowbee686 pushed a commit to yellowbee686/verl that referenced this pull request Aug 11, 2025
…NTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES (volcengine#2967)

### What does this PR do?

Fix local rank binding issue when setting
RAY_EXPERIMENTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES

### Checklist Before Starting

[done] Search for similar PR(s).

### Design & Code Changes

change verl/verl/workers/rollout/vllm_rollout/vllm_rollout_spmd.py

### Checklist Before Submitting

[ done ] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
[ done ] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).

---------

Co-authored-by: liaochangyue <[email protected]>
whatadayG pushed a commit to whatadayG/verl that referenced this pull request Sep 5, 2025
…NTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES (volcengine#2967)

### What does this PR do?

Fix local rank binding issue when setting
RAY_EXPERIMENTAL_NOSET_ASCEND_RT_VISIBLE_DEVICES

### Checklist Before Starting

[done] Search for similar PR(s).

### Design & Code Changes

change verl/verl/workers/rollout/vllm_rollout/vllm_rollout_spmd.py

### Checklist Before Submitting

[ done ] Read the [Contribute
Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide).
[ done ] Apply [pre-commit
checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting).

---------

Co-authored-by: liaochangyue <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants