Implement ciflow/rocm on Torchtitan #2114
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,5 @@ | ||
| tracking_issue: 24422 | ||
| ciflow_tracking_issue: 64124 | ||
| ciflow_push_tags: | ||
| - ciflow/8gpu | ||
| - ciflow/rocm | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -3,6 +3,8 @@ name: 8 GPU Feature Tests | ||
| on: | ||
| push: | ||
| branches: [ main ] | ||
| tags: | ||
| - ciflow/8gpu/* | ||
| Comment on lines +6 to +7. Contributor: What's this for -- why do we need both. Collaborator (Author): As I understand it, PR workflows and tag workflows are triggered independently. Tags drive ciflow: a tag can be pushed to trigger a CI run on a specific commit, even after the PR is closed. Tags can also be used for versioning releases. | ||
| paths-ignore: | ||
| - 'torchtitan/experiments/**' | ||
| pull_request: | ||
| @@ -28,20 +30,22 @@ jobs: | ||
| # Step 1: Dynamically compute the matrix based on conditions | ||
| set-matrix: | ||
| runs-on: ubuntu-latest | ||
| env: | ||
| HAS_8GPU_LABEL: ${{ (github.event_name == 'pull_request') && contains(github.event.pull_request.labels.*.name, 'ciflow/8gpu') }} | ||
| outputs: | ||
| matrix: ${{ steps.set.outputs.matrix }} | ||
| steps: | ||
| - id: set | ||
| run: | | ||
| # Decide which matrix entries to include based on event type | ||
| if [[ "${{ github.event_name }}" == "push" && "${{ github.ref }}" == "refs/heads/main" ]] || [[ "${{ github.event_name }}" == "schedule" ]]; then | ||
| # Include both CUDA and ROCm | ||
| # Runs CUDA and ROCm for push to main, cron schedule and PR label | ||
| if [[ "${{ github.event_name }}" == "push" && "${{ github.ref }}" == "refs/heads/main" ]] || [[ "${{ github.event_name }}" == "schedule" ]] || [[ "${HAS_8GPU_LABEL}" == "true" ]]; then | ||
| echo '{"include":[ | ||
| {"name":"cuda","runner":"linux.g5.48xlarge.nvidia.gpu","gpu-arch-type":"cuda","gpu-arch-version":"12.6","docker-image":"torchtitan-ubuntu-20.04-clang12","index-url":"https://download.pytorch.org/whl/nightly/cu126"}, | ||
| {"name":"rocm","runner":"linux.rocm.gpu.gfx942.8","gpu-arch-type":"rocm","gpu-arch-version":"7.0","docker-image":"torchtitan-rocm-ubuntu-22.04-clang12","index-url":"https://download.pytorch.org/whl/nightly/rocm7.0"} | ||
| ]}' > matrix.json | ||
| # Runs CUDA for normal PR (without PR label) | ||
| else | ||
| # Include only CUDA | ||
| echo '{"include":[ | ||
| {"name":"cuda","runner":"linux.g5.48xlarge.nvidia.gpu","gpu-arch-type":"cuda","gpu-arch-version":"12.6","docker-image":"torchtitan-ubuntu-20.04-clang12","index-url":"https://download.pytorch.org/whl/nightly/cu126"} | ||
| ]}' > matrix.json | ||
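The branch condition in the `set-matrix` step can be exercised outside GitHub Actions with a small shell sketch. `select_matrix` is a hypothetical helper, not part of the PR: its three arguments stand in for `github.event_name`, `github.ref`, and `HAS_8GPU_LABEL`, and it prints only the matrix entry names instead of the full JSON. The refs passed in the calls are illustrative values.

```shell
#!/usr/bin/env bash
# Hypothetical helper mirroring the if/else in the set-matrix step.
# $1 ~ github.event_name, $2 ~ github.ref, $3 ~ HAS_8GPU_LABEL ("true"/"false")
select_matrix() {
  local event_name="$1" ref="$2" has_8gpu_label="$3"
  if [[ "$event_name" == "push" && "$ref" == "refs/heads/main" ]] \
     || [[ "$event_name" == "schedule" ]] \
     || [[ "$has_8gpu_label" == "true" ]]; then
    # Push to main, cron schedule, or PR carrying the ciflow/8gpu label
    echo "cuda rocm"
  else
    # Everything else, including a PR without the label
    echo "cuda"
  fi
}

select_matrix push refs/heads/main false               # push to main
select_matrix schedule "" false                        # cron schedule
select_matrix pull_request refs/pull/2114/merge true   # labeled PR
select_matrix pull_request refs/pull/2114/merge false  # unlabeled PR
select_matrix push refs/tags/ciflow/8gpu/2114 false    # ciflow tag push
```

Under the condition as it appears in this excerpt, a pushed `ciflow/8gpu/*` tag is a `push` event whose ref is not `refs/heads/main`, and `HAS_8GPU_LABEL` is only true for `pull_request` events, so the tag push takes the CUDA-only branch.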