Self-hosted runner scale set (AMD mi325 scheduled CI caller) #153

Triggered via workflow run April 7, 2026 06:47
@i3hz
completed da37a4d
Status Failure
Total duration 41s
Artifacts
Matrix: DeepSpeed CI / Check Runners
Matrix: Example CI / Check Runners
Matrix: Model CI / Check Runners
Matrix: Torch pipeline CI / Check Runners
Matrix: DeepSpeed CI / Setup
Matrix: DeepSpeed CI / Examples directory
Matrix: DeepSpeed CI / PyTorch pipelines
Matrix: DeepSpeed CI / Torch ROCm deepspeed tests
Matrix: Example CI / Setup
Matrix: Example CI / Examples directory
Matrix: Example CI / PyTorch pipelines
Matrix: Example CI / Torch ROCm deepspeed tests
Matrix: Model CI / Setup
Matrix: Model CI / Examples directory
Matrix: Model CI / PyTorch pipelines
Matrix: Model CI / Torch ROCm deepspeed tests
Matrix: Torch pipeline CI / Setup
Matrix: Torch pipeline CI / Examples directory
Matrix: Torch pipeline CI / PyTorch pipelines
Matrix: Torch pipeline CI / Torch ROCm deepspeed tests
Matrix: DeepSpeed CI / Single GPU tests
Waiting for pending jobs
Matrix: Example CI / Single GPU tests
Waiting for pending jobs
Matrix: Model CI / Single GPU tests
Waiting for pending jobs
Matrix: Torch pipeline CI / Single GPU tests
Waiting for pending jobs
DeepSpeed CI / Slack Report / Send results to webhook: 32s
Example CI / Slack Report / Send results to webhook: 32s
Model CI / Slack Report / Send results to webhook: 36s
Torch pipeline CI / Slack Report / Send results to webhook: 34s

Annotations

12 errors and 4 warnings
Torch pipeline CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
DeepSpeed CI / Check Runners (2gpu)
Required runner group 'amd-mi325-2gpu' not found
Example CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
Torch pipeline CI / Check Runners (2gpu)
Required runner group 'amd-mi325-2gpu' not found
DeepSpeed CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
Example CI / Check Runners (2gpu)
Required runner group 'amd-mi325-2gpu' not found
Model CI / Check Runners (2gpu)
Required runner group 'amd-mi325-2gpu' not found
Model CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
DeepSpeed CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Example CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Torch pipeline CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Model CI / Slack Report / Send results to webhook
Process completed with exit code 1.
DeepSpeed CI / Slack Report / Send results to webhook
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@v4, actions/download-artifact@v4. Actions will be forced to run with Node.js 24 by default starting June 2nd, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
Example CI / Slack Report / Send results to webhook
Torch pipeline CI / Slack Report / Send results to webhook
Model CI / Slack Report / Send results to webhook
Same Node.js 20 deprecation warning as for DeepSpeed CI / Slack Report / Send results to webhook.
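
The Node.js 20 warning says the opt-in variable can be set "in your workflow file". A minimal sketch of what that might look like is below. It assumes a workflow-level `env:` block is an acceptable place for it; the workflow name and the surrounding structure are hypothetical and are not taken from this repository's actual workflow. Only the variable name and value come from the warning text.

```yaml
# Hypothetical workflow fragment, for illustration only.
# The workflow name below is a placeholder, not the real file.
name: example-slack-report

# Opt JavaScript actions (e.g. actions/checkout@v4,
# actions/download-artifact@v4) into Node.js 24 ahead of the
# default switch described in the deprecation warning.
env:
  FORCE_JAVASCRIPT_ACTIONS_TO_NODE24: "true"
```

This only addresses the 4 warnings. The 12 errors (missing runner groups 'amd-mi325-1gpu' and 'amd-mi325-2gpu', and the webhook jobs exiting with code 1) are unrelated to the Node.js version.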