Skip to content

Conversation

@rraminen
Copy link
Contributor

Relaxing the tolerance values to enable the below two unit testa, with FP16 and BF16 data types on ROCm

unit/runtime/half_precision/test_fp8.py::TestFp8ComposabilityAcrossZero::test[bf16]
unit/runtime/half_precision/test_fp8.py::TestFp8ComposabilityAcrossZero::test[fp32]

if is_rocm_pytorch() and model_dtype == torch.float16:
rtol, atol = 3e-07, 3e-05
if is_rocm_pytorch() and base_datatype in ["fp16", "bf16"]:
rtol, atol = 1e-07, 1e-04
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rraminen Suggest figuring out if this is HW-specific e.g. MI200/MI300?

Copy link
Contributor Author

@rraminen rraminen Nov 10, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For ROCm, fp8 features are only supported on MI300. This workaround is needed for MI300 onwards.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rraminen rraminen marked this pull request as draft October 30, 2025 20:04
@rraminen rraminen marked this pull request as ready for review November 10, 2025 17:09
@rraminen rraminen marked this pull request as draft November 14, 2025 17:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants