You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
---------
Co-authored-by: Pavel Emeliyanenko <[email protected]>
(cherry picked from commit f013645)
(cherry picked from commit b03cd94)
Added support for waves_per_eu function attribute. (#181)
(cherry picked from commit bc1d816)
(cherry picked from commit d3f94e9)
removed two line change (revert of half of the openxla#25959 commit
(cherry picked from commit 109e138)
Fixes for jax 0.6.0 (#207)
* Add fixes for jax plugin 0.6.0
Drop NEEDED linking to unnecessary libs.
These are loaded by amdhipruntime and not us.
Fix missing NEEDED on MIOpen shared object.
* Minor rocblas related changes for rocm 70
(cherry picked from commit 0de7d49)
---------
Co-authored-by: Zoran Jovanovic <[email protected]>
(cherry picked from commit 28f10a0)
Add hipBLASLt support for gfx11. (#301)
(cherry picked from commit f814bff)
Add bf16 starting from gfx11, bugfix & optimize RocmComputeCapability (#303)
* Bugfix and improve device_description.h::RocmComputeCompatibility
* Enable ALG_DOT_BF16* on rocm with HW support
(cherry picked from commit 510ea06)
[ROCm] Use bundled bitcode files (#196)
Also trim bitcode file list to ockl.bc and ocml.bc only.
(cherry picked from commit fc9e3c3)
Add MIOPEN_FIND_ENFORCE For ROCm 7 for convolution gemms (#312)
* Add MIOPEN_FIND_ENFORCE For ROCm 7 for convolution gemms
* Exclude failing CollectiveOpsE2E tests
(cherry picked from commit fb6ddfb)
Restore RocmComputeCapability:: gfx11_rx7900() and gfx12_rx8900() methods (#333)
At least gfx11_rx7900() is still needed for TF build.
(cherry picked from commit 13c3de1)
Make device_count_ atomic (#343)
* Make device_count_ atomic
* Use relaxed memory order
* Fix build error
(cherry picked from commit 8513f2d)
fix hardcoded max registers (#345)
(cherry picked from commit f3e170a)
fix hardcoded ecc enabled (#348)
(cherry picked from commit 9cfa74a)
remove reserved memory (#349)
(cherry picked from commit 0015d0e)
Add rocm_dev config for remote caching (#353)
(cherry picked from commit c815420)
added rocm7 support to EnablePeerAccess (#347)
* added rocm7 support to EnablePeerAccess
* use wrap namespace, clang-format and add comments
(cherry picked from commit 85548a7)
[ROCm] Disable Cudnn fusions (#358)
(cherry picked from commit edab8b2)
0 commit comments