Skip to content

Releases: JuliaGPU/GPUCompiler.jl

v0.26.0

20 Feb 12:16
a95fd25

Choose a tag to compare

GPUCompiler v0.26.0

Diff since v0.25.0

Merged pull requests:

Closed issues:

  • Overlay methods disabling IR interpreter breaks const-prop of GPU-incompatible code (#384)
  • GCN: Assertion failure due to stack address space (#504)
  • Optimizer regression on 1.11 (#506)
  • propagate_julia_addrsp! is not defined (#531)

v0.25.0

29 Sep 11:36
45c4325

Choose a tag to compare

GPUCompiler v0.25.0

Breaking change: PTX back-end now does not replace trap with exit anymore. This probably only matters to CUDA.jl, which now has to make sure to emit exit in its signal handler, to avoid tripping over the trap that probably follows it.

Diff since v0.24.5

Merged pull requests:

  • Update manifest (#522) (@github-actions[bot])
  • PTX: Improve handling of trap (#524) (@maleadt)

v0.24.5

22 Sep 09:31
324d83d

Choose a tag to compare

GPUCompiler v0.24.5

Diff since v0.24.4

Merged pull requests:

  • Only warn about missed transformations when debug output is enabled. (#521) (@maleadt)

v0.24.4

19 Sep 12:00
b23fe47

Choose a tag to compare

GPUCompiler v0.24.4

Diff since v0.24.3

Merged pull requests:

v0.24.3

14 Sep 12:56
a607fd5

Choose a tag to compare

GPUCompiler v0.24.3

Diff since v0.24.2

Merged pull requests:

v0.24.2

13 Sep 12:10
947507e

Choose a tag to compare

GPUCompiler v0.24.2

Diff since v0.24.0

Merged pull requests:

v0.24.1

12 Sep 13:03
c916b4e

Choose a tag to compare

GPUCompiler v0.24.1

Diff since v0.24.0

Merged pull requests:

v0.24.0

12 Sep 08:35
f28fcdc

Choose a tag to compare

GPUCompiler v0.24.0

Diff since v0.23.0

Closed issues:

  • compile(:asm) returns LLVM IR without requiring a context (#420)

Merged pull requests:

v0.23.0

29 Aug 11:26
ffbedbb

Choose a tag to compare

GPUCompiler v0.23.0

Diff since v0.22.0

Closed issues:

  • How to set fast math for CUDA (#491)

Merged pull requests:

v0.22.0

14 Aug 08:54
5fc9c6d

Choose a tag to compare

GPUCompiler v0.22.0

Diff since v0.21.4

Closed issues:

  • Linking bitcode libraries fail on 1.11 (#489)

Merged pull requests:

  • Update manifest (#487) (@github-actions[bot])
  • adjust concrete_eval_eligible overload to v1.10 (#488) (@aviatesk)
  • Add dwarf_version function so targets can choose (#490) (@gbaraldi)
  • Add fastmath flag to PTXCompilerTarget (#492) (@Zentrik)