Releases: JuliaGPU/KernelAbstractions.jl

v0.9.4

01 May 16:45
be5b773

KernelAbstractions v0.9.4

Diff since v0.9.3

Merged pull requests:

v0.9.3

25 Apr 14:45
edaa024

KernelAbstractions v0.9.3

Diff since v0.9.2

Merged pull requests:

  • Migrate from SnoopPrecompile to PrecompileTools (#386) (@timholy)

v0.9.2

11 Apr 16:25

KernelAbstractions v0.9.2

Diff since v0.9.1

Closed issues:

  • Use occupancy API for autotuning (#19)
  • Allow user to turn off contract (#20)
  • Assigning ::ROCDevice to ::KA.GPU (#321)
  • ROCKernels: using queue pool causes performance regression (#344)
  • KernelAbstractions.jl is blocked to v0.8.6 by CUDAKernels (#380)

Merged pull requests:

v0.9.1

25 Mar 20:08
985f960

KernelAbstractions v0.9.1

Diff since v0.9.0

Closed issues:

  • Can't run the example in quickstart (#371)

Merged pull requests:

v0.9.0

09 Mar 20:11
4128c3d

KernelAbstractions v0.9.0

Diff since v0.8.6

Closed issues:

  • No speedup on CPU (#322)
  • Add Metal support (#326)

Merged pull requests:

v0.8.6

20 Nov 20:16
62d7bb9

KernelAbstractions v0.8.6

Diff since v0.8.5

Closed issues:

  • Support for single-threaded kernels even when Threads.nthreads() != 1? (#328)
  • Render issue with Docs admonition? (#332)

Merged pull requests:

v0.8.5

16 Nov 22:39
2c67ba2

KernelAbstractions v0.8.5

Diff since v0.8.4

Closed issues:

  • Add backend lookup function based on input arguments (#229)
  • Update for CUDA.jl 3.0 (#241)

Merged pull requests:

v0.7.3

16 Nov 22:39
3c17ca1

KernelAbstractions v0.7.3

Diff since v0.7.2

Closed issues:

  • Support atomics (#7)
  • Add backend lookup function based on input arguments (#229)
  • Separate Cassette context from CompilerMetadata (#231)
  • Update for CUDA.jl 3.0 (#241)
  • Adding a function to get device from array (type)? (#268)
  • Support for atomics (#276)
  • CUDA 3.6.3 broke KernelAbstractions. (#280)
  • Enzyme fails on GPU kernel (#307)

Merged pull requests:

v0.8.4

14 Sep 10:05
67122c1

KernelAbstractions v0.8.4

Diff since v0.8.3

Merged pull requests:

v0.8.3

25 Jun 20:24
0d5bec9

KernelAbstractions v0.8.3

Diff since v0.8.2

Closed issues:

  • Enzyme fails on GPU kernel (#307)

Merged pull requests:

  • Add 'return nothing' to autodiff (#309) (@pxl-th)
  • Bounding UnsafeAtomics and UnsafeAtomicsLLVM (#311) (@leios)