Skip to content

Tags: JuliaGPU/KernelAbstractions.jl

Tags

v0.9.37

Toggle v0.9.37's commit message
[Diff since v0.9.36](v0.9.36...v0.9.37)

- Support `@kernel` definition inside functions

**Merged pull requests:**
- Use stacked method tables (#615) (@vchuravy)
- avoid boxing when `@kernel` is used as a closure (#625) (@simeonschaub)

v0.9.36

Toggle v0.9.36's commit message
[Diff since v0.9.35](v0.9.35...v0.9.36)

- `get_backend` support for StaticArrays

**Merged pull requests:**
- Use Printf to report errors from POCL (#592) (@vchuravy)
- use unsafe_indices for a few examples (#612) (@vchuravy)
- Switch to SPIRVIntrinsics 0.3 and the new backend (#614) (@vchuravy)
- KA.__synchronize, add GLOBAL_MEM_FENCE semantic (#618) (@vchuravy)
- add get_backend for StaticArrays (#621) (@vchuravy)

**Closed issues:**
- How to improve CPU performance? (#357)

v0.9.35

Toggle v0.9.35's commit message
[Diff since v0.9.34](v0.9.34...v0.9.35)

**Merged pull requests:**
- Implement a CPU backend using POCL (#556) (@vchuravy)
- [0.10] Forbid divergent execution of work-group barriers (#558) (@vchuravy)
- Bump julia-actions/setup-julia from 1 to 2 (#561) (@dependabot[bot])
- Switch Format.yml to CUDA.jl style (#568) (@vchuravy)
- Test pocl#main on CI (#569) (@vchuravy)
- CompatHelper: add new compat entry for SPIRVIntrinsics at version 0.2, (keep existing compat) (#571) (@github-actions[bot])
- CompatHelper: add new compat entry for GPUCompiler at version 1, (keep existing compat) (#572) (@github-actions[bot])
- CompatHelper: add new compat entry for LLVM at version 9, (keep existing compat) (#573) (@github-actions[bot])
- Check that malformed allocations throw and don't stackoverflow (#576) (@vchuravy)
- Check that malformed allocations throw and don't stackoverflow (#576) (#577) (@vchuravy)
- Avoid callgraph recursion due to exception branch in get_global_id (#579) (@vchuravy)
- Remove CPU(static=true) test (#580) (@vchuravy)
- Set SPIR-V to 1.2 (#582) (@vchuravy)
- use POCL with fixes (#589) (@vchuravy)
- use barrier with LOCAL_MEM_FENCE (#591) (@vchuravy)
- Test correct backend in examples test (#597) (@christiangnrd)
- Switch to pocl_jll@v7 (#599) (@vchuravy)
- prevent `get_backend` from overflowing the stack (#602) (@nsajko)
- [NFC] Ignore formatting PRs in blame (#604) (@christiangnrd)
- Enable downstream CI for 0.10 (#608) (@vchuravy)
- Disable Float16 on the CPU backend (#609) (@vchuravy)

**Closed issues:**
- Exception when getting the index on the CPU backend (#542)
- `Undefined variable error` in kernel after update to `0.9.34` (#575)
- Utility for copying array to GPU (#581)
- `StackOverflowError` on `get_backend(::UnitRange)` (#588)
- KA reports undefined variables, when they are in fact defined (#596)

v0.9.34

Toggle v0.9.34's commit message
[Diff since v0.9.33](v0.9.33...v0.9.34)

**Merged pull requests:**
- Bump googleapis/code-suggester from 2 to 4 (#560) (@dependabot[bot])
- Allow opt-out of implicit bounds-checking (#563) (@vchuravy)
- [0.9] Forbid divergent execution of work-group barriers (#564) (@vchuravy)
- Update Changelog in docs (#565) (@vchuravy)
- Fix docs and test for unsafe_indicies=true (#566) (@vchuravy)
- Fix indicies->indices typo everywhere (#567) (@vchuravy)

v0.9.33

Toggle v0.9.33's commit message
[Diff since v0.9.32](v0.9.32...v0.9.33)

**Merged pull requests:**
- Don't overload Base.ndims(::Any) (#557) (@vchuravy)

v0.9.32

Toggle v0.9.32's commit message
[Diff since v0.9.31](v0.9.31...v0.9.32)

- Clarify the semantics of `KernelAbstractions.copyto!` and add `KernelAbstractions.pagelock!`
- Add support for multiple devices per backend

**Merged pull requests:**
- Run Runic after explicit return rule addition (#516) (@fredrikekre)
- Avoid the exception branch in expand (#518) (@vchuravy)
- Allow for ndims query (#551) (@vchuravy)
- Switch Runic CI (#552) (@vchuravy)
- Update quickstart.md (#553) (@Dale-Black)
- support multiple devices per backend (#554) (@vchuravy)
- Document the semantics of copyto! and add pagelock! (#555) (@vchuravy)

**Closed issues:**
- Add Feature to Select Devices to Execute Kernels On (#458)

v0.9.31

Toggle v0.9.31's commit message
[Diff since v0.9.30](v0.9.30...v0.9.31)

**Merged pull requests:**
- Remove unecessary dependencies from KA (#549) (@vchuravy)

v0.9.30

Toggle v0.9.30's commit message
[Diff since v0.9.29](v0.9.29...v0.9.30)

**Merged pull requests:**
- Add Atomix v1 compat (#545) (@christiangnrd)

v0.9.29

Toggle v0.9.29's commit message
[Diff since v0.9.28](v0.9.28...v0.9.29)

**Merged pull requests:**
- Allow return statements for GPU-only kernels (#538) (@pxl-th)

**Closed issues:**
- Multi-GPU backend (#540)

v0.9.28

Toggle v0.9.28's commit message
[Diff since v0.9.27](v0.9.27...v0.9.28)

**Merged pull requests:**
- Enzyme: fix propagation of runtime activity (#534) (@wsmoses)
- [CI] Only run Enzyme on v1.10+ (#535) (@wsmoses)
- Enzyme support older versions (#537) (@wsmoses)