Stars
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
collection of benchmarks to measure basic GPU capabilities
Dissecting NVIDIA GPU Architecture
Parallel solvers for sparse linear systems featuring multigrid methods.
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
Lightweight, general, scalable C++ library for finite element methods
GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.
Example Fenics antenna simulations as part of "Basics of Antenna Modeling with FEniCS Finite Element Suite"
Next generation FEniCS Form Compiler for finite element forms
A template for Python packages
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Python SYCL bindings and SYCL-based Python Array API library
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous progra…
A reimplementation of the Springer book: https://github.com/hplgit/fenics-tutorial/, covering new topics as well as transitioning from dolfin to dolfinx




