Forget handcrafting CUDA/OpenCL, forget threading, mutexes, and locks, use Bohrium!
The features include:
- Lazy Evaluation, Bohrium will lazy evaluate all Python/NumPy operations until it encounters a “Python Read” such a printing an array or having a if-statement testing the value of an array.
- Views Bohrium supports NumPy views fully thus operating on array slices does not involve data copying.
- Loop Fusion, Bohrium uses a fusion algorithm that fuses (or merges) array operations into the same computation kernel that are then JIT-compiled and executed. However, Bohrium can only fuse operations that have some common sized dimension and no horizontal data conflicts.
- Lazy CPU/GPU Communiction, Bohrium only move data between the host and the GPU when the data is accessed directly by Python or a Python C-extension.
bohrium
that introduces a new array class bohrium.ndarray
which inherent from numpy.ndarray
. The two array classes a fully compatible thus one only has to replace numpy.ndarray
with bohrium.ndarray
in order to utilize the Bohrium runtime system.The documentation is available at www.bh107.org
https://github.com/bh107/bohrium
No comments:
Post a Comment