libgemm

CUDA cuBLAS DGEMM and SGEMM, Ozaki Scheme I EF and Ozaki Scheme II (Fast and Accurate Modes) Interception Library for BLAS dgemm and CBLAS cblas_dgemm

Usage

Compiling

bash compile.sh

Using

LD_PRELOAD=/path/to/libgemm/lib/libgemm.so python <script>

libgemm uses the LIBGEMM_OP_MODE to determine the interception redirection behaviour

Valid Modes:

0 - Netlib BLAS dgemm_ / Netlib CBLAS cblas_dgemm
10 - cuBLAS dgemm
15 - cuBLAS sgemm
103-116 - Ozaki Scheme I EF with Splits 3-17
202-220 - Ozaki Scheme II Fast Mode with Moduli 2-20
302-320 - Ozaki Scheme II Accurate Mode with Moduli 2-20

PySCF bindings

Patched numpy_helper.py file to enable interception of only lib.dgemm and lib.einsum calls for PySCF.

Requires the LIBGEMM_LIMITED_OP environment variable as an intercept target. The patched script will set and reset the LIBGEMM_OP_MODE target for libgemm based on this variable. Valid operation modes are the same as the LIBGEMM_OP_MODE modes.

It is recommended to compile the C backend for numpy_helper with the No-OpenMP flag, since parallel dgemm calls cause issues with the Ozaki Scheme II code. In the pyscf/lib/np_helper directory, run:

gcc *.c -I.. -O3 -shared -fPIC -fno-openmp -lopenblas -o ../libnp_helper.so

Debug Mode

libgemm_debug.so computes the L2 error between the OS-II matrices and cuBLAS dgemm matrices as reference.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
include		include
src		src
README.md		README.md
compile.sh		compile.sh
numpy_helper.py		numpy_helper.py
run_tests.py		run_tests.py
run_tests.sh		run_tests.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

libgemm

Usage

Compiling

Using

PySCF bindings

Debug Mode

About

Uh oh!

Releases

Packages

Languages

prajvalk/libgemm

Folders and files

Latest commit

History

Repository files navigation

libgemm

Usage

Compiling

Using

PySCF bindings

Debug Mode

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages