cuda-examples

Examples of GPU acceleration with CUDA and OpenACC.

This repository primarily focuses on a High-performance CSR Sparse Matrix - Dense Vector multiplication (SpMV) in CUDA located in Cpp/SpMV_Cpp_CUDA.

Older examples (Fortran, Managed Memory) have been moved to the archive/ directory.

Requirements

NVIDIA HPC SDK: These examples depend on different modules and libraries like CUDA, OpenACC, cuSPARSE and Thrust.

Active Projects

SpMV_Cpp_CUDA: High-performance CSR Sparse Matrix - Dense Vector multiplication (SpMV) in CUDA. It features a templated CSR SpMV kernel with optional shared-memory reduction and aims to be competitive with cuSPARSE. This forms the foundation for our upcoming GPU-accelerated Finite Element Method (FEM) linear solver.

Archived Projects

Located in archive/:

SpMV_Fortran_OACC: Implementation of sparse matrix-vector multiplication using OpenACC directives in Fortran.
Thrust_interop: Shows how to couple Fortran using OpenACC directives with the Thrust library via CUDA Fortran interfaces.
cuSPARSE_Fortran_example: Sparse matrix multiplication on Fortran arrays ported to GPU using OpenACC, wrapping the cuSPARSE library.
SpMV_Cpp_CUDA_ManagedMemory: Earlier SpMV implementation using unified managed memory.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
Cpp		Cpp
archive		archive
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cuda-examples

Requirements

Active Projects

Archived Projects

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

cuda-examples

Requirements

Active Projects

Archived Projects

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages