Skip to content

simondanielsson/kernels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Example GPU kernel implementations

This repository contains a implementations of some common GPU kernels in CUDA C++.

Installation

Compile the kernels using

mkdir build
make

Then run the kernels, for instance using

./build/scan

This have only been tested on nvcc 12.9 on an L4 (compute capability 8.9).

Profiling

The kernels can be profiled using Nsight Compute or Nsight Systems:

# compute
make ncu

# systems
make nsys

About

Reference implementations of various parallel algorithms

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published