You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
|
2 months ago | |
---|---|---|
.. | ||
.vscode | 12 months ago | |
CMakeLists.txt | 4 months ago | |
README.md | 2 months ago | |
cudaGraphPerfScaling.cu | 3 months ago | |
dataCollection.bash | 12 months ago |
README.md
cudaGraphsPerfScaling - Cuda Graphs Perf Scaling
Description
A simple program for characterizing cuda graph api performance with different sized graphs.
Key Concepts
Performance Strategies
Supported SM Architectures
SM 5.0 SM 5.2 SM 5.3 SM 6.0 SM 6.1 SM 7.0 SM 7.2 SM 7.5 SM 8.0 SM 8.6 SM 8.7 SM 8.9 SM 9.0
Supported OSes
Linux, Windows
Supported CPU Architecture
x86_64, armv7l
CUDA APIs involved
CUDA Runtime API
cudaStreamBeginCapture, cudaGraphInstantiate, cudaGraphLaunch, cudaGraphUpload
Prerequisites
Download and install the CUDA Toolkit for your corresponding platform.