You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
|
2 months ago | |
---|---|---|
.. | ||
.vscode | 4 years ago | |
CMakeLists.txt | 3 months ago | |
README.md | 2 months ago | |
warpAggregatedAtomicsCG.cu | 3 months ago |
README.md
warpAggregatedAtomicsCG - Warp Aggregated Atomics using Cooperative Groups
Description
This sample demonstrates how using Cooperative Groups (CG) to perform warp aggregated atomics to single and multiple counters, a useful technique to improve performance when many threads atomically add to a single or multiple counters.
Key Concepts
Cooperative Groups, Atomic Intrinsics
Supported SM Architectures
Supported OSes
Linux, Windows
Supported CPU Architecture
x86_64, armv7l, aarch64
CUDA APIs involved
CUDA Runtime API
cudaMemcpy, cudaFree, cudaDeviceGetAttribute, cudaMemset, cudaMalloc
Prerequisites
Download and install the CUDA Toolkit for your corresponding platform.