batchCUBLAS - batchCUBLAS
Description
A CUDA Sample that demonstrates how to use batched CUBLAS API calls to improve overall performance.
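To make the idea of a batched call concrete, the following is a minimal CPU-only sketch of what a batched GEMM computes: the same `C = alpha * A * B + beta * C` update applied to every matrix in a batch. It is not taken from batchCUBLAS.cpp. The function name `batchedGemmReference` and the use of `std::vector` are assumptions made for illustration. The column-major layout matches the cuBLAS convention. On the GPU, a batched routine such as `cublasSgemmBatched` performs this whole loop over the batch in a single library call, which is where the performance benefit for many small matrices is expected to come from.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reference helper (not part of the sample):
// for every b in the batch, C[b] = alpha * A[b] * B[b] + beta * C[b].
// All matrices are stored column-major, as cuBLAS expects:
//   A[b] is m x k, B[b] is k x n, C[b] is m x n.
void batchedGemmReference(int m, int n, int k, float alpha,
                          const std::vector<std::vector<float>>& A,
                          const std::vector<std::vector<float>>& B,
                          float beta,
                          std::vector<std::vector<float>>& C)
{
    // Every batch entry needs one A, one B, and one C matrix.
    assert(A.size() == C.size() && B.size() == C.size());

    for (std::size_t b = 0; b < C.size(); ++b) {        // loop over the batch
        for (int col = 0; col < n; ++col) {
            for (int row = 0; row < m; ++row) {
                float acc = 0.0f;
                for (int p = 0; p < k; ++p) {
                    // Column-major indexing: element (r, c) lives at r + c * rows.
                    acc += A[b][row + p * m] * B[b][p + col * k];
                }
                float& out = C[b][row + col * m];
                out = alpha * acc + beta * out;
            }
        }
    }
}
```

A CPU loop like this is mainly useful as a correctness check for small inputs; it says nothing about GPU timing.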
Key Concepts
Linear Algebra, CUBLAS Library
Supported SM Architectures
SM 5.0 SM 5.2 SM 5.3 SM 6.0 SM 6.1 SM 7.0 SM 7.2 SM 7.5 SM 8.0 SM 8.6 SM 8.7 SM 8.9 SM 9.0
Supported OSes
Linux, Windows
Supported CPU Architecture
x86_64, armv7l
CUDA APIs involved
CUDA Driver API
cuRand, cuEqual
CUDA Runtime API
cudaMemcpy, cudaGetErrorString, cudaFree, cudaGetLastError, cudaDeviceSynchronize, cudaGetDevice, cudaMalloc, cudaStreamCreate, cudaGetDeviceProperties
Dependencies needed to build/run
Prerequisites
Download and install the CUDA Toolkit for your platform. Make sure the dependencies listed in the Dependencies section above are installed.