A note of Nvidia CUDA programming learning
CUDA Official Docs here
- Parallelism: Use the GPU CUDA kernel to perform various operations in parallel.
- Performance Comparison: Compare the excution time of the CPU and GPU implementations, showcasing the potential speedup of GPU parallelism computation.
- Correctness Check: The results from the GPU and CPU are compared to ensure the CUDA kernel gets the correct output.