Implement usage of shared memory to optimize kernels.
Implement usage of shared memory to optimize kernels.