Parallel Programming on the GPU using CUDA
GPUtimer : the timer used in the following codes.
VectorMultiply1 : This code performs element-wise vector multiplication.
OpenCV Tutorials:
https://docs.opencv.org/3.0beta/doc/tutorials/introduction/table_of_content_introduction/table_of_content_introduction.html