92
Figure
8-3: A comparison traditional loop with the OpenCL data parallel kernel
The Figure 8-3 gives the advantages of writing the code in OpenCL as well as the parallel
computing capability of the OpenCL kernel. From Figure 8-4 we can clearly understand the
steps involved in the OpenCL programming model. The steps show the structural design
and execution of a kernel. Whenever
the kernel is designed, these steps are followed in
order to execute the kernel on the appropriate device.
Here in our work, we have used the OpenCL in order
to develop the UM
–
CNN which is to be
executed on the GPU. The required kernel which is necessary for the execution of the CNN
is written using the OpenCL programming model. The interface of the data is done from the
CPU which is to be loaded onto the device. For proper interface of the communication of
data we have used the
Open Computer Vision Library
(OpenCV) for the reading of images,
loading of the image data on to the memory elements and to display the retrieved image
back from the GPU after its execution.