This project will deal with the computation of the fast fourier transform (FFT) and singular value decompositions (SVD) using GPU acceleration. There are first implementations entirely based on OpenCL available, which should be reconsidered and extended to CUDA and OpenMP. Consequently, the student will learn how to code in CUDA and OpenCL for using GPUs, and also get some experience with OpenMP. Successful implementations will be made available to the public via the free open source library ViennaCL.
Juraj Kabzan
The following list of milestones will help you in completing the project successfully.
The reimbursement for each of the milestones is indicative, as we hope that you will reach and complete the final milestone anyway
Ample of tutorials for CUDA and OpenCL can be found in the web. A small selection is as follows:
https://developer.nvidia.com/cuda-education-training
http://www.nvidia.com/docs/IO/116711/sc11-cuda-c-basics.pdf
http://developer.amd.com/tools-and-sdks/heterogeneous-computing/amd-accelerated-parallel-processing-app-sdk/introductory-tutorial-to-opencl/
http://www.cc.gatech.edu/~vetter/keeneland/tutorial-2011-04-14/06-intro_to_opencl.pdf