A comparison of fully connected network (forward and backward propagation) implementations.
C++/CUDA
.openmp
library.openblas
library.include/
contains headers.src/
contains all variant implementations.data/
contains MNIST data.proposal.pdf
contains the project proposal.presentation.pdf
contains the presentation given at the end of the project.report.pdf
contains details, experiments and analysis.Makefile
is used to make target executables.Makefile
.make all
to build all targets.cpu_serial.out
cuda_parallel.out
openmp.out
openblas.out
cuda_serial.out
make <target-name>
.make clean
../<target-name>
to run a target.