[WIP] Option for using GPU #3

pskrunner14 · 2018-10-15T21:17:55Z

note: fixes #2

Merge branch 'master' of github.com:pskrunner14/neural-networks

pskrunner14 · 2018-10-15T21:29:02Z

TODO:

Resolve merge conflicts.
Use cuBLAS for matrix ops and benchmark performance against OpenAAC and OpenMP for larger inputs and hidden units.
Fix numerical gradient checking and improve tests for CI consistency.
Refactor code for future maintainability.

pskrunner14 added 13 commits September 18, 2018 04:00

branched gpu

454a4c6

fixes forward cuda

434546e

Merge pull request #1 from pskrunner14/master

9104b15

Merge branch 'master' of github.com:pskrunner14/neural-networks

adds cuda so lib

b586a67

fixes gpu computation bug

537728f

moves gpu data manip to indep methods

5cb72f6

fixes pyton float and c_float bug

aa14ff5

ref docstring

0b64e2c

adds numba and ref modules

cd76988

adds elemwise defs to cuda_c

fb22785

sep module for computation methods

220a9bb

update cuda code

2091781

enables gpu mode

d321ffc

pskrunner14 added 3 commits October 17, 2018 04:41

functional api change and major refactor

d09bdb6

some more refactor for codacy

3c28999

Merge branch 'master' into gpu

2657f99

Provide feedback