WARNING: The GPU version is experimental and may contain unknown bugs. It is not discussed in the original paper, and we have only tested it with small-scale models, which is by no means thorough.
DrMAD-Theano uses Lasagne to build a simple MLP.
Run:

```shell
THEANO_FLAGS=mode=FAST_RUN,device=gpu0,floatX=float32 python simple_mlp.py
```
`simple_mlp.py` includes three phases:

- Phase 1: Algorithm 1.
- Phase 2: obtain the validation loss on the validation set. (Since there are multiple iterations, we output the gradients and take their average across the iterations.)
- Phase 3: Algorithm 2.
- We use `Lop()` to obtain the Hessian-vector products in lines 6-7 of Algorithm 2; this is defined in `hypergrad.py`.
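`Lop()` computes a Jacobian-transpose-vector product symbolically; applied to the gradient, it yields a Hessian-vector product without ever materializing the Hessian. A minimal numpy sketch of the same quantity for a quadratic loss, checked against a finite-difference approximation (the loss and all names here are illustrative, not from the repository):

```python
import numpy as np

# For f(w) = 0.5 * w^T A w, the gradient is A w and the Hessian is A,
# so the exact Hessian-vector product is A v. A central finite
# difference of the gradient along v approximates the same quantity,
# mirroring what Lop(grad, w, v) returns symbolically.
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
grad = lambda w: A @ w

w = np.array([0.5, -1.0])   # current parameters
v = np.array([1.0, 2.0])    # direction to multiply the Hessian by

eps = 1e-6
hvp_fd = (grad(w + eps * v) - grad(w - eps * v)) / (2 * eps)
hvp_exact = A @ v
```

The point of the `Lop()` formulation is exactly that the middle quantity (the full Hessian) never needs to be built, which is what makes lines 6-7 of Algorithm 2 tractable for larger models.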
- `args.py`: configuration for DrMAD.
- `layers.py`: provides class `DenseLayerWithReg()` to build up a simple MLP.
- `models.py`: provides class `MLP()`.
- `updates.py`: provides update rules for different Theano functions.
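One common update rule of the kind `updates.py` provides is SGD with momentum. The sketch below is a plain numpy illustration under that assumption; the function name and defaults are hypothetical, not the repository's API.

```python
import numpy as np

def sgd_momentum_step(w, velocity, grad, lr=0.01, momentum=0.9):
    """One SGD-with-momentum update (illustrative sketch).

    Keeps a running velocity that accumulates past gradients, then
    moves the parameters along it.
    """
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity

w = np.zeros(2)
vel = np.zeros(2)
g = np.array([1.0, -1.0])
w, vel = sgd_momentum_step(w, vel, g)  # -> w == array([-0.01,  0.01])
```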