- AlexNet
- ResNet 34, 50, 101
- Layerwise benchmarks
- Benchmarking class to log kernel runtime and hardware details (temperature, clock) to `tsv` and log files
- `gputop`: monitor temperature, engine clock, memory clock, fan speed
- `summarize.sql`: summarize layer-wise `.tsv` benchmarking logs to find the most time-consuming layers
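
To illustrate the kind of aggregation described for the layer-wise logs, here is a minimal C++ sketch that sums runtime per layer and ranks the layers, slowest first. It is not the repository's `summarize.sql` and does not reflect the actual log schema: the two-column layout (`layer_name<TAB>runtime_ms`), the `LayerTotal` struct, and the `summarize_tsv` function are all hypothetical names chosen for this example.

```cpp
#include <algorithm>
#include <istream>
#include <map>
#include <sstream>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical log layout (an assumption, not the real schema):
//   layer_name <TAB> runtime_ms
// with one row per measured kernel launch.
struct LayerTotal {
    std::string layer;
    double total_ms;
};

// Sum the runtime per layer and return the layers sorted by total time,
// slowest first. Rows that do not parse (for example a header line or an
// empty line) are skipped.
std::vector<LayerTotal> summarize_tsv(std::istream& in) {
    std::map<std::string, double> totals;
    std::string line;
    while (std::getline(in, line)) {
        std::istringstream row(line);
        std::string layer, runtime;
        if (!std::getline(row, layer, '\t') || !std::getline(row, runtime, '\t')) {
            continue;  // fewer than two columns
        }
        double ms = 0.0;
        try {
            ms = std::stod(runtime);
        } catch (const std::logic_error&) {
            continue;  // non-numeric or out-of-range runtime field
        }
        totals[layer] += ms;
    }

    std::vector<LayerTotal> result;
    for (const auto& entry : totals) {
        result.push_back({entry.first, entry.second});
    }
    // stable_sort keeps alphabetical order (from the map) for equal totals.
    std::stable_sort(result.begin(), result.end(),
                     [](const LayerTotal& a, const LayerTotal& b) {
                         return a.total_ms > b.total_ms;
                     });
    return result;
}
```

The function takes any `std::istream`, so it can be fed from a `std::ifstream` opened on a log file or from a `std::istringstream` in a test. The column positions would need adjusting to match the real logs.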

    make
    make benchmark

Take a look at the `Makefile` or the `.cpp` sources for more details.