This is a PyTorch implementation of the paper: "Processing Megapixel Images with Deep Attention-Sampling Models".

stegmuel/attention-sampling-pytorch

 
 

Attention Sampling - Pytorch

This is a PyTorch implementation of the paper "Processing Megapixel Images with Deep Attention-Sampling Models". It is based on the original repository accompanying the paper, which is written in TensorFlow.

Porting to PyTorch

The code from the original repository has been rewritten as a PyTorch 1.4.0 implementation. The most difficult part was rewriting the functions that extract patches from the high-resolution image. The original version uses dedicated C/C++ files for this; I have done it in native Python. This is probably slower, because it requires a nested for-loop. I also tried performing the patch extraction in parallel, but that adds so much overhead that it is actually slower.
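To illustrate why patch extraction ends up as a nested for-loop, here is a minimal sketch. It is not the code from this repository: the function name extract_patches, the argument layout, and the zero-padding at the image border are all assumptions made for the example. It assumes the top-left corners of the patches have already been computed from the sampled attention locations.

```python
import torch


def extract_patches(images, offsets, patch_size):
    """Copy fixed-size patches out of a batch of high-resolution images.

    Hypothetical helper for illustration only.
    images:     float tensor of shape (B, C, H, W)
    offsets:    integer tensor of shape (B, N, 2) holding (top, left) corners
    patch_size: (patch_height, patch_width)
    Regions that fall outside the image are left as zeros (an assumption).
    """
    batch, channels, height, width = images.shape
    n_patches = offsets.shape[1]
    ph, pw = patch_size
    patches = images.new_zeros(batch, n_patches, channels, ph, pw)

    # Outer loop over images, inner loop over the patches sampled per image.
    for b in range(batch):
        for n in range(n_patches):
            top, left = offsets[b, n].tolist()
            # Clip the requested window to the image bounds.
            y0, y1 = max(top, 0), min(top + ph, height)
            x0, x1 = max(left, 0), min(left + pw, width)
            if y0 >= y1 or x0 >= x1:
                continue  # window lies entirely outside the image
            patches[b, n, :, y0 - top:y1 - top, x0 - left:x1 - left] = \
                images[b, :, y0:y1, x0:x1]
    return patches
```

Each iteration copies one small slice, so the Python-level loop overhead grows with the batch size times the number of patches, which is the cost described above.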

Furthermore, I hope the part that computes the expectation is implemented correctly. It uses a custom backward() function, which may still contain bugs.
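For readers unfamiliar with custom backward() functions, the sketch below shows the general shape of one for a Monte Carlo expectation. It is not the implementation in this repository. It assumes the simplest case, where patches are sampled with replacement from the attention distribution, the forward pass averages the features of the sampled patches, and the gradient with respect to each sampled attention value is the feature divided by that attention value. The class name SampledExpectation and the tensor layout are hypothetical.

```python
import torch


class SampledExpectation(torch.autograd.Function):
    """Estimate sum_i a_i * f_i from N samples drawn with replacement from a.

    Illustrative sketch only; names and shapes are assumptions.
    sampled_attention: (B, N)    attention values at the sampled positions
    sampled_features:  (B, N, D) features of the sampled patches
    """

    @staticmethod
    def forward(ctx, sampled_attention, sampled_features):
        ctx.save_for_backward(sampled_attention, sampled_features)
        # With samples drawn from a, the plain average is the estimator.
        return sampled_features.mean(dim=1)

    @staticmethod
    def backward(ctx, grad_output):
        sampled_attention, sampled_features = ctx.saved_tensors
        n = sampled_features.shape[1]
        grad = grad_output.unsqueeze(1)  # (B, 1, D)

        # Each sampled feature contributes to the average with weight 1/N.
        grad_features = grad.expand_as(sampled_features) / n

        # For the attention, differentiate a_i * f_i and divide by a_i,
        # which gives f_i / a_i per sample, again weighted by 1/N.
        grad_attention = (grad * sampled_features).sum(dim=2) / (sampled_attention * n)
        return grad_attention, grad_features
```

A function like this is called through SampledExpectation.apply(sampled_attention, sampled_features). Comparing its gradients against hand-computed values on tiny inputs is one way to look for bugs in such a backward pass.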

Performance

This repository has been tested on two of the tasks from the original paper: Mega-MNIST and traffic sign detection. A qualitative analysis shows that the results are comparable to the original work; a quantitative analysis, however, shows that the errors are higher in this code base. A couple of users have alerted me that they cannot reproduce the results from the original paper using this code base, and I suspect a few bugs remain. If you intend to use this code, beware; any help finding these bugs will be greatly appreciated. To run the experiments, run mega_mnist.py and speed_limits.py.

Installation

Dependencies can be found inside the requirements.txt file. To install, run pip3 install -r requirements.txt. This code repository defaults to running on a GPU if it is available. It has been tested on both CPU and GPU.
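The usual PyTorch pattern for falling back to the CPU when no GPU is present looks like the sketch below. How the scripts in this repository select the device is not shown here, so treat this as a generic example; the small linear model and random batch are placeholders.

```python
import torch

# Use the GPU when one is visible to PyTorch, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Placeholder model and data, moved to the selected device.
model = torch.nn.Linear(4, 2).to(device)
batch = torch.randn(8, 4, device=device)
output = model(batch)
```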

Questions and contributions

If you have any questions about the code or methods used in this repository, you can reach out to joris.mollinga@surf.nl. If you find bugs in this code (which is quite possible), please contact me or file an issue. If you want to contribute by making the code more efficient (for example, the patch extraction procedure is quite inefficient), please contact me or submit a pull request.

Research

If this repository has helped you in your research, we would appreciate an acknowledgement in your publication.

Acknowledgement

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825292. This project is better known as the ExaMode project. The objectives of the ExaMode project are:

  1. Weakly-supervised knowledge discovery for exascale medical data.
  2. Develop extreme scale analytic tools for heterogeneous exascale multimodal and multimedia data.
  3. Healthcare & industry decision-making adoption of extreme-scale analysis and prediction tools.

For more information on the ExaMode project, please visit www.examode.eu.
