DRL Project SS22
This is our final project for the course 'Deep Reinforcement Learning' given in SS22 at Osnabrück University.
We used the Atari-HEAD dataset [1] to compare human attention maps and DQN saliency maps, and explore the possibility of adding human attention data to the training process. We have also experimented with making inputs to the DQN more similar to human perceptual input. Further information can be found in the project report.
Our project is divided into four parts:
-
Training a baseline DQN to play Asterix (Atari):
-
Creating saliency maps for the baseline DQN
- The scripts for creating saliency maps and analysing them are in saliency.
-
Giving gaze data to the DQN
- Model files are in the folder gaze_network.
- The weights of the gaze prediction model and the weights of the augmented DQN are also uploaded here.
-
Making the learning more human like by foveating the input and focusing agent's attention on one region at a time.
- Code can be found in the folder asterix_with_blurr:
- The weights of the final foveated model are made available.
In the root folder, there are the files for reading in and processing the gaze data.
- my_reader_class.py is used to read in the data and save it in an instance of
Reader
to be able to access it easily. The object then also has methods to create fixation maps and gaze heatmaps to compare them with saliency maps. - For reading in gaze coordinates from the Atari-HEAD files, the file data_reader.py is used which was shared by the creators of the Atari-HEAD dataset and can be found in their GitHub repository.
- The file create_heatmaps.py is used to create heatmaps for training the gaze prediction network, which is then used to augment the DQN for the third part of our project.
Lastly, the report and meeting summaries are in the report folder.
[1] Zhang, R., Walshe, C., Liu, Z., Guan, L., Muller, K. S., Whritner, J. A., Zhang, L., Hayhoe, M., & Ballard, D. (2019). Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset (Version 4). Zenodo. https://doi.org/10.5281/zenodo.3451402