Skip to content

Repository for Data Science exam, Cognitive Science Masters @ Aarhus University 2022

Notifications You must be signed in to change notification settings

bokajgd/data-science-exam

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


Logo

Enhancing Image Classification Using Cellular Automata Inspired by Natural Transformation Processes

Data science exam 2022

Emil Trenckner Jessen, Jakob Grøhn Damgaard and Johan Kresten Horsmans
Read the paper here»

Table of Contents
  1. About the project
  2. Getting started
  3. Repository structure
  4. Contact

About the project

Logo

Example image displaying the classification framework of the paper

See here for an overview of the entire paper.

This project contains the exam for the Data Science exam of 2022.

Abstract

Many recent advances within the field of image classification are coming from the development of various new data augmentation methods which enable ample model training when faced with scarce data resources (Shorten & Khoshgoftaar, 2019). This paper investigates a novel approach to data augmentation in which cellular automata simulations are used as a method to enhance image data distinctiveness. We develop a set of cellular automata rules and apply them to the Fashion-MNIST data set to assess whether it is possible to enhance discriminatory features of the images in a way that will improve the classification performance of a simple multinomial logistic regression model. The rule sets utilized are rooted in natural transformation processes, namely, the Game of Life, pitting corrosion and melting. Our results suggest that the cellular automata algorithms are successful in enhancing unique and distinguishable key-features between the different image categories as the inclusion of the processed images leads to an increase in performance from 0.762 to 0.807 as measured in Matthews Correlation Coefficient. Hereafter, we discuss what might constitute the emergent discriminatory properties in the processed images which lead to improved performance. Lastly, we explore potential future prospects of using CA augmentation in combination with multidimensional convolutional neural networks within the field of computer vision.

Getting started

For running the scripts, we recommend following the below steps in your bash-terminal.

Cloning repository and creating virtual environment

The below code will clone the repository.

MAC/LINUX/WORKER02

git clone https://github.com/bokajgd/data-science-exam.git
cd data-science-exam

WINDOWS:

git clone https://github.com/bokajgd/data-science-exam.git
cd data-science-exam

Virtual environment

As the code runs on a number of dependencies that may be prone to changes, please use the included file requirements.txt to setup a virtual environment.

After cloning the repo, creating the virtual environment you should be ready to go.

Repository structure

This repository has the following structure:

Column Description
README_images/ Directory containing the few images used in the READMEs.
src/ Directory containing the scripts of the analysis
report.pdf Document that provides a full overview of the exam project.
vis/ Directory containing the visualizations produce by the main script
requirements.txt A list of the required packages.
.gitignore A list of the files that git should ignore upon push/pulling (e.g. virtual environment).
README.md This very README file.

Contact

Feel free to write Emil Jessen, Jakob Grøhn Damgaard or Johan Kresten Horsmans for any questions. You can do so on Slack, using the provided hyperlink.

About

Repository for Data Science exam, Cognitive Science Masters @ Aarhus University 2022

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages