PRML Project

Download and place the dataset inside the repository.
Make a folder called test_images and place all pictures you want to make predictions on in this folder. Make sure the folder is inside the repository.
To create the custom dataset (custom_dataset.pkl), run all cells in the notebook custom_dataset.ipynb.
Run all cells in bmi.ipynb to predict BMI values.
Run all cells in gender.ipynb to predict gender.
Run all cells in distribution.ipynb to visualize the number of offences committed.

Data Preprocessing

Notebook - `custom_dataset.ipynb`

CSV file - `person.csv`

Image folders - `/front` and `/side`

Implementation

Reading gender of each inmate.
Calculating the BMI of each inmate using the formula BMI = weight (kg) / height (m)^2.
Reading the front image and side image of the inmates only if the image exists.
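The BMI calculation and the image-existence check can be sketched as follows. This is a minimal illustration, not the notebook's code: the function names, the `<inmate_id>.jpg` file naming, and the assumption that weight and height are already in kilograms and metres are all hypothetical.

```python
import os


def compute_bmi(weight_kg, height_m):
    """Return BMI = weight (kg) / height (m)^2."""
    if height_m <= 0:
        raise ValueError("height must be positive")
    return weight_kg / height_m ** 2


def has_both_images(inmate_id, front_dir="front", side_dir="side"):
    """True only if both the front and side image files exist.

    The '<inmate_id>.jpg' naming scheme is an assumption for illustration.
    """
    filename = f"{inmate_id}.jpg"
    return (os.path.isfile(os.path.join(front_dir, filename))
            and os.path.isfile(os.path.join(side_dir, filename)))


if __name__ == "__main__":
    print(round(compute_bmi(70.0, 1.75), 2))
```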

An inmate is recorded only if both the front and side images are available.
A limit of 4000 is kept on the number of males to match the number of females.
The images are resized to (512,512).
The images are converted to grayscale.
The images are flattened before being stored in the pickle file.
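The grayscale and flatten steps can be illustrated with NumPy alone. This sketch assumes the image has already been loaded and resized to (512, 512) by an image library of your choice; the BT.601 luma weights and the function names are assumptions, and the notebook may rely on a library call instead.

```python
import numpy as np


def to_grayscale(rgb):
    """Convert an (H, W, 3) RGB array to an (H, W) array of luma values.

    Uses the common BT.601 weights; this choice is an assumption.
    """
    weights = np.array([0.299, 0.587, 0.114])
    return rgb[..., :3] @ weights


def flatten_image(gray):
    """Flatten an (H, W) image into a 1-D vector of length H * W."""
    return gray.reshape(-1)


if __name__ == "__main__":
    # Placeholder image standing in for a resized 512 x 512 photo.
    rgb = np.zeros((512, 512, 3), dtype=np.uint8)
    vector = flatten_image(to_grayscale(rgb))
    print(vector.shape)
```

A 512 x 512 grayscale image flattens to a vector of 512 * 512 = 262144 values.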

The dataset is converted to a pickle file custom_dataset.pkl for further use.

Feature Detection

Notebook

Feature detection is implemented in both notebooks, bmi.ipynb and gender.ipynb.

Feature Extraction

The 68 facial landmarks are used as features. The coordinates of the landmarks (x and y values) give a total of 68 * 2 = 136 dimensions.

The dlib library is used to extract these features.
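To show how 68 landmarks become a 136-dimensional vector, here is a small sketch. It assumes a landmark object that exposes `part(i)` with `x` and `y` attributes, as the object returned by a dlib shape predictor does; the function name and the interleaved ordering (x0, y0, x1, y1, ...) are assumptions, and the notebooks may order the coordinates differently.

```python
def landmarks_to_vector(shape, num_points=68):
    """Flatten landmark points into [x0, y0, x1, y1, ...].

    `shape` is any object with a part(i) method returning a point that
    has .x and .y attributes (for example, a dlib landmark result).
    """
    vector = []
    for i in range(num_points):
        point = shape.part(i)
        vector.extend([point.x, point.y])
    return vector
```

With 68 points the returned list has 68 * 2 = 136 entries, one feature row per face.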

Dimensionality reduction

Principal Component Analysis (PCA) is performed on these 136 dimensions to reduce the number of dimensions to 23. This captures 99.9% of the variance.
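The idea of keeping just enough principal components to reach a target share of variance can be sketched with NumPy. This is an illustration under stated assumptions, not the notebooks' code: the function name is hypothetical, the data below is synthetic, and the notebooks may use a library PCA instead.

```python
import numpy as np


def pca_reduce(X, variance_to_keep=0.999):
    """Project X onto the fewest principal components whose cumulative
    explained variance reaches `variance_to_keep`.

    Returns the reduced data and the number of components kept.
    """
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are the principal directions, ordered by singular value.
    _, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
    explained = S ** 2 / np.sum(S ** 2)
    cumulative = np.cumsum(explained)
    k = int(np.searchsorted(cumulative, variance_to_keep)) + 1
    k = min(k, len(S))
    return X_centered @ Vt[:k].T, k


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-in for an (n_faces, 136) landmark matrix.
    X = rng.normal(size=(200, 5)) @ rng.normal(size=(5, 136))
    reduced, k = pca_reduce(X)
    print(reduced.shape, k)
```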

BMI Prediction

Notebook - `bmi.ipynb`

Implementation

A linear regression model is used with an 80-20 train-test split. The model is evaluated on the following metrics:

MAE
MSE
R2
Pearson Coefficient

The model achieved an MAE of 4.32.
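The evaluation described above can be sketched as follows: an 80-20 split, a least-squares linear fit, and the four metrics. Everything here is an assumption for illustration (function name, random seed, synthetic data); the notebook may use a library model and library metric functions, and this sketch will not reproduce the reported score.

```python
import numpy as np


def evaluate_linear_regression(X, y, test_fraction=0.2, seed=0):
    """Fit ordinary least squares on a train split and report
    MAE, MSE, R2 and the Pearson coefficient on the test split."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(X))
    n_test = int(round(len(X) * test_fraction))
    test_idx, train_idx = order[:n_test], order[n_test:]

    # Append a column of ones so the fit includes an intercept.
    A_train = np.column_stack([X[train_idx], np.ones(len(train_idx))])
    coef, *_ = np.linalg.lstsq(A_train, y[train_idx], rcond=None)

    A_test = np.column_stack([X[test_idx], np.ones(len(test_idx))])
    y_test = y[test_idx]
    pred = A_test @ coef
    err = y_test - pred

    return {
        "mae": float(np.mean(np.abs(err))),
        "mse": float(np.mean(err ** 2)),
        "r2": float(1 - np.sum(err ** 2) / np.sum((y_test - y_test.mean()) ** 2)),
        "pearson": float(np.corrcoef(y_test, pred)[0, 1]),
    }


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    X = rng.normal(size=(500, 23))  # stand-in for 23 PCA features
    y = X @ rng.normal(size=23) + 25 + 0.1 * rng.normal(size=500)
    print(evaluate_linear_regression(X, y))
```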

Gender Classification

Notebook - `gender.ipynb`

Implementation

A support vector machine (SVM) model is used with an 80-20 train-test split. The model is evaluated on the following metrics:

MAE
MSE
R2
Pearson Coefficient

The model achieved an accuracy of 87.43%.
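As a rough illustration of the classification step, the sketch below trains a linear SVM by subgradient descent on the hinge loss and measures accuracy on a held-out 20%. It is a swapped-in, NumPy-only stand-in: the notebook may use a library SVM with a different kernel, and the function names, hyperparameters, and synthetic data are all assumptions.

```python
import numpy as np


def train_linear_svm(X, y, reg=0.01, lr=0.1, epochs=200):
    """Minimise reg/2 * ||w||^2 + mean hinge loss; labels y are -1 or +1."""
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        active = margins < 1  # samples that violate the margin
        grad_w = reg * w - (y[active, None] * X[active]).sum(axis=0) / n
        grad_b = -y[active].sum() / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def predict(X, w, b):
    """Return -1 or +1 for each row of X."""
    return np.where(X @ w + b >= 0, 1, -1)


def accuracy(y_true, y_pred):
    return float(np.mean(y_true == y_pred))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two synthetic classes standing in for the reduced landmark features.
    X = np.vstack([rng.normal(1.5, 1.0, size=(200, 5)),
                   rng.normal(-1.5, 1.0, size=(200, 5))])
    y = np.array([1] * 200 + [-1] * 200)
    order = rng.permutation(len(X))
    test_idx, train_idx = order[:80], order[80:]  # 80-20 split
    w, b = train_linear_svm(X[train_idx], y[train_idx])
    print(accuracy(y[test_idx], predict(X[test_idx], w, b)))
```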

Distribution of Offences

Notebook - `distribution.ipynb`

CSV file - `sentencing.csv`

Implementation

The number of times each offence was committed is displayed. Because there are many distinct offences, only the offences that together make up 75% of the data are plotted for easier visualisation. The counts of all offences are also plotted separately.
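Selecting the offences that together cover 75% of the records can be sketched like this. The function name and the sample data are hypothetical, and the notebook may compute the cutoff differently (for example with a dataframe's cumulative sum).

```python
from collections import Counter


def top_offences(offences, coverage=0.75):
    """Return (offence, count) pairs, most frequent first, until their
    combined count reaches `coverage` of all records."""
    counts = Counter(offences).most_common()
    total = sum(count for _, count in counts)
    selected, running = [], 0
    for name, count in counts:
        if running >= coverage * total:
            break
        selected.append((name, count))
        running += count
    return selected


if __name__ == "__main__":
    sample = ["theft"] * 6 + ["burglary"] * 3 + ["fraud"] * 1
    print(top_offences(sample))
```

The returned pairs can be passed directly to a bar plot, with the full `Counter` used for the plot of all offences.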


Table of Contents

Dataset

Illinois DOC labeled faces dataset

Objective

Steps to run the notebooks

Data Preprocessing

Notebook - `custom_dataset.ipynb`

CSV file - `person.csv`

Image folders - `/front` and `/side`

Implementation

Feature Detection

Notebook

Feature Extraction

Dimensionality reduction

BMI Prediction

Notebook - `bmi.ipynb`

Implementation

Gender Classification

Notebook - `gender.ipynb`

Implementation

Distribution of Offences

Notebook - `distribution.ipynb`

CSV file - `sentencing.csv`

Implementation

Authors
