Project from Udacity.
Final project of the 1st section
Original GitHub project
1st Part file (define the model, analyse parameters such as loss functions, optimizers, ...):
- Define parameters to tweak:
- Optimizers: Adam, SGD
- Loss Functions: MSE, MAE-Smooth, MAE
- Momentum/Betas for optimizers: Adam: (0.9, 0.999), (0.85, 0.999), (0.85, 0.95); SGD: (0.8, 0.9), 0.9, 0.8, 0.85
- Learning Rates: 0.0001, 0.001, 0.01
- Models: Model_v1.0, Model_v1.1, Model_v1.2, Model_v1.3
- Run the different combinations (252 runs) for 20 epochs each, storing the test MSE, MAE, and MAE-Smooth of every run in Neptune.ai (see the sweep sketch after this list)
- Analyse the runs and compute statistics using 2.1 NeptuneAnalysis.ipynb
- This also produces 2 images: the combinations of the different parameters, and the 3 metrics (MSE, MAE, MAE-Smooth) across all divisions
- Based on that, selected the 2 best models:
- Ran them for 1,000 epochs with early stopping (patience of 20 epochs)
- Also tested Xavier initialization and different batch sizes
- Completed the remaining requirements as requested
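The run count follows from the grid: 4 models × 3 loss functions × 3 learning rates × (3 Adam beta pairs + 4 SGD momentum settings) = 4 × 3 × 3 × 7 = 252. A minimal sketch of such a sweep with Neptune logging; the `train_and_eval` helper and the project name are hypothetical:

```python
import itertools
import neptune

MODELS = ["Model_v1.0", "Model_v1.1", "Model_v1.2", "Model_v1.3"]
LOSSES = ["MSE", "MAE-Smooth", "MAE"]
LEARNING_RATES = [0.0001, 0.001, 0.01]
# 3 beta pairs for Adam + 4 momentum settings for SGD = 7 optimizer configs.
OPTIMIZERS = (
    [("Adam", b) for b in [(0.9, 0.999), (0.85, 0.999), (0.85, 0.95)]]
    + [("SGD", m) for m in [(0.8, 0.9), 0.9, 0.8, 0.85]]
)

# 4 * 3 * 3 * 7 = 252 combinations, matching the run count above.
for model_name, loss_name, lr, (opt_name, opt_cfg) in itertools.product(
        MODELS, LOSSES, LEARNING_RATES, OPTIMIZERS):
    run = neptune.init_run(project="workspace/facial-keypoints")  # assumed project name
    run["parameters"] = {"model": model_name, "loss": loss_name,
                         "lr": lr, "optimizer": opt_name, "opt_cfg": str(opt_cfg)}
    # train_and_eval is hypothetical: trains for 20 epochs and
    # returns the three test metrics.
    mse, mae, mae_smooth = train_and_eval(model_name, loss_name, lr,
                                          opt_name, opt_cfg, epochs=20)
    run["test/MSE"] = mse
    run["test/MAE"] = mae
    run["test/MAE-Smooth"] = mae_smooth
    run.stop()
```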
2nd Part file:
- Load one of the best saved models (not included in the repo due to size limits: over 30 MB)
- Perform all the necessary transformations and plot the results (see the loading sketch below)
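A minimal loading sketch, assuming the checkpoint was saved with `torch.save(model.state_dict(), ...)`; the class and file names are hypothetical:

```python
import torch
from models import Model_v1_3  # hypothetical class name for one of the best models

model = Model_v1_3()
# map_location lets a GPU-trained checkpoint load on CPU-only machines too
state = torch.load("saved_models/best_model.pt", map_location="cpu")
model.load_state_dict(state)
model.eval()  # inference mode: disables dropout and batch-norm updates
```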
3rd Part file (bonus):
- Improve the filter placement in the picture (sketch after this sub-list):
- Take the filter and resize it
- Pad the filter to match the image size
- Rotate the filter according to the angle of the head
- Combine the filter with the image
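A minimal sketch of those four steps, assuming a 4-channel RGBA filter .png, a 3-channel target image, and a head angle already estimated from the keypoints; the function name and signature are hypothetical:

```python
import cv2
import numpy as np

def overlay_filter(image, filt, x, y, w, h, angle_deg):
    """Resize, pad, and rotate an RGBA filter, then alpha-blend it onto `image`."""
    filt = cv2.resize(filt, (w, h))

    # Pad: place the filter on a transparent canvas the size of the image,
    # at the target location (x, y).
    canvas = np.zeros((image.shape[0], image.shape[1], 4), dtype=filt.dtype)
    canvas[y:y + h, x:x + w] = filt

    # Rotate the padded filter around its own centre to follow the head angle.
    centre = (x + w / 2, y + h / 2)
    M = cv2.getRotationMatrix2D(centre, angle_deg, 1.0)
    canvas = cv2.warpAffine(canvas, M, (image.shape[1], image.shape[0]))

    # Combine: where the filter is opaque, take the filter's RGB pixels.
    alpha = canvas[:, :, 3:4] / 255.0
    return (image * (1 - alpha) + canvas[:, :, :3] * alpha).astype(image.dtype)
```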
- Smile Detection (sketch below):
- Take the mouth keypoints
- Draw a curved line
- Correct the rotation of the curved line
- Compare the upper and lower points of the mouth against its edges
- Based on that, classify the expression as Big-Smile, Smile, Neutral, or Sad
- Next steps: improve smile detection by taking more keypoints from the face
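A minimal sketch of the classification step, assuming the 68-point landmark layout used in the project (points 48/54 are the mouth corners, 51/57 the top and bottom lip centres) and a (68, 2) NumPy array of keypoints; the thresholds are hypothetical:

```python
import numpy as np

def classify_smile(keypoints, big=0.25, small=0.10, frown=-0.05):
    """Classify the expression from 68-point landmarks (thresholds are guesses).

    Compares the mouth-corner height with the mouth-centre height: corners
    higher than the centre line mean the mouth curves upwards (a smile).
    """
    left, right = keypoints[48], keypoints[54]      # mouth corners
    centre = (keypoints[51] + keypoints[57]) / 2.0  # mid-point between the lips
    width = np.linalg.norm(right - left)

    # y grows downwards in image coordinates, so corners *above* the centre
    # give a positive curvature value.
    curvature = (centre[1] - (left[1] + right[1]) / 2.0) / width

    if curvature > big:
        return "Big-Smile"
    if curvature > small:
        return "Smile"
    if curvature > frown:
        return "Neutral"
    return "Sad"
```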
- Swap Faces (sketch below):
- Draw a "green screen" ellipse on the surface of the face (taking the head angle into account)
- Build a filter with the shape of the ellipse by capturing the "green screen" region
- Put the pixels from image 2 into the ellipse of image 1
- Put the pixels from image 1 into the ellipse of image 2
- Next steps: improve face detection by taking more keypoints from the face, and try to smooth the result
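A minimal sketch of the mask-and-copy idea, assuming each face's ellipse (centre, axes, angle, as integer/float values usable by OpenCV) was already estimated from the keypoints. It also assumes both ellipses cover the same number of pixels, whereas the notebook resizes/aligns the regions first:

```python
import cv2
import numpy as np

def ellipse_mask(shape, centre, axes, angle):
    """Filled ellipse mask: the 'green screen' region over a face."""
    mask = np.zeros(shape[:2], dtype=np.uint8)
    cv2.ellipse(mask, centre, axes, angle, 0, 360, 255, thickness=-1)
    return mask

def swap_faces(img1, img2, face1, face2):
    """Swap the elliptical face regions of two images.

    face1/face2 are (centre, axes, angle) tuples estimated from the keypoints.
    """
    m1 = ellipse_mask(img1.shape, *face1).astype(bool)
    m2 = ellipse_mask(img2.shape, *face2).astype(bool)
    out1, out2 = img1.copy(), img2.copy()
    # Simplification: both ellipses must cover the same number of pixels;
    # in the notebook the regions are resized/aligned before copying.
    out1[m1] = img2[m2]
    out2[m2] = img1[m1]
    return out1, out2
```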
- K-Means face position (sketch below):
- Using the example from the tutorial, compute the vector between the observer and the face
- With that, determine where the observer is relative to the face (x and y)
- Based on that and a defined threshold, decide whether the person is looking Left/Central/Right and Upwards/Central/Downwards
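A minimal sketch of clustering those (x, y) pose vectors with OpenCV's k-means; K, the criteria values, and the stand-in data are assumptions:

```python
import cv2
import numpy as np

# Stand-in data: one (x, y) observer-to-face offset per image (the real
# values come from the keypoint geometry described above).
pose_vectors = np.float32(np.random.randn(200, 2))

criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 100, 0.2)
K = 3  # e.g. left-facing, central, right-facing
compactness, labels, centres = cv2.kmeans(
    pose_vectors, K, None, criteria, 10, cv2.KMEANS_RANDOM_CENTERS)
# `centres` holds the K most common poses; `labels` assigns each face to one.
```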
- Rotation Function (sketch below):
- Similar to the transforms already defined, created a class for rotation
- Also rotating the keypoints accordingly
- Plotted 2 examples using the class
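A minimal sketch of such a transform, assuming the `{'image', 'keypoints'}` sample-dict convention used by the project's other transforms (keypoints as an (N, 2) NumPy array):

```python
import cv2
import numpy as np

class Rotate:
    """Rotate the image about its centre and apply the same rotation to the keypoints."""

    def __init__(self, angle):
        self.angle = angle  # degrees, counter-clockwise

    def __call__(self, sample):
        image, keypoints = sample['image'], sample['keypoints']
        h, w = image.shape[:2]

        M = cv2.getRotationMatrix2D((w / 2, h / 2), self.angle, 1.0)
        image = cv2.warpAffine(image, M, (w, h))

        # Apply the same 2x3 affine matrix to the (x, y) keypoints.
        ones = np.ones((keypoints.shape[0], 1))
        keypoints = np.hstack([keypoints, ones]) @ M.T

        return {'image': image, 'keypoints': keypoints}
```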
- ✅ models.py
- ✅ Notebook 2: Defining and Training a Convolutional Neural Network (CNN) to Predict Facial Keypoints
- ✅ Notebook 3: Facial Keypoint Detection Using Haar Cascades and your Trained CNN
- ✅ 💥 Notebook 4: Extra Features - Fun with Keypoints.ipynb (Bonus)
Project Rubric (original link)
Criteria | Meets Specifications |
---|---|
✅ Define a CNN in `models.py`. | Define a convolutional neural network with at least one convolutional layer, i.e. `self.conv1 = nn.Conv2d(1, 32, 5)`. The network should take in a grayscale, square image. |
✅ Define a `data_transform` and apply it whenever you instantiate a DataLoader. | The composed transform should include: rescaling/cropping, normalization, and turning input images into torch Tensors. The transform should turn any input image into a normalized, square, grayscale image and then a Tensor for your model to take it as input. |
✅ Select a loss function and optimizer for training the model. | The loss and optimization functions should be appropriate for keypoint detection, which is a regression problem. |
✅ Train your model. | Train your CNN after defining its loss and optimization functions. You are encouraged, but not required, to visualize the loss over time/epochs by printing it out occasionally and/or plotting the loss over time. Save your best trained model. |
✅ All questions about model, training, and loss choices are answered. | After training, all 3 questions in notebook 2 about model architecture, choice of loss function, and choice of batch_size and epoch parameters are answered. |
✅ Apply a learned convolutional kernel to an image and see its effects. | Your CNN "learns" (updates the weights in its convolutional layers) to recognize features, and this step requires that you extract at least one convolutional filter from the trained model, apply it to an image, and see what effect this filter has on the image. |
✅ After visualizing a feature map, answer: what do you think it detects? | This answer should be informed by how the filtered image (from the step above) looks. |
✅ Use a Haar cascade face detector to detect faces in a given image. | The submission successfully employs OpenCV's face detection to detect all faces in a selected image. |
✅ Turn each detected image of a face into an appropriate input Tensor. | You should transform any face into a normalized, square, grayscale image and then a Tensor for your model to take in as input (similar to what the `data_transform` did in Notebook 2). |
✅ Predict and display the keypoints on each detected face. | After face detection with a Haar cascade and face pre-processing, apply your trained model to each detected face, and display the predicted keypoints on each face in the image. |
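For the first criterion, a minimal sketch of such a network. This illustrates the rubric requirement only, not the project's Model_v1.x architectures; the 224×224 input size and layer widths are assumptions:

```python
import torch.nn as nn
import torch.nn.functional as F

class Net(nn.Module):
    """Minimal CNN for facial keypoint regression: 1-channel square input, 136 outputs."""

    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 32, 5)   # the rubric's example layer
        self.conv2 = nn.Conv2d(32, 64, 3)
        self.pool = nn.MaxPool2d(2, 2)
        # For a 224x224 input: conv1 -> 220, pool -> 110, conv2 -> 108, pool -> 54.
        self.fc1 = nn.Linear(64 * 54 * 54, 136)  # 68 (x, y) keypoint pairs

    def forward(self, x):
        x = self.pool(F.relu(self.conv1(x)))
        x = self.pool(F.relu(self.conv2(x)))
        x = x.view(x.size(0), -1)
        return self.fc1(x)
```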
- Your home folder (including subfolders) must be less than 2 GB (/home/workspace)
- Your home folder (including subfolders) must be less than 25 MB to submit as a project.
- ✅ Initialize the weights of your CNN by sampling a normal distribution or by performing Xavier initialization so that a particular input signal does not get too big or too small as the network trains (see the sketch after this list).
- ✅ In Notebook 4, create face filters that add sunglasses, mustaches, or any .png of your choice to a given face in the correct location.
- ✅ Use the keypoints around a person's mouth to estimate the curvature of their mouth and create a smile recognition algorithm.
- ✅ Use OpenCV's k-means clustering algorithm to extract the most common facial poses (left, middle, or right-facing, etc.).
- ✅ Use the locations of keypoints on two faces to swap those faces.
- ✅ Add a rotation transform to our list of transformations and use it to do data augmentation.
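For the weight-initialization item above, a minimal sketch using PyTorch's built-in Xavier initializer; `Net` is the sketch network shown after the rubric table, standing in for the project's actual models:

```python
import torch.nn as nn

def init_weights(m):
    """Xavier-initialize conv and linear layers; zero their biases."""
    if isinstance(m, (nn.Conv2d, nn.Linear)):
        nn.init.xavier_uniform_(m.weight)
        nn.init.zeros_(m.bias)

model = Net()              # e.g. the sketch network shown after the rubric table
model.apply(init_weights)  # nn.Module.apply visits every submodule
```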