Distributed face recognition

Using kubernetes cluster

Quick Start

To deploy:

git clone https://github.com/GHRik/Distributed-k8s-face-recognition.git
cd Distributed-k8s-face-recognition/ansible
ansible-playbook -i inventory.yaml main.yaml

Features

Full automatization deploy:

install kubernetes and nvidia-docker pkg
Create cluster with calcio CNI and nvidia GPU scheduling
Deploy kubernetes resource (check describe section
Support CUDA 9.2, 10.0, 10.1, 10.2 11.0, 11.0.3, 11.1, 11.1.1, 11.3
Recognize face as distrubuted way

Used technology:

dlib - module to recognize face
cuda - to accelerate GPU card
ansible - to automatization create cluster
kubernetes - to create cluster
my docker hub repo - to store built images
kubernetes-sample-cluster - to pattern code
nvidia-docker - to passthrought my gpu to containers
Microsoft azure cloud - for testing
Calico - as CNI k8s plugin

Describe

This repo is reworked code from this repo so if you want any info about components or how everything works together , check this link

If you still dont know how it works, maybe this diagram will help you ;)

Where is distrubuted?

dlib have a Pool thread using to find face

Helping ansible tags

To deploy this code you can use ansible tags:

...

No install nvida-docker and kubernetes packages

ansible-playbook -i inventory.yaml main.yaml

...

Have cluster, but dont have deploy cluster face fecogniton from this repo

ansible-playbook -i inventory.yaml main.yaml --tags "deploy"

...

Have cluster, have deployed face recognition from this repo, but you make changes on kube files or known/unknown people images

ansible-playbook -i inventory.yaml main.yaml --tags "redeploy"

...

Have cluster, this face regoznition deployed, but images not load or is an error in "recognize" role

ansible-playbook -i inventory.yaml main.yaml --tags "recognize"

...

Have cluster before , have deployed face recognition, but want to recreate cluster

ansible-playbook -i inventory.yaml main.yaml --tags "destroy_cluster" 
ansible-playbook -i inventory.yaml main.yaml

...

Have deployed face recognition cluster, but want clear it:

ansible-playbook -i inventory.yaml main.yaml --tags: "destroy"

Cuda Support

This code support CUDA. In this case if you want deploy this cluster with CUDA support:

Check your GPU - which version CUDA your GPU is using

nvidia-smi

You will see output like this:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 465.19.01    Driver Version: 465.19.01    CUDA Version: 11.3     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA Tesla K80    Off  | 00000001:00:00.0 Off |                    0 |
| N/A   34C    P8    32W / 149W |      0MiB / 11441MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

This cluster was tested uising CUDA 11.3 version, but on my docker hub you can pull other version. Only one pod will be running using CUDA support face_recognition If you want change a CUDA version, change this line on other version:

face_recognition.yaml

30: image: ghrik/face_recognition:cuda11.3

This script using nvida-docker to deploy GPU Scheduling on k8s cluster. In this case you should uninstall your docker if you have.

Without CUDA Support

You can run this cluster without CUDA.

In this case you have to change

face_recognition.yaml

30: image: ghrik/face_recognition:1.0

Result from example

Results are in two pleaces:

Result.txt - If ansible end properly this file will be fill with the calculated time it takes to recognize a given face

$ cat results/results.txt

Server is on: http://10.98.219.249:8081
LOGS:
Checking image: unknown_people/unknown_02.PNG
Time: 0.4799957275390625 sec.

Checking image: unknown_people/unknown_03.PNG
Time: 0.6136119365692139 sec.

Checking image: unknown_people/unknown_04.PNG
Time: 0.5596208572387695 sec.

Checking image: unknown_people/unknown_01.PNG
Time: 0.46269893646240234 sec.

The first line from result.txt is a ip to frontend site. On this site you will see what faces have been recognized.

Prepare your own face database

As you can see this cluster is checking only faces in unknown_people dir. To make your own database with face you change do a small change in

ansible/kube_files/database_setup.sql

So the first step is a create relation people-face

insert into person (name) values('Damian');
insert into person (name) values('Barack');
insert into person (name) values('Duda');
insert into person (name) values('Lewy');

It is very simple, add only something like that

The next step is create relation picture from known_people - people_id

insert into person_images (image_name, person_id) values ('damian_01.PNG', 1);
insert into person_images (image_name, person_id) values ('damian_02.PNG', 1);
insert into person_images (image_name, person_id) values ('barack_01.jpg', 2);
insert into person_images (image_name, person_id) values ('barack_02.PNG', 2);
insert into person_images (image_name, person_id) values ('duda_01.PNG', 3);
insert into person_images (image_name, person_id) values ('duda_02.PNG', 3);
insert into person_images (image_name, person_id) values ('lewy_01.PNG', 4);
insert into person_images (image_name, person_id) values ('lewy_02.PNG', 4);

Debug / Known Bugs

In any case of error check for the first image_processor pod

kubectl logs image_processor

List_out_of range
Probably one of images (from unknown/known_people) does not have any face to recognize. In this case image_processor cant process this image.
Image_processor is not up
Sometimes a image_processor must have a more time to get up. You can see it if you run new cluster. Pulling image to pod can take a long time
No such file or directory on image processor pod
Sometimes face_recog_unknown_pvc is connected to face_recog_known_pv, rerun with "redeploy" tag
dont_delete dir in unknown_people
Dont delete end.jpg , it is corelated with show time all recognized faces.
Sleep 60 in recognize
Sometimes a other services need more time to get up. To fast deploy you can comment "sleep 60", and after failed deploy recognize, rerun with tag: "recognize"
Circuitbreaker is engaged
It means you have more than 5images in unknown_people dir. Probably it will unfreeze if not, you can add sleep function in

ansible/roles/recognize/tasks/main.yaml

40: shell: sleep 10 && curl -d '{"path":"{{ item.path }}"}' http://{{ receiver_ip.stdout }}:8000/image/post

Or add fewer face pictures ;)

Core dump using without CUDA image
ghrik/face_recognition:1.0 was builded with AVX acceleration. All of CUDA images is using SSE4 (not AVX) If you want to use dlib without AVX acceleration check flags in dlib section:

images/face_recognitionGPU/Dockerfile

and colerate this with

images/face_recognition/Dockerfile

License

Free to use ;)

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
ansible		ansible
images		images
need_jpg		need_jpg
nvidiaJetsonTestMPI		nvidiaJetsonTestMPI
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed face recognition

Using kubernetes cluster

Table of contents

Quick Start

Features

Used technology:

Describe

Where is distrubuted?

Helping ansible tags

Cuda Support

Without CUDA Support

Result from example

Prepare your own face database

Debug / Known Bugs

License

About

Releases 2

Packages

Languages

GHRik/Distributed-k8s-face-recognition

Folders and files

Latest commit

History

Repository files navigation

Distributed face recognition

Using kubernetes cluster

Table of contents

Quick Start

Features

Used technology:

Describe

Where is distrubuted?

Helping ansible tags

Cuda Support

Without CUDA Support

Result from example

Prepare your own face database

Debug / Known Bugs

License

About

Topics

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages