Members: Daniel Fischer, Mo Habib, Fazliddin Naimov, Kevin Freehill, Dahmane Skendraoui, Peter Kim
Training a convolutional neural network to recognize The Simpsons characters. Our approach to this problem is based on convolutional neural networks (CNNs): multi-layered feed-forward neural networks able to learn many features.
Technology Stack Used:
- Python
- HTML/CSS/Bootstrap/JavaScript
- Keras
- Tensorflow
- Flask
We used the Kaggle Simpsons dataset, which contains pictures of more than 40 characters. For training, we only used characters that have more than 290 images.
import glob

# Keep only the characters that have more than 290 labeled images
# ('edited' / 'pic_vid' files) in their folder
characters = [k.split('/')[2] for k in glob.glob('./characters/*')
              if len([p for p in glob.glob(k + '/*')
                      if 'edited' in p or 'pic_vid' in p]) > 290]
The model was trained to classify 18 characters only; here is the list:
- Abraham (Grampa) Simpson
- Apu Nahasapeemapetilon
- Bart Simpson
- Charles Montgomery Burns
- Chief Wiggum
- Comic Book Guy
- Edna Krabappel
- Homer Simpson
- Kent Brockman
- Krusty the Clown
- Lisa Simpson
- Marge Simpson
- Milhouse Van Houten
- Moe Szyslak
- Ned Flanders
- Nelson Muntz
- Principal Skinner
- Sideshow Bob
We split the data into train and test sets using the get_dataset function from train.py:
import imp
import train

imp.reload(train)
X_train, X_test, y_train, y_test = train.get_dataset(save=True)
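get_dataset itself lives in train.py and is not reproduced here. As a rough sketch of what it does (assumptions on our part: images are read with OpenCV, resized to the model's 64x64 input, labels are one-hot encoded, and the split is done with scikit-learn), it might look like:

# Hypothetical sketch of train.get_dataset; the real implementation may differ.
import glob
import cv2
import numpy as np
from sklearn.model_selection import train_test_split
from keras.utils import np_utils

def get_dataset(save=False):
    X, y = [], []
    for label, char in enumerate(characters):
        for path in glob.glob('./characters/{}/*'.format(char)):
            X.append(cv2.resize(cv2.imread(path), (64, 64)))
            y.append(label)
    X = np.array(X)
    y = np_utils.to_categorical(np.array(y), len(characters))
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.15)
    if save:
        # Persist the split for reuse (hypothetical; the real save logic is not shown)
        np.savez('./dataset.npz', X_train=X_train, X_test=X_test,
                 y_train=y_train, y_test=y_test)
    return X_train, X_test, y_train, y_test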
We used a feed-forward network with 4 convolutional layers (ReLU activations) followed by a fully connected hidden layer. The model iterated over batches of the training set (batch size: 32) for 200 epochs. We also used data augmentation, which applies several random transformations to the pictures, so the model never sees the exact same picture twice. This helps prevent overfitting and allows the model to generalize better.
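The model definition is not reproduced in this write-up; the following is a minimal sketch of a 4-convolutional-layer Keras model in the spirit described above (filter counts, kernel sizes, the dropout rate, and the 512-unit dense layer are assumptions, not the exact values we used):

from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout

model = Sequential()
# Two convolution blocks with ReLU activations (filter counts are assumptions)
model.add(Conv2D(32, (3, 3), activation='relu', input_shape=(64, 64, 3)))
model.add(Conv2D(32, (3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(Conv2D(64, (3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
# Fully connected hidden layer, then a softmax over the 18 characters
model.add(Flatten())
model.add(Dense(512, activation='relu'))
model.add(Dropout(0.5))
model.add(Dense(18, activation='softmax'))
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])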
from keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(
    featurewise_center=False,             # set input mean to 0 over the dataset
    samplewise_center=False,              # set each sample mean to 0
    featurewise_std_normalization=False,  # divide inputs by the dataset std
    samplewise_std_normalization=False,   # divide each input by its own std
    rotation_range=0,                     # randomly rotate images (degrees; 0 = disabled)
    width_shift_range=0.1,                # randomly shift images horizontally
    height_shift_range=0.1,               # randomly shift images vertically
    horizontal_flip=True,                 # randomly flip images horizontally
    vertical_flip=False)                  # do not flip images vertically
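Hooking the generator into training would then look roughly like this (a sketch assuming the standard Keras fit_generator flow, with the batch size and epoch count stated above):

# Train on augmented batches; X_train/y_train come from the earlier get_dataset call.
model.fit_generator(datagen.flow(X_train, y_train, batch_size=32),
                    steps_per_epoch=len(X_train) // 32,
                    epochs=200,
                    validation_data=(X_test, y_test))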
[Figure: loss and accuracy on the training and validation sets during training]
The per-character F1-score is good: above 90% for every character except Lisa, whose precision is 82%. Lisa is probably being confused with other characters.
To let the network capture more detail and complexity, we can add more convolutional layers. We tried 6 convolutional layers and a deeper architecture (output-space dimensions of 32, 64, 512 for the 4-layer model vs. 32, 64, 256, 1024 for the 6-layer one); a sketch follows below. This improved the precision and recall, as you can see in the results. The lowest precision is 0.89, for Nelson Muntz, a character for whom we only had 300 training examples. Moreover, this model converges faster: only 40 epochs (vs. 200).
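The deeper variant would extend the earlier sketch with a third convolution block. This is our reading of the dimensions above (conv blocks of 32, 64, and 256 filters, then a 1024-unit dense layer); the exact layer shapes are an assumption:

# Hypothetical 6-conv-layer variant of the earlier model sketch.
deeper = Sequential()
deeper.add(Conv2D(32, (3, 3), activation='relu', input_shape=(64, 64, 3)))
deeper.add(Conv2D(32, (3, 3), activation='relu'))
deeper.add(MaxPooling2D(pool_size=(2, 2)))
deeper.add(Conv2D(64, (3, 3), activation='relu'))
deeper.add(Conv2D(64, (3, 3), activation='relu'))
deeper.add(MaxPooling2D(pool_size=(2, 2)))
deeper.add(Conv2D(256, (3, 3), activation='relu'))
deeper.add(Conv2D(256, (3, 3), activation='relu'))
deeper.add(MaxPooling2D(pool_size=(2, 2)))
deeper.add(Flatten())
deeper.add(Dense(1024, activation='relu'))
deeper.add(Dropout(0.5))
deeper.add(Dense(18, activation='softmax'))
deeper.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])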
We created two functions that use the model to predict a character from an image file and from a URL:
import cv2
import numpy as np
import matplotlib.pyplot as plt

def file_predict(image_path, all_perc=False):
    # Load the image, display it, resize to the model's 64x64 input, and predict.
    image = cv2.imread(image_path)
    img = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    plt.imshow(img)
    plt.show()
    pic = cv2.resize(image, (64, 64))
    a = model.predict_proba(pic.reshape(1, 64, 64, 3))[0]
    if all_perc:
        # Print every character's probability, highest first
        print('\n'.join(['{} : {}%'.format(map_characters[i], round(k * 100))
                         for i, k in sorted(enumerate(a), key=lambda x: x[1], reverse=True)]))
    else:
        return map_characters[np.argmax(a)].replace('_', ' ').title()
def url_predict(url, all_perc=False):
    # Same as file_predict, but fetches the image from a URL first.
    image = url_to_image(url)
    img = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    plt.imshow(img)
    plt.show()
    pic = cv2.resize(image, (64, 64))
    a = model.predict_proba(pic.reshape(1, 64, 64, 3))[0]
    if all_perc:
        print('\n'.join(['{} : {}%'.format(map_characters[i], round(k * 100))
                         for i, k in sorted(enumerate(a), key=lambda x: x[1], reverse=True)]))
    else:
        return map_characters[np.argmax(a)].replace('_', ' ').title()
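The url_to_image helper used above is not shown in this write-up; a common implementation (an assumption on our part) downloads the bytes and decodes them with OpenCV:

import urllib.request

# Hypothetical implementation of the url_to_image helper referenced above.
def url_to_image(url):
    resp = urllib.request.urlopen(url)
    data = np.asarray(bytearray(resp.read()), dtype=np.uint8)
    return cv2.imdecode(data, cv2.IMREAD_COLOR)  # BGR image, like cv2.imread

For example (hypothetical paths and URLs):

print(file_predict('./characters/homer_simpson/pic_0001.jpg'))  # e.g. 'Homer Simpson'
url_predict('https://example.com/homer.png', all_perc=True)     # prints all class percentages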
We used a JavaScript/HTML frontend with a Flask backend server written in Python. The backend comprises two endpoints, as displayed below: the first renders the page, while the second handles prediction requests sent from the frontend. Each request includes a representation of a picture file that is decoded and converted into a NumPy array to pass through the model. The prediction returned by the model is then relayed back to the frontend as a string, completing the request.
@app.route("/")
def index():
return render_template("index.html")
@app.route('/predict', methods=['GET', 'POST'])
def predict():
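    # The handler body is elided in this write-up; the lines below are a
    # hypothetical sketch (assumptions: the upload arrives as a file under the
    # form key 'file', and model and map_characters from earlier are in scope).
    data = np.frombuffer(request.files['file'].read(), dtype=np.uint8)
    image = cv2.imdecode(data, cv2.IMREAD_COLOR)  # decode into a NumPy array
    pic = cv2.resize(image, (64, 64))             # match the model's input size
    probs = model.predict_proba(pic.reshape(1, 64, 64, 3))[0]
    return map_characters[np.argmax(probs)].replace('_', ' ').title()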
The HTML page comprises two buttons. When the user clicks Predict, JavaScript packages the file and sends it to the server for prediction:
$('#btn-predict').click(function () {
    var form_data = new FormData($('#upload-file')[0]);

    // Show loading animation
    $(this).hide();
    $('.loader').show();

    // Make prediction by calling the /predict API
    $.ajax({
        type: 'POST',
        url: '/predict',
        data: form_data,
        contentType: false,
        cache: false,
        processData: false,
        async: true,
        success: function (data) {
            // Get and display the result
            $('.loader').hide();
            $('.output-container').show();
            $('#result').fadeIn(600);
            $('#result').text(' This is ' + data);
            console.log(data);
        },
    });
});