Skip to content

Document Cleaner is a deep convolutional autoencoder model for generating clean form of dirty documents.

Notifications You must be signed in to change notification settings

DevMilk/Document-Cleaner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Document-Cleaner

Document Cleaner is a deep convolutional autoencoder model for generating a clean form of dirty documents.

Autoencoders can be used for ignoring the noise from image data. Here is a work for it.

Model trained for 300 epochs and its last loss is 0.0030. Loss function is "mean squared error" and optimizer is a "adamax" with 0.001 learning rate. Last layer's activation function is a custom function which maps values between -0.5 and 0.5 and other layers' activation functions are linear function.


Other solutions to improve model:

-Getting more data by editing train images with rotating, transforming etc.

-Making experiments on more optimizer and more hyperparameters

-Trying more neuron counts on layers


Here is the loss plot of model:

alt text


Cleaned Dirty Document examples:

alt text

alt text

alt text

alt text

alt text

alt text

alt text

alt text

About

Document Cleaner is a deep convolutional autoencoder model for generating clean form of dirty documents.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages