Gcash_screenshot_parser

About

A proof of concept aimed at automating retrieval of key information in gcash screenshots such as referrence number, amount tendered and date and time. Additionally the user is able to export these to and xlsx file (only supported file format so far)

Use case

Although there are many other ways of obtaining such key information without going through OCR, this tool is intended for small business owners or establishments who have to rely on using screenshots/photos or manually writing on a notebook as their main mode of book keeping. Hopefully, it should reduce the work load at the end of the work day when one has to go through the sales for the day.

EDIT: As per some suggestions, I've included a feature that enables this tool to also parse the transaction history PDF that you can request from gcash themselves. This will require the user to enter their password since the PDF sent are usually secure

Usage

To run the tool you may simply run it on the terminal as so:

py .\main.py

GUI

You should be greeted with the tkinter gui where you can select the screenshot files to parse.

After selecting a single/multiple files it will go ahead and parse it. When satisfied you can export this to .xlsx.

Please note that I've sensored my reference numbers for privacy reasons.

Dependencies

This tool relies on the tesseract OCR engine that for the actual Optical Character recognition . I used this python wrapper in particular for tesseract. Its important to note that you need to put this tessdata folder in the ./data since this is where I've pointed the engine to look. Lastly xlsxwriter was used to export things to .xlsx for use with excel. Other than this you would need PyPDF2 for the new PDF reading feature.

Change log

11/14/2022 : Initial release and minor bug fixes
11/24/2022 : Pointed path to tessdata to relative path in repo.
12/26/2022 : Started using dateutil parser to guess date formats
12/27/2022 : Added ability to parse transaction history PDF using PyPDF2

To do

Error throwing for when password is incorrect

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gcash_screenshot_parser

About

Use case

Usage

GUI

Dependencies

Change log

To do

About

Releases

Packages

Languages

License

AJAbanto/Gcash_screenshot_parser

Folders and files

Latest commit

History

Repository files navigation

Gcash_screenshot_parser

About

Use case

Usage

GUI

Dependencies

Change log

To do

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages