Pabst

Pabst is named after the legendary American beer 'Pabst Blue Ribbon', because PBR = Parsing Bank Records

Notes:

uses the Dash framework
pdf to text(exp_text_parser.py) is currently better than ocr, used as default by main_parser.py
Put all pdf files inside the ./parsing directory to convert to csv. The csv files are output in ./data

Dev

/data holds input csv data

Installing viz-only, without Docker

For Windows, first install miniconda (follow setup to create env also) and then install dash with conda:

conda install -c conda-forge pandas dash dash-html-components dash-core-components

Other:

pip install -r requirements.txt
pip install pandas

Running:

activate [name-of-env] # switch to conda env
python app.py   # Full main app

Installing OCR deps, without Docker

Tips: use Python 3, make sure if you have 64-bit python, install 64-bit dependency versions. Similarly, 32-bit dep versions for 32-bit python.

source venv/bin/activate # Remember to activate your virtualenv
pip install -r requirements.txt
Install imagemagick and libimagemagickdev (differs by platform). Available via apt on Ubuntu
Install tesseract (see above.)

Installing via Docker

PARSING WITH DOCKER

Install Docker (and Virtualbox if applicable). May need to install Docker Legacy and Docker Toolbox for older machines.
Run ./setup_pabst to set up the docker container. Warning, this will take a while.
Run ./pabst <FILENAME> to run ocr parsing on a file.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
data		data
parsing		parsing
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
multi.py		multi.py
pabst		pabst
requirements.txt		requirements.txt
setup_pabst		setup_pabst

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pabst

Notes:

Dev

Installing viz-only, without Docker

Installing OCR deps, without Docker

Installing via Docker

PARSING WITH DOCKER

About

Releases

Packages

Contributors 4

Languages

hackNY-labs-2018/bank-parser-dash

Folders and files

Latest commit

History

Repository files navigation

Pabst

Notes:

Dev

Installing viz-only, without Docker

Installing OCR deps, without Docker

Installing via Docker

PARSING WITH DOCKER

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages