Skip to content

runawayhorse001/PySparkAudit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PySparkAudit: PySpark Data Audit

This is our PySpark Data Audit library repositories. The API can be found at: https://runawayhorse001.github.io/PySparkAudit. If you want to install it, you can

Install from PyPI

You can install the PySparkAudit from PyPI:

pip install PySparkAudit

Install from Repo

Or you can istall the PySparkAudit from the repo:

1. clone

git clone https://github.com/runawayhorse001/PySparkAudit.git

2. install

cd PySparkAudit
pip install -r requirements.txt 
python setup.py install

3. test

cd PySparkAudit/test
spark-submit test.py

4. uninstall

pip uninstall PySparkAudit