This is our PySpark Data Audit library repositories.
The API
can be found at: https://runawayhorse001.github.io/PySparkAudit.
If you want to install it, you can
You can install the PySparkAudit from PyPI:
pip install PySparkAudit
Or you can istall the PySparkAudit from the repo:
git clone https://github.com/runawayhorse001/PySparkAudit.git
cd PySparkAudit
pip install -r requirements.txt
python setup.py install
cd PySparkAudit/test
spark-submit test.py
pip uninstall PySparkAudit