This repo was created to accompany the article on Catch Your Data Quality Issues Early in The PR Review with PipeRider.
PipeRider allows you to understand how your existing data would change with the new incoming data and code changes in a comparison report. This comparison report can then be attached in the PR comment as part of the CI process.
PipeRider main use-case is as part of a dbt project. The value of PipeRider in your dbt project should be most evident when you integrate it in the CI pipeline. The process of profiling and testing your data models, detecting changes or differences in data models between different environments and posting the data profile comparison report in the PR comment can all be automated as part of your dbt project's CI process.
With its comparison report, data and analytics engineers are able to:
- Merge confidently by catching the new data or code changes that lead to data quality issues before they enter production environment.
- Validate every source code change impact on the data to ensure the changes applied do not break any downstream data applications such as BI dashboard.
This certainly improve your productivity as a dbt developer.








