@daniel-acuna put database on Amazon S3 and can be downloaded to `dedupe_output` folder as follows: ``` aws s3 cp s3://grant-dataset/dedupe/ dedupe_output/ --recursive ``` We want to create a script that merge these dataset all together.