Skip to content

Latest commit

 

History

History
27 lines (26 loc) · 830 Bytes

README.md

File metadata and controls

27 lines (26 loc) · 830 Bytes
  1. Python dependencies: pip3 install duckdb pandas tabulate

  2. Download dataset from s3: s3://alex-datasets/dmv/dmv_fuel_type_passengers.csv

  3. Copy the downloaded dataset into the root directory of the repo.

  4. Create a copy of the template (group_template), please prefix your group folder with group_:

    cp -rf group_template group_lightning_speed
  5. Run your solution:

    python3 group_lightning_speed/aggregation.py
  6. Run all solutions:

    aggregation

    python3 gather_results.py aggregation `ls -d group_*`

    join

    python3 gather_results.py join `ls -d group_*`
  7. Checkin:

    git add group_lightning_speed
    git commit -m "updated group_lightning_speed"
    git fetch && git rebase origin/main && git push origin main