Skip to content

utndatasystems/cloud-db-joins-and-aggregations

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

  1. Python dependencies: pip3 install duckdb pandas tabulate

  2. Download dataset from s3: s3://alex-datasets/dmv/dmv_fuel_type_passengers.csv

  3. Copy the downloaded dataset into the root directory of the repo.

  4. Create a copy of the template (group_template), please prefix your group folder with group_:

    cp -rf group_template group_lightning_speed
  5. Run your solution:

    python3 group_lightning_speed/aggregation.py
  6. Run all solutions:

    aggregation

    python3 gather_results.py aggregation `ls -d group_*`

    join

    python3 gather_results.py join `ls -d group_*`
  7. Checkin:

    git add group_lightning_speed
    git commit -m "updated group_lightning_speed"
    git fetch && git rebase origin/main && git push origin main

About

Test harness for join and aggregation exercises.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published