Skip to content

Commit

Permalink
add countries_code seed file
Browse files Browse the repository at this point in the history
  • Loading branch information
lironesamoun committed Apr 17, 2023
1 parent 3d06eb7 commit ebf102e
Show file tree
Hide file tree
Showing 2 changed files with 35 additions and 1 deletion.
33 changes: 32 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,33 @@
# data-engineering-capstone-project

Cloud: GCP
Infrastructure as code (IaC): Terraform
Workflow orchestration: Prefect
Data Wareshouse: BigQuery
Data transformation: DBT
Batch processing: Spark

Problem description
[Problem is well described and it's clear what the problem the project solves]

Cloud
[The project is developed in the cloud and IaC tools are used]

Data ingestion: Batch / Workflow orchestration
[End-to-end pipeline: multiple steps in the DAG, uploading data to data lake]

Data warehouse
[Tables are partitioned and clustered in a way that makes sense for the upstream queries (with explanation)]

Transformations (dbt, spark, etc)
[Tranformations are defined with dbt, Spark or similar technologies]

Dashboard
[A dashboard with 2 tiles]

Reproducibility
[Instructions are clear, it's easy to run the code, and the code works]



to do
Expand All @@ -9,4 +37,7 @@ Create a new service account
Storage Object Admin
Compute Storage Admin

Create and download the json key file
Create and download the json key file


dbt run --select global_terrorism_lite --vars '{"is_test_run": "true"}'
3 changes: 3 additions & 0 deletions dbt/seeds/countries_code.csv
Git LFS file not shown

0 comments on commit ebf102e

Please sign in to comment.