Data Warehouse of Nigerian Movies

Run this project periodically

Many items are gotten as many-in-one rather than one-in-one, so naturally default_output_processor is to remove all nextline characters from each matching result then join them all with a semicolon.
title: Filter out movies that are actually episodes of a show and make null.
ratings: Take first non-null result.
num_ratings: Convert numbers to full & actual values.

10K → 10000

3.2M → 3200000
genre: Make null and filter out if it is either Music, Talk-Show, Documentary or Short.
release_date: Fill a random month if movie has none and a constant day of 1, so date can be parsed correctly.

1995 → September 1, 1995

March 2012 → March 1, 2012
duration: Convert running time written separately and in text to equivalent in minutes.

1h 30m → 90

2h → 120

Data is loaded to a PostgresDB with data types, constraints & rules for INSERT specified.