Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

initial route variance identification work #974

Merged
merged 292 commits into from
Dec 20, 2023
Merged

initial route variance identification work #974

merged 292 commits into from
Dec 20, 2023

Conversation

csuyat-dot
Copy link
Contributor

@csuyat-dot csuyat-dot commented Dec 19, 2023

initial work related to #924.

Determined that fct_monthly_routes, dim_providers_gtfs_data and helpers.import_scheduled_trips may give us the best chance at observing route variances over time.

Started with joining fct_monthly_routes and dim_providers_gtfs_data. Filtered down to April 2023 to September 2023 for Sacramento Schedule. Observed that 1 gtfs shcedule dataset key was found.

Then looked into helpers.import_shcedule_trips. Also filtered down to the same timeframe and provider. but found 2 gtfs schedule dataset keys and no data was found for June 2023.

Tested joining the fct_month_routes/dim_providers_gfts_data & helpers.import_schedule_trips, but wondering if helpers.import_schedule_trips needs further investigation.

tiffanychu90 and others added 30 commits November 6, 2023 22:22
Also, rollback portfolio.py to remove rt site special handling
No deploy yet, getting Netlify error...
@csuyat-dot csuyat-dot merged commit 3ad0ce8 into main Dec 20, 2023
3 checks passed
@csuyat-dot csuyat-dot deleted the cs_gtfs branch December 20, 2023 17:24
@tiffanychu90
Copy link
Member

Notes here for continuing work on this issue:

  • don't use fct_monthly_routes for this, compile across multiple dates using helpers.import_scheduled_trips.
  • differences we're interested in (if we have 4 dates):
    • route_id: check nunique(gtfs_dataset_key-route_id combination) > 1.
    • if nunique == 4, then route_id is changing every month, if nunique == 2, then route_id changes once in this period
    • check nunique(gtfs_dataset_key-route_long_name-route_short_name) > 1
  • pick a couple examples to show where this is true for all operators

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants