Skip to content

Set retention on client_level tables for glean_usage tables #6815

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 10 commits into from
Feb 5, 2025
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@ owners:
- dpalmeiro@mozilla.com
labels:
incremental: true
table_type: client_level
scheduling:
dag_name: bqetl_pageload_v1
task_name: firefox_desktop_pageload_1pct__v1
Expand All @@ -16,7 +17,7 @@ bigquery:
type: day
field: submission_timestamp
require_partition_filter: true
expiration_days: 780
expiration_days: 775
range_partitioning: null
clustering:
fields:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@ owners:
- dpalmeiro@mozilla.com
labels:
incremental: true
table_type: client_level
scheduling:
dag_name: bqetl_pageload_v1
task_name: firefox_desktop_pageload_nightly__v1
Expand All @@ -16,6 +17,6 @@ bigquery:
type: day
field: submission_timestamp
require_partition_filter: true
expiration_days: 780
expiration_days: 775
range_partitioning: null
references: {}
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,9 @@ description: |-
labels:
application: firefox
schedule: daily
table_type: client_level
owners:
- jklukas@mozilla.com
- ascholtz@mozilla.com
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this table have expiration set?

scheduling:
dag_name: bqetl_main_summary
start_date: "2021-01-19"
Expand All @@ -19,6 +20,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: 775
clustering:
fields:
- sample_id
Original file line number Diff line number Diff line change
Expand Up @@ -6,12 +6,12 @@ description: |-
We join main ping and event data together in clients_last_seen_joined_v1,
which then becomes the source for the user-facing view.
owners:
- jklukas@mozilla.com
- ascholtz@mozilla.com
labels:
application: firefox
schedule: daily
dag: bqetl_main_summary
owner1: jklukas
owner1: ascholtz
scheduling:
dag_name: bqetl_main_summary
start_date: '2021-01-19'
Expand All @@ -22,7 +22,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: null
expiration_days: 775
clustering:
fields:
- sample_id
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,8 @@ labels:
application: firefox
schedule: daily
dag: bqetl_main_summary
owner1: ascholtzs
owner1: ascholtz
table_type: client_level
scheduling:
dag_name: bqetl_main_summary
start_date: '2021-01-19'
Expand All @@ -27,7 +28,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: false
expiration_days: null
expiration_days: 775
range_partitioning: null
clustering:
fields:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@ owners:
labels:
incremental: true
schedule: daily
table_type: client_level
scheduling:
dag_name: bqetl_glean_usage
task_group: {{ app_name }}
Expand All @@ -19,6 +20,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: 775
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just out of curiosity, what number should be used for expiration days? I have seen 780 as well:

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the recent data retention sync meeting, we discussed and decided to use 775. The above PR was before a number was finalized. Open to suggestions or discussions if needed.

Copy link
Collaborator

@scholtzan scholtzan Jan 15, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we change the 780 to 775 in a separate PR? To make it consistent?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have changed 780 to 775 for a couple of tables. This file is deprecated and I will add it to my to-be-deleted list
ql/moz-fx-data-shared-prod/firefox_ios_derived/active_users_aggregates_v1/metadata.yaml#L23

clustering:
fields:
- normalized_channel
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@ owners:
labels:
incremental: true
schedule: daily
table_type: client_level
scheduling:
dag_name: bqetl_glean_usage
task_group: {{ app_name }}
Expand All @@ -21,6 +22,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: 775
clustering:
fields:
- normalized_channel
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,7 @@ owners:
labels:
incremental: true
schedule: daily
table_type: client_level
scheduling:
dag_name: bqetl_glean_usage
task_group: {{ app_name }}
Expand All @@ -25,6 +26,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: 775
clustering:
fields:
- normalized_channel
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@ owners:
labels:
incremental: true
schedule: daily
table_type: client_level
scheduling:
dag_name: bqetl_glean_usage
task_group: {{ app_name }}
Expand All @@ -26,6 +27,7 @@ bigquery:
type: day
field: submission_timestamp
require_partition_filter: true
expiration_days: 775
clustering:
fields:
- sample_id
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@ owners:
labels:
incremental: true
schedule: daily
table_type: client_level
scheduling:
dag_name: bqetl_glean_usage
task_group: {{ app_name }}
Expand All @@ -15,6 +16,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: 775
clustering:
fields:
- normalized_channel
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@ owners:
labels:
incremental: true
schedule: daily
table_type: client_level
scheduling:
dag_name: bqetl_glean_usage
task_group: {{ app_name }}
Expand All @@ -20,6 +21,7 @@ bigquery:
type: day
field: submission_date
require_partition_filter: true
expiration_days: 775
clustering:
fields:
- normalized_channel
Expand Down