Skip to content

Data release: release-20260205#1763

Open
ajaits wants to merge 2 commits intodatacommonsorg:masterfrom
ajaits:release-20260205
Open

Data release: release-20260205#1763
ajaits wants to merge 2 commits intodatacommonsorg:masterfrom
ajaits:release-20260205

Conversation

@ajaits
Copy link
Contributor

@ajaits ajaits commented Feb 5, 2026

Mixer data release: release-20260205

@ajaits ajaits requested a review from hareesh-ms February 5, 2026 12:47
@gemini-code-assist
Copy link
Contributor

Summary of Changes

Hello @ajaits, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request delivers a comprehensive update to the system's data foundation, ensuring that users have access to the most current and expanded statistical information. It integrates new census and projection data from several countries, introduces a wealth of new demographic and economic indicators, and refines existing data points and structural metadata for improved accuracy and utility.

Highlights

  • Data Updates and Version Bumps: Bigtable and BigQuery dataset versions have been updated to 2026-02-05 and 2026-01-31 respectively. Population and unemployment statistics for the USA, California, and various US states have been refreshed with data extending into 2025 and 2026. Several health-related percentages for California now include historical data from 2008-2017.
  • New Statistical Variables: Added numerous statistical variables related to demographics (e.g., COVID-19 infection counts, place of birth, languages spoken at home, foreign background, average age of females, population by age and poverty status) and economic activities (exports, imports, trade balance, two-way trade with USA categorized by NAICS codes). New COVID-19 related medical condition and mortality event counts have also been introduced.
  • Data Source Additions: New data sources including Bulgaria_Census, NITIIndiaPopulationProjection, and Poland_Census have been integrated, expanding the geographical and thematic coverage of available data.
  • Structural and Typo Corrections: Corrected a typo in a statistical variable name from 'setMaxillofacial' to 'Maxillofacial'. Several geoId entries in USA_county.json and limit.json have been streamlined by removing redundant 'name' fields or entire entries. The place_types field has been added to various geoId entries in CA_County.json and max_temprature.json for better categorization.
Changelog
  • deploy/storage/base_bigtable_info.yaml
    • Updated Bigtable table versions to reflect data up to 2026-02-05.
  • deploy/storage/bigquery.version
    • Updated the BigQuery dataset version to dc_kg_2026_01_31_00_09_37.
  • internal/server/place/golden/get_related_locations/county.json
    • Adjusted UnemploymentRate_Person ranks from top and bottom.
  • internal/server/stat/golden/get_stat_all/branch.json
    • Corrected unemployment rate values for 2021-11, 2022-02, 2023-09, 2024-05, 2024-12.
    • Added new unemployment rate entries for 2025-11 and 2025-12.
  • internal/server/stat/golden/get_stat_all/result.json
    • Updated US population data for 2024 and added 2025 data.
    • Updated California population data for 2022, 2023, 2024 and added 2025 data.
  • internal/server/stat/golden/get_stats/census_pep.json
    • Updated US population data for 2024 and added 2025 data.
    • Updated California population data for 2022, 2023, 2024 and added 2025 data.
  • internal/server/statvar/golden/search_statvar/count_person.json
    • Corrected a typo in 'oral andsetMaxillofacial Studies' to 'oral and Maxillofacial Studies'.
    • Added new statistical variables related to COVID-19, place of birth (Finland, area of residence), languages spoken at home (Finnish, Sami, Swedish), and foreign background.
    • Removed several statistical variables related to 'Male Population' and 'Monoracial Population' in group quarters.
  • internal/server/statvar/golden/search_statvar/fem.json
    • Added 'Average age of females'.
    • Removed 'Disease burden (number of disability-adjusted life years (DALYs) [Trichomoniasis, Female]'.
  • internal/server/statvar/golden/search_statvar/poor.json
    • Added 'Population: 5 - 17 Years, Poor'.
  • internal/server/statvar/golden/search_statvar/women.json
    • Added new statistical variables related to pregnant women and COVID-19.
  • internal/server/v0/placestatvar/golden/get_place_stat_vars/alb.json
    • Added numerous statistical variables related to economic activity (exports, imports, trade balance, two-way trade) with the USA, categorized by NAICS codes.
    • Added several COVID-19 related medical condition and mortality event count statistical variables.
  • internal/server/v0/statpoint/golden/get_stat_value/count_person.json
    • Updated the value for Count_Person to 341784857.
  • internal/server/v0/statpoint/golden/get_stat_value/umemployed.json
    • Updated the value for Count_Person_Unemployed to 7503000.
  • internal/server/v0/triple/golden/get_triples/limit.json
    • Removed entries for 'Chugach Census Area', 'Copper River Census Area', 'Capitol Planning Region', and 'Greater Bridgeport Planning Region'.
  • internal/server/v0/triple/golden/get_triples/limit1.json
    • Updated importTime object value.
    • Swapped subjectId and subjectName for Acre1000Onwards and AbsoluteVorticity_Place_0.01Millibar, and their subjectTypes.
  • internal/server/v0/triple/golden/get_triples/place_type.json
    • Added a new triple defining tradePartner as a Property with Country as its rangeIncludes.
  • internal/server/v1/info/golden/bulk_variable_group_info/sqlite.json
    • Updated descendentStatVarCount for Economy (21905 to 21906), Health (6907 to 6973), Crime (3823 to 3775), and overall (43772 to 43773).
  • internal/server/v1/info/golden/bulk_variable_info/bulk_bt_and_sql.json
    • Updated placeCount for AdministrativeArea1 (401 to 416).
    • Added new data source dc/base/Bulgaria_Census with series summary details.
    • Added new data source dc/base/NITIIndiaPopulationProjection with series summary details.
    • Added new data source dc/base/Poland_Census with series summary details.
    • Decreased observationCount for dc/base/US_Census_ACS_5Year_Population (181550 to 172489) and overall (184687 to 175419).
    • Adjusted minValue for Eurostat data for AdministrativeArea1, EurostatNUTS2, and EurostatNUTS3.
    • Updated latestDate for Eurostat data from 2023 to 2024.
    • Increased observationCount for Eurostat data (19319 to 19991).
  • internal/server/v1/info/golden/bulk_variable_info/bulk_result.json
    • Identical changes as bulk_bt_and_sql.json.
  • internal/server/v1/info/golden/variable_group_info/demographics.json
    • Added Mean_Age_Person statistical variable.
    • Added Count_Person_5To17Years_Poor statistical variable.
    • Updated descendentStatVarCount for Person_Age (31671 to 31763), Person_AgeGroupClassification (8 to 11), Person_CitizenshipStatus (1606 to 1607), Person_Gender (28189 to 28274), ImmigrationAndCitizenship (4016 to 4022), Language (514 to 521), Person_PlaceOfBirth (1958 to 1963), Person_PlaceOfResidenceClassification (4330 to 4422), Residence (6070 to 6162), and overall (52108 to 52255).
    • Added Person_OriginBackgroundType and Person_ReligiousOrientation variable groups.
  • internal/server/v1/info/golden/variable_group_info/demographics_gbr.json
    • Added Mean_Age_Person statistical variable.
    • Added Count_Person_5To17Years_Poor statistical variable.
    • Updated descendentStatVarCount for Person_AgeGroupClassification (added 3).
    • Added Person_OriginBackgroundType and Person_ReligiousOrientation variable groups.
  • internal/server/v1/info/golden/variable_group_info/root.json
    • Updated descendentStatVarCount for Demographics (52108 to 52255), Economy (127124 to 127414), Education (55450 to 62487), Energy (10151 to 10153), Environment (3672 to 4358), Health (55295 to 55415), Housing (4910 to 5002), Uncategorized (9737 to 9757), and overall (256694 to 262829).
  • internal/server/v1/info/golden/variable_group_info/root_mtv.json
    • Updated descendentStatVarCount for Education (2370 to 3307).
  • internal/server/v1/info/golden/variable_group_info/root_mtv_jpn.json
    • Updated descendentStatVarCount for Education (2370 to 3307).
  • internal/server/v1/info/golden/variable_group_info/weather.json
    • Updated descendentStatVarCount for ObservationalWeather (56 to 534) and overall (1608 to 2086).
  • internal/server/v1/observationdates/golden/observation_dates_linked/USA_State.json
    • Added 2025 data for Count_Person for all states.
    • Modified entity counts for Count_Person_Unemployed and Count_Person_Employed for 2020-2023.
  • internal/server/v1/observations/golden/bulk_point/all_latest.json
    • Updated latest Count_Person for USA to 2025.
    • Updated latest Count_Person for geoId/06 (California) to 2025.
    • Updated latest Count_Person_Unemployed for USA to 2025-12.
    • Updated latest Count_Person_Unemployed for geoId/06 (California) to 2025-12.
    • Updated latest Count_Person_Employed for geoId/06 (California) to 2025-12.
    • Updated latest Count_Person_Unemployed for geoId/0649670 (Los Angeles County) to 2025-11.
  • internal/server/v1/observations/golden/bulk_point/preferred_latest.json
    • Updated latest Count_Person for USA to 2025.
    • Updated latest Count_Person for geoId/06 (California) to 2025.
    • Updated latest Count_Person_Unemployed for USA to 2025-12.
    • Updated latest Count_Person_Unemployed for geoId/06 (California) to 2025-12.
    • Updated latest Count_Person_Unemployed for geoId/0649670 (Los Angeles County) to 2025-11.
  • internal/server/v1/observations/golden/bulk_point_linked/all_Country.json
    • Added 2024 data for Count_Person for Bulgaria (facet 1024534950).
    • Added 2024 data for Count_Person for Finland (facet 2667716919).
    • Added 2026 data for Count_Person for India (facet 2488031303).
    • Added 2024 data for Count_Person for Poland (facet 248325961).
    • Updated 2025 data for Count_Person for USA.
    • Added new facets for Bulgaria_Census, Poland_Census, NITIIndiaPopulationProjection, Finland_Census.
  • internal/server/v1/observations/golden/bulk_point_linked/all_US_State.json
    • Updated latest Count_Person for all US states to 2025.
    • Updated latest Count_Person_Unemployed and Count_Person_Employed for all US states to 2025-12.
    • Updated latest UnemploymentRate_Person for all US states to 2025-12.
  • internal/server/v1/observations/golden/bulk_point_linked/preferred_Country.json
    • Identical changes as all_Country.json for preferred facets.
  • internal/server/v1/observations/golden/bulk_point_linked/preferred_US_State.json
    • Identical changes as all_US_State.json for preferred facets.
  • internal/server/v1/observations/golden/bulk_series/all_result.json
    • Updated Count_Person series for USA to include 2025 data.
    • Updated Count_Person series for California to include 2025 data.
    • Updated Count_Person_Unemployed series for USA to include 2025-11 and 2025-12 data, and adjusted values for 2021-2025.
    • Updated Count_Person_Unemployed series for California to include 2025-11 and 2025-12 data, and adjusted values for 2021-2025.
    • Updated Count_Person_Unemployed series for Los Angeles County to include 2025-09 and 2025-11 data, and adjusted values for 2025-08.
  • internal/server/v1/observations/golden/bulk_series/preferred_result.json
    • Identical changes as all_result.json for preferred facets.
  • internal/server/v1/observations/golden/derived_series/case1.json
    • Updated value for 2021 in a derived series.
  • internal/server/v1/page/golden/place_page/asm.Crime.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Demographics.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Economics.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Education.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Energy.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Environment.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Equity.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/asm.Housing.json
    • Updated latest population for USA to 2025.
  • internal/server/v1/page/golden/place_page/ca.Crime.json
    • Updated population series for California to include 2025 data and adjusted 2022-2024 values.
    • Updated latest population for USA to 2025.
    • Updated latest population for geoId/04 (Arizona) to 2025.
    • Updated latest population for geoId/06 (California) to 2025.
    • Updated latest population for geoId/08 (Colorado) to 2025.
    • Updated latest population for geoId/18 (Indiana) to 2025.
    • Updated latest population for geoId/22 (Louisiana) to 2025.
    • Updated latest population for geoId/29 (Missouri) to 2025.
    • Updated latest population for geoId/38 (North Dakota) to 2025.
    • Updated latest population for geoId/47 (Washington) to 2025.
    • Updated latest population for geoId/48 (Texas) to 2025.
    • Updated latest population for geoId/53 (Washington) to 2025.
    • Updated latest population for geoId/72 (Puerto Rico) to 2025.
  • internal/server/v1/page/golden/place_page/ca.Demographics.json
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Economics.json
    • Updated Count_Person_Unemployed series for California to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated Count_Person_Employed series for California to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated UnemploymentRate_Person series for California to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated Count_Person_Unemployed series for Los Angeles County to include 2025-09, 2025-11, and 2025-12 data, and adjusted 2025-08 value.
    • Updated Count_Person_Employed series for Los Angeles County to include 2025-09, 2025-11, and 2025-12 data, and adjusted 2025-08 value.
    • Updated UnemploymentRate_Person series for Los Angeles County to include 2025-09, 2025-11, and 2025-12 data, and adjusted 2025-08 value.
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Education.json
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Energy.json
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Environment.json
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Equity.json
    • Added historical data (2008-2017) for Percent_Person_18To64Years_Female_NoHealthInsurance, Percent_Person_18To64Years_Male_NoHealthInsurance, Percent_Person_18To64Years_NoHealthInsurance_BlackOrAfricanAmericanAlone, Percent_Person_18To64Years_NoHealthInsurance_HispanicOrLatino, Percent_Person_18To64Years_NoHealthInsurance_WhiteAlone.
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Health.json
    • Updated Percent_Person_BingeDrinking values for 2015, 2018, 2020, 2021, 2022.
    • Updated Percent_Person_Obesity values for 2018, 2020, 2021, 2022.
    • Updated Percent_Person_PhysicalInactivity values for 2018, 2020, 2021.
    • Updated Percent_Person_Smoking values for 2014, 2017, 2018, 2020, 2021.
    • Updated Percent_Person_WithArthritis values for 2016, 2018, 2020, 2021.
    • Updated Percent_Person_WithHighBloodPressure values for 2015, 2017, 2019, 2021.
    • Updated Percent_Person_WithHighCholesterol values for 2015, 2017, 2019.
    • Updated Percent_Person_WithMentalHealthNotGood values for 2014, 2015, 2018, 2020, 2021, 2022.
    • Updated Percent_Person_WithPhysicalHealthNotGood values for 2015, 2016, 2018, 2020, 2021.
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Housing.json
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/ca.Overview.json
    • Updated Count_Person_Unemployed series for California to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated Count_Person_Employed series for California to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated UnemploymentRate_Person series for California to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Added historical data (2008-2017) for Percent_Person_18To64Years_Female_NoHealthInsurance and Percent_Person_18To64Years_Male_NoHealthInsurance.
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/page/golden/place_page/county.Overview.json
    • Updated Count_Person_Unemployed series for a county to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated Count_Person_Employed series for a county to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Updated UnemploymentRate_Person series for a county to include 2025-09 and 2025-11 data, and adjusted 2025-08 value.
    • Added historical data (2008-2017) for Percent_Person_18To64Years_Female_NoHealthInsurance and Percent_Person_18To64Years_Male_NoHealthInsurance.
    • Updated UnemploymentRate_Person series for Los Angeles County to include 2025-09, 2025-11, and 2025-12 data, and adjusted 2025-08 value.
    • Identical population updates as ca.Crime.json.
  • internal/server/v1/propertyvalues/golden/property_values_out/bio.json
    • Corrected capitalization of 'dengue shock syndrome' to 'Dengue shock syndrome'.
  • internal/server/v1/triples/golden/bulk_triples_in/result1.json
    • Added tradePartner property.
  • internal/server/v1/variables/golden/variables/alb.json
    • Identical additions of economic activity and COVID-19 related statistical variables as internal/server/v0/placestatvar/golden/get_place_stat_vars/alb.json.
  • internal/server/v2/facet/golden/contained_in_facet/IND_AA1.json
    • Added facet for NITIIndiaPopulationProjection.
  • internal/server/v2/facet/golden/contained_in_facet/country.json
    • Added facets for Finland_Census, Poland_Census, NITIIndiaPopulationProjection, Bulgaria_Census.
  • internal/server/v2/facet/golden/series_facet/series_facet.json
    • Updated obs_count and latest_date for Count_Person facets.
  • internal/server/v2/observation/golden/contained_in_latest/CA_County.json
    • Added place_types for AdministrativeArea2 and County to all California county entries.
  • internal/server/v2/observation/golden/contained_in_latest/Country.json
    • Added new facets for Bulgaria_Census, NITIIndiaPopulationProjection, Poland_Census, Finland_Census with their latest observation data.
    • Updated latest observation for Count_Person for USA to 2025.
  • internal/server/v2/observation/golden/contained_in_latest/US_State.json
    • Updated latest observation data for Count_Person, Count_Person_Unemployed, Count_Person_Employed, and UnemploymentRate_Person for all US states to 2025, 2025-12, or 2025-11.
  • internal/server/v2/observation/golden/contained_in_latest/max_temprature.json
    • Added place_types for City, Neighborhood, and Town to various geoId entries.
  • internal/server/v2/observation/golden/derived_series/case1.json
    • Updated value for 2021 in a derived series.
  • internal/server/v2/observation/golden/direct/all.json
    • Updated Count_Person series for USA to include 2025 data, and updated obs_count and latest_date.
    • Updated Count_Person series for California to include 2025 data, and updated obs_count and latest_date.
    • Updated Count_Person_Unemployed series for USA to include 2025-11 and 2025-12 data, and adjusted 2021-2025 values, updated obs_count and latest_date.
    • Updated Count_Person_Unemployed series for California to include 2025-11 and 2025-12 data, and adjusted 2025-08 value, updated obs_count and latest_date.
    • Updated Count_Person_Unemployed series for Los Angeles County to include 2025-09 and 2025-11 data, and adjusted 2025-08 value, updated obs_count and latest_date.
  • internal/server/v2/observation/golden/direct/latest.json
    • Updated latest Count_Person for USA to 2025, and updated earliest_date and latest_date.
    • Updated latest Count_Person for California to 2025, and updated earliest_date and latest_date.
    • Updated latest Count_Person_Unemployed for USA to 2025-12, and updated earliest_date and latest_date.
    • Updated latest Count_Person_Unemployed for California to 2025-12, and updated earliest_date and latest_date.
    • Updated latest Count_Person_Unemployed for Los Angeles County to 2025-11, and updated earliest_date and latest_date.
  • internal/server/v2/propertyvalues/golden/containedin/USA_county.json
    • Removed name field from several geoId entries (Chugach Census Area, Copper River Census Area, Capitol Planning Region, Greater Bridgeport Planning Region, Lower Connecticut River Valley Planning Region, Naugatuck Valley Planning Region, Northeastern Connecticut Planning Region, Northwest Hills Planning Region, South Central Connecticut Planning Region, Southeastern Connecticut Planning Region, Western Connecticut Planning Region).
  • internal/server/v2/propertyvalues/golden/simple/locationEnum.json
    • Added AreaOfBirth to LocationClassificationEnum.
  • internal/server/v2/propertyvalues/golden/triple_in/result1.json
    • Added tradePartner property.
Activity
  • The pull request was opened by ajaits with the title 'Data release: release-20260205' and description 'Mixer data release: release-20260205'.
  • This indicates a routine data update and release process.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request appears to be a routine data release, updating various data and configuration files with new timestamps and values for release-20260205. Most changes are updates to golden test files, reflecting the new data state. While the updates are largely as expected for a data release, I've identified a significant data quality issue in one of the golden files. Several newly added statistical variables have contradictory names, such as 'Count of pregnant women, male...'. This should be addressed to maintain data integrity and avoid user confusion.

Copy link
Contributor

@beets beets left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mostly non-blocking, but please take a look at the comments. especially the last one with missing place names

"dcid": "dc/e351bke5y8c75"
},
{
"name": "Count of pregnant women, male, condition hepatitis B",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this stat var doesn't make sense to me. why is gender=male?

looking at the cl that added this, not all ended up getting fixed.
https://critique.corp.google.com/cl/850327190

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The source does have this variable though data is all 0. source generates all combinations of gender with each health condition.

"topPlaces": [
{
"dcid": "wikidataId/Q1585725",
"name": "Sofia Capital"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure of the source, but the name seems off. wikidata shows this as Sofia City
https://www.wikidata.org/wiki/Q1585725

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the data release didn't update the name, it has been like this before likely from an earlier dump from wiikipedia. this is likely showing up now because an new bulgaria stats import added data to this place.

opened b/482272045 to track this.

"dcid": "geoId/02060"
},
{
"name": "Chugach Census Area",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where did the names of these places go?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@n-h-diaz These places have been added as ProvisionalNodes. Did we loose any place import since provisional nodes didn't add names but the node has other properties like containedIn, landArea https://screenshot.googleplex.com/8G4zphp3wRJzoBs

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm yes it looks like this is due to some merging issues since the provisional nodes are defined in schema import group: specifically we are trimming schema -> schema triples (so things like typeOf: County will get dropped from place) and schema -> leaf triples (so things like name will get dropped from place). this is a bug with prophet.

short term we could either:

  • remove the provisional definitions and rebuild place (we could also just delete the whole file if it's easier for now)
  • move the provisional place nodes into place import group instead of schema and then rebuild place

long term:

  • fix the merging issue in prophet to ensure only schema group triples get dropped (if this is possible)
  • clean up the provisional nodes, so that we don't keep around provisional definitions if there's a proper definition elsewhere

Copy link
Contributor

@beets beets left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for the replies ajai. will approve to unblock the release

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants