Maintain per-project metadata JSONs #853

Closed
hannes-ucsc opened this issue Mar 27, 2019 · 2 comments
Labels
code [subject] Production code; enh [type] New feature or request; epic [type] Issue consists of multiple smaller issues; orange [process] Done by the Azul team

Comments

@hannes-ucsc
Member

hannes-ucsc commented Mar 27, 2019

┆Issue is synchronized with this Jira Story
┆Project Name: azul
┆Issue Number: AZUL-527

@diekhans

Some empirical testing confirms Hannes' observation that updating the metadata on every per-bundle event will be disastrously slow. While a simple incremental update of metadata.json when a bundle is added can be linear, the kind of update that happens when metadata is migrated makes building metadata.json O(N^2). Empirically, O(N^2) means multiple days or even weeks on a single core.
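To make the linear versus quadratic gap concrete, here is a minimal cost model. It rests on an assumption not stated above: that during a migration every bundle event forces metadata.json to be rebuilt from all bundles seen so far, whereas a plain addition only touches the new bundle. The function names are hypothetical and the counts are abstract units of work, not measured timings.

```python
def incremental_cost(n_bundles: int) -> int:
    """Work units when each bundle event only appends that bundle's entry."""
    return sum(1 for _ in range(n_bundles))


def rebuild_cost(n_bundles: int) -> int:
    """Work units when each bundle event rebuilds the file from all bundles so far.

    Assumed model: the k-th event re-reads k bundles, so the total is
    1 + 2 + ... + N = N * (N + 1) / 2, which grows as O(N^2).
    """
    return sum(k for k in range(1, n_bundles + 1))


if __name__ == "__main__":
    for n in (1_000, 10_000, 100_000):
        print(n, incremental_cost(n), rebuild_cost(n))
```

Under this model, multiplying the number of bundles by ten multiplies the rebuild cost by roughly a hundred, which is consistent with a job that is fine for small projects but takes days for large ones.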

Bundle events give us 'map', but we don't have 'reduce'. Higher-level events are needed to do the 'reduce' phase.
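A rough sketch of that split, with everything in it being an assumption for illustration: the bundle field names (`project_id`, `bundle_id`, `version`), the function names, and the idea that some higher-level event calls the reduce step once per project. It is not Azul's actual code or data model.

```python
import json
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def map_bundle(bundle: dict) -> Tuple[str, dict]:
    """'Map': runs per bundle event and emits one row keyed by project (hypothetical fields)."""
    row = {"bundle_id": bundle["bundle_id"], "version": bundle["version"]}
    return bundle["project_id"], row


def reduce_project(rows: List[dict]) -> str:
    """'Reduce': runs once per project and builds the whole metadata JSON in one pass."""
    return json.dumps(sorted(rows, key=lambda r: r["bundle_id"]), indent=2)


def build_all(bundles: Iterable[dict]) -> Dict[str, str]:
    """Stand-in for the missing higher-level event: group mapped rows, then reduce each group."""
    partial: Dict[str, List[dict]] = defaultdict(list)
    for bundle in bundles:
        project_id, row = map_bundle(bundle)
        partial[project_id].append(row)
    return {pid: reduce_project(rows) for pid, rows in partial.items()}
```

Because the reduce step writes each project's document once, after all of its rows are collected, the total work stays linear in the number of bundles.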

Updating project-level data incrementally also creates a consistent-view problem for the project data: a user may see a project whose metadata is only partially migrated.

Until we have an architectural solution, I would suggest we have the download client build metadata.json.

@hannes-ucsc hannes-ucsc added the orange [process] Done by the Azul team, code [subject] Production code, and enh [type] New feature or request labels Apr 15, 2019
@theathorn theathorn added the epic [type] Issue consists of multiple smaller issues label Apr 19, 2019
@hannes-ucsc hannes-ucsc added 0temp and removed 0temp labels Aug 22, 2019
@theathorn

Closing in favor of #2693.


3 participants