v0 Embeddings #3

Closed
brunosan opened this issue Oct 2, 2023 · 4 comments

@brunosan
Member

brunosan commented Oct 2, 2023

The aim is a set of geographically anchored files that fully cover Earth's land and coasts. Each file contains a vector that is a mathematical representation of the semantics at that location, and that can later be fine-tuned with higher resolution or a finer set of semantics.

For v0, we can restrict the embeddings to a specific region of interest (RoI), a specific time window, and a fixed vector length.
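
To make the v0 restriction concrete, here is a minimal sketch of how such a scope could be written down and checked. Everything in it is an assumption for illustration: the `V0Scope` name, the bounding-box convention, and the placeholder values (including the vector length of 768) are hypothetical and not decided in this issue.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class V0Scope:
    """Hypothetical description of a restricted v0 embedding scope."""

    bbox: tuple          # RoI as (min_lon, min_lat, max_lon, max_lat), in degrees
    start: date          # first day of the time window (inclusive)
    end: date            # last day of the time window (inclusive)
    vector_length: int   # expected number of values per embedding

    def covers(self, lon, lat, when):
        """Return True if a location and date fall inside the RoI and time window."""
        min_lon, min_lat, max_lon, max_lat = self.bbox
        in_roi = min_lon <= lon <= max_lon and min_lat <= lat <= max_lat
        in_window = self.start <= when <= self.end
        return in_roi and in_window

    def accepts(self, vector):
        """Return True if an embedding has the agreed vector length."""
        return len(vector) == self.vector_length


# Placeholder values only; the real RoI, window, and length are still open.
scope = V0Scope(
    bbox=(-10.0, 35.0, 5.0, 45.0),
    start=date(2023, 1, 1),
    end=date(2023, 12, 31),
    vector_length=768,
)
```

With this sketch, `scope.covers(0.0, 40.0, date(2023, 6, 1))` is true, while a point outside the box or outside the window is rejected. Note the simple box test does not handle an RoI that crosses the antimeridian.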

@brunosan
Member Author

Updated to re-scope.

@yellowcap
Member

yellowcap commented Nov 13, 2023

We will initially focus on three types of embedding:

  • Instrument (bands). The model should not expect all bands to be present.
  • Time, with time of day, year, and month.
  • Location, with lat/lon plus extent or resolution. A promising option is spherical harmonics.
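
As a rough illustration of the three inputs listed above, here is a minimal sketch of one way they could be turned into numeric features. It is not the project's encoding: all function names, the `ref_year` offset, and the mask layout are assumptions. It also does not implement full spherical harmonics. The location part uses only the unit vector on the sphere, whose three components are, up to constant factors, the degree-1 real spherical harmonics.

```python
import math


def cyclic(value, period):
    """Map a periodic value to (sin, cos) so the wrap-around point is continuous."""
    angle = 2.0 * math.pi * (value / period)
    return (math.sin(angle), math.cos(angle))


def encode_time(hour, month, year, ref_year=2020):
    """Encode time of day and month cyclically, and year as an offset.

    hour is in [0, 24), month is in 1..12. ref_year is an arbitrary
    reference chosen for this sketch (assumption).
    """
    return [*cyclic(hour, 24), *cyclic(month - 1, 12), float(year - ref_year)]


def encode_location(lat_deg, lon_deg, resolution_m):
    """Encode lat/lon as a unit vector on the sphere, plus log10 of resolution.

    The unit vector is continuous across the antimeridian and at the poles.
    """
    lat = math.radians(lat_deg)
    lon = math.radians(lon_deg)
    x = math.cos(lat) * math.cos(lon)
    y = math.cos(lat) * math.sin(lon)
    z = math.sin(lat)
    return [x, y, z, math.log10(resolution_m)]


def encode_bands(values, all_bands):
    """Encode whichever bands are present, followed by a presence mask.

    values maps band name to a number for the bands that exist; missing
    bands get 0.0 and a mask entry of 0.0, so not all bands are required.
    """
    present = [float(values.get(band, 0.0)) for band in all_bands]
    mask = [1.0 if band in values else 0.0 for band in all_bands]
    return present + mask
```

For example, `encode_bands({"red": 0.2}, ["red", "nir"])` gives two values followed by a two-entry mask marking `nir` as missing, which is one simple way to let a model see a variable set of bands.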

Refs #22 and #12

@yellowcap yellowcap reopened this Nov 13, 2023
@weiji14 weiji14 added this to the v0 Release milestone Nov 14, 2023
@brunosan
Member Author

Per our call, the DevSeed team will create an "embeddings factory" script, ideally run as CI on release versions.

The locations and times to generate will be defined {by Ode's team}; for v0, the set should ideally not be too large.

It also seems that creating embeddings for (some of) the training dataset is a small incremental effort since that data is already prepared.

@brunosan
Member Author

v0 documentation #118 includes how to create embeddings for any location and time.
