GitHub - henrysky/astroNN_stars_foundation: Code for Leung & Bovy 2023

Abstract

Rapid strides are currently being made in the field of 🤖artificial intelligence🧠 using Transformer-based models like Large Language Models (LLMs). The potential of these methods for creating a single, large, versatile model in astronomy has not yet been explored. In this work, we propose a framework for data-driven astronomy that uses the same core techniques and architecture as used by LLMs. Using a variety of observations and labels of stars as an example, we build a Transformer-based model and train it in a self-supervised manner with cross-survey data sets to perform a variety of inference tasks. In particular, we demonstrate that a single model can perform both discriminative and generative tasks even if the model was not trained or fine-tuned to do any specific task. For example, on the discriminative task of deriving stellar parameters from Gaia XP spectra, we achieve an accuracy of 47 K in T_eff, 0.11 dex in log(g), and 0.07 dex in [M/H], outperforming an expert XGBoost model in the same setting. But the same model can also generate XP spectra from stellar parameters, inpaint unobserved spectral regions, extract empirical stellar loci, and even determine the interstellar extinction curve. Our framework demonstrates that building and training a single foundation model without fine-tuning using data and parameters from multiple surveys to predict unmeasured observations and parameters is well within reach. Such 'Large Astronomy Models' trained on large quantities of observational data will play a large role in the analysis of current and future large surveys.

Table of Contents

Abstract
Getting Started
Examples of Basic Usage
Authors
License

Getting Started

This repository exists so that anyone can easily reproduce all figures and results in the paper 🤗.

If GitHub has trouble loading the Jupyter notebooks (or is too slow), you can view them at http://nbviewer.jupyter.org/github/henrysky/astroNN_stars_foundation/tree/main/

Dependencies

This project uses astroNN and MyGaiaDB to manage APOGEE and Gaia data respectively, and PyTorch>=2.3 as the deep learning framework. mwdust and extinction are used to calculate extinctions. gaiadr3_zeropoint and GaiaXPy>=2.1 are used for Gaia data reduction. XGBoost>=2.0.1 serves as a baseline machine learning method for comparison.

Python dependencies are also listed in requirements.txt.

⚠️ You have to set magicnumber = nan in the astroNN configuration file for the data reduction code to work properly.
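As a quick sanity check before running the data reduction code, the sketch below parses configuration text and reports whether magicnumber is set to nan. The section name Basics and the usual file location ~/.astroNN/config.ini are assumptions about astroNN's layout, not something stated in this README, so adjust them to your installation. The helper magicnumber_is_nan is hypothetical and not part of astroNN.

```python
import configparser
import math


def magicnumber_is_nan(config_text, section="Basics"):
    """Return True if `magicnumber` in the given section parses to NaN.

    The section name "Basics" is an assumption about the astroNN config layout.
    Option names are matched case-insensitively by configparser.
    """
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    if not parser.has_option(section, "magicnumber"):
        return False
    try:
        return math.isnan(float(parser.get(section, "magicnumber")))
    except ValueError:
        # The value is present but is not a number at all
        return False


# Example usage (assumed path, uncomment to check your own file):
# from pathlib import Path
# text = (Path.home() / ".astroNN" / "config.ini").read_text()
# print(magicnumber_is_nan(text))
```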

⚠️ Using the mps backend of PyTorch<=2.4.0 on Apple devices is known to yield incorrect results. Please upgrade to PyTorch>=2.4.1 or use cpu as the backend.
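One way to act on this warning is to choose the device string in code rather than by hand. The following is a minimal sketch; parse_version and pick_device are hypothetical helpers, not part of this repository, and preferring cuda whenever it is available is an assumption about what you want.

```python
import re


def parse_version(version):
    """Return (major, minor, patch) from strings such as '2.4.0+cu121'."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", version)
    if match is None:
        raise ValueError(f"Cannot parse version string: {version!r}")
    return tuple(int(part or 0) for part in match.groups())


def pick_device(torch_version, cuda_available, mps_available):
    """Pick a device string, avoiding mps on PyTorch<=2.4.0."""
    if cuda_available:
        return "cuda"
    if mps_available and parse_version(torch_version) >= (2, 4, 1):
        return "mps"
    return "cpu"


# Example usage with PyTorch installed:
# import torch
# device = pick_device(
#     torch.__version__,
#     torch.cuda.is_available(),
#     torch.backends.mps.is_available(),
# )
# nn_model = StellarPerceptron.load("./model_torch/", device=device)
```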

Some notebooks require the trained model of Zhang et al. 2023 to run as a comparison to our model. You can download it from here. You need to extract the model archive stellar_flux_model.tar.gz to the root directory of this repository and rename the extracted folder to zhanggreenrix2023_stellar_flux_model. Their model requires TensorFlow to run.
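The extract-and-rename step can be scripted. The sketch below assumes the archive unpacks to exactly one top-level folder, which this README does not confirm, and it raises an error otherwise. extract_and_rename is a hypothetical helper.

```python
import shutil
import tarfile
import tempfile
from pathlib import Path


def extract_and_rename(archive, target):
    """Extract a .tar.gz archive and move its single top-level folder to `target`."""
    archive, target = Path(archive), Path(target)
    if target.exists():
        raise FileExistsError(f"{target} already exists")
    # Extract into a scratch folder first so a failed run leaves no partial target
    with tempfile.TemporaryDirectory(dir=target.parent) as scratch:
        with tarfile.open(archive, "r:gz") as tar:
            tar.extractall(scratch)
        entries = list(Path(scratch).iterdir())
        if len(entries) != 1 or not entries[0].is_dir():
            raise ValueError("Expected exactly one top-level folder in the archive")
        shutil.move(str(entries[0]), str(target))
    return target


# Example usage, run from the root directory of this repository:
# extract_and_rename("stellar_flux_model.tar.gz", "zhanggreenrix2023_stellar_flux_model")
```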

Some notebooks require the Andrae et al. 2023 Gaia DR3 "vetted" RGB catalog, named table_2_catwise.fits.gz. You can download it from here. You need to put the file(s) in a folder named andae2023_catalog at the root directory of this repository.

Datasets

You can compile the dataset by running the Dataset_Reduction.ipynb notebook.

You can skip the compilation step, because the datasets are available on Zenodo. Place them in a folder named data_files under the root directory of this repository.

If you are planning to use the Docker image, the data files are already downloaded and placed in the correct folder in the container.

Docker Image

If you have Docker installed, you can use the Dockerfile to build a Docker image on top of the PyTorch container from the NVIDIA NGC Catalog, with all dependencies installed and data files downloaded.

To build the Docker image called stars_foundation, run the following command in the root directory of this repository:

docker build -t stars_foundation .

To run a Docker container named testing123 with all GPUs available to it, run the following command:

docker run --gpus all --name testing123 -it -e SHELL=/bin/bash --entrypoint bash stars_foundation

Then you can attach to the container by running:

docker exec -it testing123 bash

Now you can run all notebooks or the training script inside the container.

Jupyter Notebooks

Dataset_Reduction.ipynb

The notebook contains the code that generates the datasets used in this paper.

Terabytes of (mostly Gaia) data need to be downloaded to construct the datasets.

An alternative is to download the datasets from Zenodo.

Inference_Spec2Labels.ipynb

The notebook contains code for inference from stellar spectra to stellar parameters.

Inference_Labels2Spec.ipynb

The notebook contains code for inference from stellar parameters to stellar spectra.

Inference_Spec2Spec.ipynb

The notebook contains code for inference from stellar spectra to stellar spectra.

Inference_Labels2Labels.ipynb

The notebook contains code for inference from stellar parameters to stellar parameters.

Inference_ExternalComparison.ipynb

The notebook contains code for inference from stellar parameters to stellar parameters, compared against an external catalog.

Task_TopKSearch.ipynb

The notebook contains an example of how our model can act as a foundation model.

Our trained model is fine-tuned with a contrastive objective to perform a stellar similarity search task.
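To illustrate what a top-K similarity search does once embeddings are available, here is a generic NumPy sketch based on cosine similarity. It is not the notebook's implementation: top_k_similar is a hypothetical helper, and the embeddings are assumed to be plain arrays produced by some model.

```python
import numpy as np


def top_k_similar(query, candidates, k=5):
    """Return indices and cosine similarities of the k candidates closest to query.

    query: array of shape (d,); candidates: array of shape (n, d).
    """
    query = np.asarray(query, dtype=float)
    candidates = np.asarray(candidates, dtype=float)
    # Normalize to unit length so the dot product equals cosine similarity
    query_unit = query / np.linalg.norm(query)
    cand_unit = candidates / np.linalg.norm(candidates, axis=1, keepdims=True)
    scores = cand_unit @ query_unit
    # Sort by descending similarity and keep the first k
    order = np.argsort(-scores)[:k]
    return order, scores[order]


# Toy example with made-up 2D embeddings
indices, scores = top_k_similar([1.0, 0.1], [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], k=2)
print(indices, scores)
```

A contrastive objective is typically what makes such a search meaningful, because it pulls matching pairs close together in the embedding space.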

Python Script

If you use this training script to train your own model, please note that details of your system are saved automatically in the model folder as training_system_info.txt, so that developers can debug should anything go wrong. Delete the file before you share your model with others if you are concerned about privacy.

training.py

Python script to train the model.
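If you want to remove the system details before sharing a model you trained, a small helper along these lines should be enough. remove_system_info is a hypothetical name, and the sketch assumes training_system_info.txt sits directly inside the model folder, as described above.

```python
from pathlib import Path


def remove_system_info(model_folder):
    """Delete training_system_info.txt from a model folder.

    Returns True if a file was removed, False if there was nothing to remove.
    """
    info_file = Path(model_folder) / "training_system_info.txt"
    if info_file.is_file():
        info_file.unlink()
        return True
    return False


# Example usage (the folder name is a placeholder for your own model folder):
# remove_system_info("./my_trained_model/")
```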

Models

model_torch is a trained PyTorch model

The model has ~8.8 million parameters and was trained on ~16 million tokens from ~397k stars with 118 unique "unit vector" tokens.

model_torch_search is a trained PyTorch model

The model is fine-tuned from the main model to perform a stellar similarity search between spectra and parameters, as a demonstration of how our model can act as a foundation model.

Graphics

All these graphics can be opened and edited by draw.io.

model_overview.drawio

Source for Figure 1 in the paper.

model_specs.drawio

Source for Figure 2 in the paper.

model_foundation_showcase.drawio

Source for Figure C1 in the paper.

Examples of Basic Usage

Here are some examples of basic usage of the model in Python. For the code to work, you need to run it from the root directory of this repository.

Get a list of vocabulary understood by the Model

from stellarperceptron.model import StellarPerceptron

nn_model = StellarPerceptron.load("./model_torch/", device="cpu")
print(nn_model.vocabs)

Give context of a star and request for information

Although our model has a context window of 64 tokens, you do not need to fill up the whole context window.

from stellarperceptron.model import StellarPerceptron

nn_model = StellarPerceptron.load("./model_torch/", device="cpu")
# give context of two stars
# [[star1 teff, star1 logg], [star2 teff, star2 logg]]
nn_model.perceive([[4700., 2.5], [5500, 4.2]], ["teff", "logg"])
# request for information for them
print(nn_model.request(["teff"]))

Get an arbitrary Gaia XP spectrum with source_id online and request for information

import numpy as np
from utils.gaia_utils import xp_spec_online
from stellarperceptron.model import StellarPerceptron

# Gaia DR3 source_id as integer
gdr3_source_id = 2130706307446806144

bprp_coeffs = xp_spec_online(gdr3_source_id, absolute_flux=False)
nn_model = StellarPerceptron.load("./model_torch/", device="cpu")
# Give the context of a star by giving XP coefficients to the NN model
xp_coeffs = np.concatenate([bprp_coeffs["bp"][:32], bprp_coeffs["rp"][:32]])
xp_names = [*[f"bp{i}" for i in range(32)], *[f"rp{i}" for i in range(32)]]
nn_model.perceive(xp_coeffs, xp_names)
# Request for information like teff, logg, m_h
print(nn_model.request(["teff", "logg", "m_h"]))
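Note that the 32 BP and 32 RP coefficients used above add up to 64 tokens, which matches the context window mentioned earlier. The sketch below builds the token names and guards against exceeding that limit. xp_token_names and CONTEXT_WINDOW are hypothetical names introduced for illustration, and whether the model itself enforces the limit is not stated in this README.

```python
CONTEXT_WINDOW = 64  # context window size quoted in this README


def xp_token_names(n_bp=32, n_rp=32):
    """Build token names bp0..bp{n_bp-1} followed by rp0..rp{n_rp-1}."""
    names = [f"bp{i}" for i in range(n_bp)] + [f"rp{i}" for i in range(n_rp)]
    if len(names) > CONTEXT_WINDOW:
        raise ValueError(
            f"{len(names)} tokens requested, but the context window is {CONTEXT_WINDOW}"
        )
    return names


names = xp_token_names()
print(len(names), names[0], names[-1])
```

Passing fewer coefficients, for example xp_token_names(16, 16), leaves room in the context window for other tokens such as stellar parameters.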

Plot XP spectrum from stellar parameters

import matplotlib.pyplot as plt
from stellarperceptron.model import StellarPerceptron
from utils.gaia_utils import nn_xp_coeffs_phys, xp_sampling_grid

nn_model = StellarPerceptron.load("./model_torch/", device="cpu")
# generate a spectrum from stellar parameters
# absolute_flux is a boolean flag: True gives flux at 10 parsec, False gives flux normalized by the overall G-band flux
# other keywords are optional; you can specify any of them as long as they are in the vocabs
spectrum = nn_xp_coeffs_phys(nn_model, absolute_flux=True, teff=4700., logg=2.5, m_h=0.0, logebv=-7)

plt.plot(xp_sampling_grid, spectrum)
plt.xlabel("Wavelength (nm)")
plt.ylabel("Flux at 10 pc ($ \\mathrm{W} \\mathrm{nm}^{-1} \\mathrm{m}^{-2}$)")
plt.xlim(392, 992)
plt.show()

Authors

Henry Leung - henrysky

Department of Astronomy and Astrophysics, University of Toronto

Contact Henry: henrysky.leung [at] utoronto.ca

Jo Bovy - jobovy

Department of Astronomy and Astrophysics, University of Toronto

Contact Jo: bovy [at] astro.utoronto.ca

License

This project is licensed under the MIT License. See the LICENSE file for details.
