The orbit prediction package explores improving traditional physics-based models of orbital dynamics using machine learning techniques.
This package has been developed and tested on Python 3.8. We recommend using conda or Python's virtual environments to keep dependencies separate from your system Python installation, for example

```
conda create -n ssa python=3.8
conda activate ssa
```

or

```
python3 -m venv venv
source venv/bin/activate
```
The `orbit_pred` pipeline CLI can then be installed using the provided Makefile by running

```
make install
```

from this directory.
All available commands can be seen by running

```
orbit_pred -h
```

and help for individual commands is accessed via

```
orbit_pred <COMMAND> -h
```
We use orbit data from United States Strategic Command (USSTRATCOM) via the space-track.org website and API. In order to access this API, you must register for an account here. The data is served in the two-line element set (TLE) format, a fixed-width text format that contains the Keplerian orbital elements of an anthropogenic space object (ASO) at a point in time. We then parse the TLE data and calculate the position (`r`) and velocity (`v`) orbital state vectors.
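For reference, the conversion from Keplerian elements to `r`/`v` state vectors can be sketched in pure Python. This is the standard textbook perifocal-to-inertial transformation, not the package's internal implementation:

```python
import math

# Standard gravitational parameter of the Earth (km^3 / s^2)
MU_EARTH = 398600.4418

def elements_to_state(a, e, i, raan, argp, nu, mu=MU_EARTH):
    """Convert Keplerian elements (semi-major axis a in km, angles in
    radians) to position r (km) and velocity v (km/s) vectors in an
    Earth-centered inertial frame."""
    p = a * (1.0 - e**2)                  # semi-latus rectum
    r_mag = p / (1.0 + e * math.cos(nu))  # orbital radius at true anomaly nu

    # Position and velocity in the perifocal (PQW) frame.
    r_pqw = (r_mag * math.cos(nu), r_mag * math.sin(nu), 0.0)
    coef = math.sqrt(mu / p)
    v_pqw = (-coef * math.sin(nu), coef * (e + math.cos(nu)), 0.0)

    # Rotation from the perifocal frame to the inertial frame,
    # built from the RAAN, inclination, and argument of perigee.
    cO, sO = math.cos(raan), math.sin(raan)
    ci, si = math.cos(i), math.sin(i)
    cw, sw = math.cos(argp), math.sin(argp)
    rot = (
        (cO * cw - sO * sw * ci, -cO * sw - sO * cw * ci, sO * si),
        (sO * cw + cO * sw * ci, -sO * sw + cO * cw * ci, -cO * si),
        (sw * si, cw * si, ci),
    )

    def apply(m, vec):
        return tuple(sum(m[row][k] * vec[k] for k in range(3)) for row in range(3))

    return apply(rot, r_pqw), apply(rot, v_pqw)
```

For a circular equatorial orbit (`e = 0`, all angles zero) this reduces to `r = (a, 0, 0)` and a velocity of magnitude `sqrt(mu / a)` along the y-axis.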
The ETL module provides a CLI with the following arguments:

`--st_user`
: The username for space-track.org

`--st_password`
: The password for space-track.org

`--norad_id_file`
: The path to a text file containing a single NORAD ID on each row to fetch orbit data for. If no file is passed, then orbit data for all LEO ASOs will be fetched.

`--last_n_days`
: The number of days into the past to fetch orbit data for each ASO, defaults to 30 days.

`--only_latest`
: A boolean flag to only fetch the latest TLE for each ASO.

`--output_path`
: The path to save the orbit data parquet file to.
Running

```
orbit_pred etl --st_user <SPACE TRACK USERNAME> \
               --st_password <SPACE TRACK PASSWORD> \
               --norad_id_file sample_data/test_norad_ids.txt \
               --last_n_days 10 \
               --output_path <OUTPUT>
```

will retrieve orbit data from the past 10 days only for the ASOs with the NORAD IDs listed in this file.
Running

```
orbit_pred etl --st_user <SPACE TRACK USERNAME> \
               --st_password <SPACE TRACK PASSWORD> \
               --only_latest \
               --output_path <OUTPUT>
```

will fetch just the latest TLE for every ASO in LEO.
The USSTRATCOM API is throttled, and the amount of data that can be returned by a single query is limited. The space-track.org API client we use automatically rate limits the number of requests; however, it is a good idea to review the API guidelines to understand how often you can run automated scripts against the API.
The result of running the ETL script is a pandas DataFrame that is saved in the Parquet format with the following columns:
Field | Description | Type |
---|---|---|
aso_id | The unique ID for the ASO | string |
aso_name | The name of the ASO | string |
epoch | The timestamp the orbital observation was taken | datetime |
r_x | The x component of the position vector r | float |
r_y | The y component of the position vector r | float |
r_z | The z component of the position vector r | float |
v_x | The x component of the velocity vector v | float |
v_y | The y component of the velocity vector v | float |
v_z | The z component of the velocity vector v | float |
object_type | Whether the ASO is a payload, rocket body, or debris | string |
The training set builder uses the poliastro astrodynamics library to build a training data set of the predictions and errors made by a physical model so that we can try to train machine learning models to estimate this prediction error. The baseline physics model is a two-body model that uses Cowell's formulation for modeling the perturbation in an ASO's orbit caused by the Earth. We build our training set by:

- Given an orbit data point for an ASO, we find all the orbit data points for that ASO that are within `n` days after the given data point.
- We then create a physics model starting at the given orbit data point and propagate the orbit to all the data points that are within `n` days in the future.
- We use the orbit data points as ground truth to determine the error in the physical model's propagation.
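The windowing step above can be sketched as follows. This is a simplified illustration with hypothetical helper names, not the package's actual code; the real builder also propagates the orbit with poliastro at each pairing:

```python
from datetime import datetime, timedelta

def build_prediction_windows(observations, n_pred_days=3):
    """Pair each observation with all later observations of the same ASO
    that fall within `n_pred_days` days of it. `observations` is a list
    of dicts, each with at least an 'aso_id' and an 'epoch' (datetime)."""
    window = timedelta(days=n_pred_days)
    # Sort by ASO and time so each ASO's observations are contiguous
    # and in chronological order.
    obs = sorted(observations, key=lambda o: (o["aso_id"], o["epoch"]))
    pairs = []
    for i, start in enumerate(obs):
        for target in obs[i + 1:]:
            if target["aso_id"] != start["aso_id"]:
                break  # moved on to a different ASO
            if target["epoch"] - start["epoch"] > window:
                break  # everything after this is even further out
            # In the real pipeline, the physics model would be propagated
            # from `start` to `target['epoch']` here and the error recorded.
            pairs.append((start, target))
    return pairs
```

Because the list is sorted, the inner loop can stop as soon as it leaves the window, so the builder never scans past the `n`-day horizon for a given starting point.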
The CLI to create a training data set has the following arguments:

`--input_path`
: The path to the parquet file to load the orbit observations from.

`--output_path`
: The path to save the prediction/error training dataset to.

`--last_n_days`
: Only use observations from the last `n` days when creating the prediction windows. Defaults to 30.

`--n_pred_days`
: The number of days in the prediction window. Defaults to 3.
The result of running the training data creation script has the following columns:

Field | Description | Type |
---|---|---|
aso_id | The unique ID for the ASO | string |
aso_name | The name of the ASO | string |
epoch | The timestamp the orbital observation was taken | datetime |
r_x | The x component of the position vector r | float |
r_y | The y component of the position vector r | float |
r_z | The z component of the position vector r | float |
v_x | The x component of the velocity vector v | float |
v_y | The y component of the velocity vector v | float |
v_z | The z component of the velocity vector v | float |
object_type | Whether the ASO is a payload, rocket body, or debris | string |
start_epoch | The epoch when the prediction was started | datetime |
start_r_x | The x component of the position vector r where the prediction started | float |
start_r_y | The y component of the position vector r where the prediction started | float |
start_r_z | The z component of the position vector r where the prediction started | float |
start_v_x | The x component of the velocity vector v where the prediction started | float |
start_v_y | The y component of the velocity vector v where the prediction started | float |
start_v_z | The z component of the velocity vector v where the prediction started | float |
elapsed_seconds | The number of seconds between the start_epoch and epoch | float |
physics_pred_r_x | The x component of the predicted position vector r | float |
physics_pred_r_y | The y component of the predicted position vector r | float |
physics_pred_r_z | The z component of the predicted position vector r | float |
physics_pred_v_x | The x component of the predicted velocity vector v | float |
physics_pred_v_y | The y component of the predicted velocity vector v | float |
physics_pred_v_z | The z component of the predicted velocity vector v | float |
physics_err_r_x | The prediction error in the x component of the position vector | float |
physics_err_r_y | The prediction error in the y component of the position vector | float |
physics_err_r_z | The prediction error in the z component of the position vector | float |
physics_err_v_x | The prediction error in the x component of the velocity vector | float |
physics_err_v_y | The prediction error in the y component of the velocity vector | float |
physics_err_v_z | The prediction error in the z component of the velocity vector | float |
The ML module provides a process for using XGBoost to build baseline gradient boosted regression tree models to estimate the error made by the physics model in predicting orbits.

The features used by the baseline models are:

`elapsed_seconds`
: The number of seconds that the physical model predicted into the future.

`start_pred_r_x`
: The `x` component of the position vector `r` where the prediction began

`start_pred_r_y`
: The `y` component of the position vector `r` where the prediction began

`start_pred_r_z`
: The `z` component of the position vector `r` where the prediction began

`start_pred_v_x`
: The `x` component of the velocity vector `v` where the prediction began

`start_pred_v_y`
: The `y` component of the velocity vector `v` where the prediction began

`start_pred_v_z`
: The `z` component of the velocity vector `v` where the prediction began

`physics_pred_r_x`
: The `x` component of the predicted position vector `r`

`physics_pred_r_y`
: The `y` component of the predicted position vector `r`

`physics_pred_r_z`
: The `z` component of the predicted position vector `r`

`physics_pred_v_x`
: The `x` component of the predicted velocity vector `v`

`physics_pred_v_y`
: The `y` component of the predicted velocity vector `v`

`physics_pred_v_z`
: The `z` component of the predicted velocity vector `v`
We independently build a baseline `XGBRegressor` model for each of the six error columns:

- `physics_err_r_x`
- `physics_err_r_y`
- `physics_err_r_z`
- `physics_err_v_x`
- `physics_err_v_y`
- `physics_err_v_z`
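The one-model-per-error-column setup can be sketched as below. For illustration we use a trivial stand-in regressor (predicting the training-set mean) so the sketch stays dependency-free; the actual pipeline fits an `xgboost.XGBRegressor` per column:

```python
ERROR_COLUMNS = [
    "physics_err_r_x", "physics_err_r_y", "physics_err_r_z",
    "physics_err_v_x", "physics_err_v_y", "physics_err_v_z",
]

class MeanRegressor:
    """Stand-in for xgboost.XGBRegressor: always predicts the mean
    of the training targets."""
    def fit(self, X, y):
        self.mean_ = sum(y) / len(y)
        return self

    def predict(self, X):
        return [self.mean_] * len(X)

def train_error_models(X, targets, model_factory=MeanRegressor):
    """Fit one independent regressor per error column.

    `X` is the shared feature matrix; `targets` maps each error-column
    name to its list of target values."""
    models = {}
    for col in ERROR_COLUMNS:
        models[col] = model_factory().fit(X, targets[col])
    return models
```

Because the six models are fit independently on the same features, swapping in `XGBRegressor` (or any estimator with a `fit`/`predict` interface) only changes the `model_factory` argument.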
We can train the baseline models by running

```
orbit_pred train_models
```

with the following arguments:

`--input_path`
: The path to the parquet file containing the physical model prediction/error training data.

`--use_gpu`
: A boolean flag for whether or not to use GPUs in training. Requires that the CUDA dependencies are properly installed; see here for more details.

`--out_dir`
: The directory to serialize the JSON representations of the models to.
Finally, we combine the physical orbit model and the machine learning models by adjusting the physics-predicted state vectors by the error amounts predicted by the ML models.
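Assuming the error columns are defined as observed minus predicted (an assumption on our part, not stated above), the correction step amounts to adding each ML-estimated error back onto the corresponding physics-predicted component:

```python
STATE_COMPONENTS = ["r_x", "r_y", "r_z", "v_x", "v_y", "v_z"]

def correct_prediction(physics_pred, predicted_err):
    """Adjust a physics-predicted state vector by the ML-predicted errors.

    `physics_pred` maps each component name (e.g. 'r_x') to its
    physics-model value; `predicted_err` maps the same keys to the
    ML-estimated physics-model error in that component.
    Assumes error = observed - predicted, so the correction is addition.
    """
    return {c: physics_pred[c] + predicted_err[c] for c in STATE_COMPONENTS}
```

If the errors were instead defined as predicted minus observed, the correction would be a subtraction; the structure is otherwise the same.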
The orbit prediction module has a CLI to:

- Fetch the most up-to-date orbit data for LEO ASOs from USSTRATCOM.
- Use a physical model to predict the future ASO orbits.
- Correct the physical model predictions using the errors predicted by the ML models.

The CLI can be run via

```
orbit_pred pred_orbits
```

with the following arguments:

`--st_user`
: The username for space-track.org

`--st_password`
: The password for space-track.org

`--norad_id_file`
: The path to a text file containing a single NORAD ID on each row to fetch orbit data for. If no file is passed, then orbit data for all LEO ASOs will be fetched.

`--ml_model_dir`
: The path to the directory containing the error prediction models serialized as JSON.

`--n_days`
: The number of days in the future to make orbit predictions for, defaults to 3.

`--timestep`
: The frequency in seconds to make orbit predictions for, defaults to 600.

`--output_path`
: The path to save the orbit prediction pickle file to. These results can be used directly by the conjunction search UI to search and visualize what the predicted space traffic will look like.
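Together, `--n_days` and `--timestep` define a grid of prediction epochs. The idea can be sketched as follows (an illustration, not the package's code):

```python
from datetime import datetime, timedelta

def prediction_epochs(start, n_days=3, timestep=600):
    """Generate the timestamps to predict orbits for: every `timestep`
    seconds from `start` up to and including `start + n_days` days."""
    end = start + timedelta(days=n_days)
    step = timedelta(seconds=timestep)
    epochs = []
    t = start
    while t <= end:
        epochs.append(t)
        t += step
    return epochs
```

With the defaults, each ASO gets one predicted state every 10 minutes for 3 days, i.e. 3 × 86400 / 600 + 1 = 433 epochs per object.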
Running the pipeline demo script will run the whole orbital prediction pipeline for the ASOs listed in this file. First we need to set the needed environment variables

```
export ST_USER=<SPACE TRACK USERNAME>
export ST_PASSWORD=<SPACE TRACK PASSWORD>
```

then we can run the pipeline via

```
./pipeline_demo.sh
```

The final and intermediate data artifacts are in `/tmp/ssa_test` and the trained ML models are in `/tmp/ssa_test/err_models`.
If you would like to see what the resulting conjunction predictions look like, run

```
cp /tmp/ssa_test/orbit_preds.pickle ../conjunction_search/sample_data/orbit_preds.pickle
```

then run the development version of the conjunction search UI as detailed here.