What it does

Digest raw data
- Signal from CHP-1
- Tracker "async"
- CHP-1 internal temperature from thermistor
Bad data ranges flagging
- From manual set execution ranges
- From acquisition signal physical limits
Converts signal to radiation
- Computes temperature correction when possible
Plots
- Overview of Clean/Dirty signal
- Daily signal with and without dark
- Overview of Direct radiation measurements
- Daily Direct radiation measurements

Digest raw data
- Signal from CHP-1
Bad data ranges flagging
- From manual set execution ranges
- From acquisition signal physical limits
Converts signal to radiation
Plots
- Overview of Clean/Dirty signal
- Daily signal with and without dark

Quality Check of radiation data (QCRad)
- Flags data using mainly the algorithm of C. N. Long and Y. Shi (2006)
Imports data from github.com/thanasisn/TSI
- Sun_Dist_Astropy Sun - LAP distance
- TSI_TOA TSI at TOA at LAP
- TSI_1au TSI
- TSI_source TSI data source
Imports atmospheric pressure data from proxies
- Pressure Atmospheric pressure at LAP
- Pressure_source Data source
Keeps an md5sum of all input files to check for bit rot and other data corruption.

TODO

Fully port all to duckdb
Replace and compare processes from "CM_21_GLB"
- All the major stages have been replaced
- Secondary processes are to be ported
Process more instruments
Import libRadtran data
May import CSid
Import other references

Some aspects on the implementation of this project.

We use a dataset of parquet files as a database for all measurements and additional data.
We are migrating the original parquet dataset scheme to Duckdb to improve overall efficiency.
The parquet dataset use one file for each month, this facilitates:
- Syncing of the data between different computers.
- Partial processing when needed without using the dataset function.
It should be easy to migrate to a pure database like duckdb or sqlite.
There are some files with extra meta data for the data in the database and the analysis performed.
We use features of the arrow library, and also data.table when it is more suitable or clear to code.
The analysis should be able to be performed with under 8Gb of RAM, but is not assured.
There is a trade-of with the disk usage/wearing, especially when starting from scratch.
New data should be easy to be added on daily base on all levels.
New process and analysis should be easy to added for all data.
Goal to become a framework for all broadband instruments data analysis and manipulation.

There is no centralized documentation for the project. Although you can refer to: