CTA Data Cleaning

Python data-cleaning pipeline code for CTA research (feature prep, cross-sectional standardization, and leak-safe preprocessing), plus a few small helpers used by that pipeline.

Supporting modules

get_corr_new.py: fast daily cross-sectional correlation (rank + Pearson)
functions.py: utilities (e.g., rolling window node generation)
strategy_backtest_metrics.py: lightweight backtest metrics/plots

Data (ignored)

Large data/artifacts are intentionally excluded from git via .gitignore (e.g., *.pq, dd_pre/, dd_3_por/).

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
functions.py		functions.py
get_corr_new.py		get_corr_new.py
main(mean_model).py		main(mean_model).py
strategy_backtest_metrics.py		strategy_backtest_metrics.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CTA Data Cleaning

Supporting modules

Data (ignored)

About

Uh oh!

Releases

Packages

Languages

Wilsonnijc-bot/CTA-data-cleaning

Folders and files

Latest commit

History

Repository files navigation

CTA Data Cleaning

Supporting modules

Data (ignored)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages