Battery Pulse Diagnostics

This repository uses rapid DC pulse sequences to predict state of charge (SOC), discharge capacity, and safety metrics measured on several different commercially produced lithium-ion batteries.

Setup

Environment

$ conda env create -f environment.yml
$ conda activate battery_pulse_diagnostics

NOTE: You may need to run the following command for tsfresh to work:

$ pip install --upgrade pandas "dask[complete]"

Data preprocessing

The processed and raw data can be downloaded from Zenodo: https://doi.org/10.5281/zenodo.14597394.

Raw electrochemical characterization test data was processed using the following scripts:

scripts/01_process_data.py (The raw files processed by this script are not included; it generates the 'data_raw.h5' database used by the later processing scripts)
scripts/02_extract_pulses.py
scripts/03_extract_targets.py
scripts/04_extract_dcir_all_pulses.py
scripts/04_extract_dcir.py (Returns DCIR at 50% SOC only, for data exploration; the previous script, 04_extract_dcir_all_pulses.py, returns DCIR from all HPPC-type pulses)
scripts/05_process_data_for_ml.py
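
As a rough illustration, the scripts above could be run in sequence with a small helper like the one below. This is only a sketch under several assumptions that the README does not confirm: that the numeric prefixes give the run order, that each script takes no command-line arguments, and that it is launched from the repository root. The names build_commands and run_pipeline are hypothetical and are not part of the repository. 01_process_data.py is left out because the raw files it reads are not included.

```python
import subprocess
import sys

# Script paths copied from the list above.
# Assumption: the numeric prefixes reflect the intended run order.
PIPELINE = [
    "scripts/02_extract_pulses.py",
    "scripts/03_extract_targets.py",
    "scripts/04_extract_dcir_all_pulses.py",
    "scripts/04_extract_dcir.py",
    "scripts/05_process_data_for_ml.py",
]


def build_commands(scripts=PIPELINE, python=sys.executable):
    """Return one command (as an argument list) per script."""
    # Assumption: the scripts need no command-line arguments.
    return [[python, script] for script in scripts]


def run_pipeline(commands, dry_run=True):
    """Print each command and, unless dry_run is set, execute it."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing script.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run by default: only prints what would be executed.
    run_pipeline(build_commands(), dry_run=True)
```

Keeping the dry run as the default makes it easy to inspect the order before launching any long-running processing step.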

These scripts only need to be run once. They generate:
- data/
  - data_raw.h5
  - data_for_ml.h5
  - data_for_ml_extracted_features.h5
  - features_pulse_hppc.csv
  - features_pulse_rapid.csv
  - features_pulse_psrp1.csv
  - features_pulse_psrp1_Cb2.csv
  - features_pulse_psrp1_1C.csv
  - features_pulse_psrp2_charging.csv
  - features_pulse_psrp2_discharging.csv
  - features_pulse_psrp2_Cb2.csv
  - features_pulse_psrp2_1C.csv
  - targets_soh.csv
  - features_dcir.csv
  - features_dcir_no_interpolation.csv
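
Before moving on to model fitting, it can help to confirm that the files listed above are actually present. The following is a minimal sketch using only the standard library; the function name find_missing_outputs is hypothetical, and the default location "data" assumes the check is run from the repository root.

```python
from pathlib import Path

# File names copied from the list above.
EXPECTED_FILES = [
    "data_raw.h5",
    "data_for_ml.h5",
    "data_for_ml_extracted_features.h5",
    "features_pulse_hppc.csv",
    "features_pulse_rapid.csv",
    "features_pulse_psrp1.csv",
    "features_pulse_psrp1_Cb2.csv",
    "features_pulse_psrp1_1C.csv",
    "features_pulse_psrp2_charging.csv",
    "features_pulse_psrp2_discharging.csv",
    "features_pulse_psrp2_Cb2.csv",
    "features_pulse_psrp2_1C.csv",
    "targets_soh.csv",
    "features_dcir.csv",
    "features_dcir_no_interpolation.csv",
]


def find_missing_outputs(data_dir="data", expected=EXPECTED_FILES):
    """Return the expected file names that are not present in data_dir."""
    root = Path(data_dir)
    return [name for name in expected if not (root / name).is_file()]


if __name__ == "__main__":
    # Assumption: run from the repository root, so outputs live in ./data
    missing = find_missing_outputs()
    if missing:
        print("Missing preprocessing outputs:")
        for name in missing:
            print(f"  data/{name}")
    else:
        print("All preprocessing outputs found.")
```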

Another approach studied was extracting time-series features with the tsfresh feature library, as documented in scripts/06_run_tsfeature_extraction.py, which generates the 'data/data_for_ml_extracted_features.h5' file. This was found to be computationally expensive and did not improve predictive performance compared to using the raw data directly.

Model fitting

There are two main scripts for model fitting. run_kfoldcv.py runs KFold cross-validation and is set up to compare multiple models, including PLSR, XGBoost, and several neural network architectures. KFold CV is used because of the computational cost of each model run, particularly for the neural network models. run_bootstrap_xgboost.py runs an XGBoost model with repeated random train/test sampling, since the lower complexity of XGBoost allows for more model runs. Results are saved in the 'results/' directory, and the code for visualizing the results is in the 'notebooks/' directory.
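
To make the difference between the two resampling schemes concrete, here is a minimal standard-library sketch. It is not the code in run_kfoldcv.py or run_bootstrap_xgboost.py: the function names are hypothetical, the 20% test fraction is an arbitrary choice, and the repeated sampling is assumed to draw each split without replacement. It also splits plain sample indices, whereas the actual scripts may group samples differently.

```python
import random


def kfold_splits(n_samples, n_folds, seed=0):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into n_folds nearly equal folds.
    folds = [indices[i::n_folds] for i in range(n_folds)]
    for k in range(n_folds):
        test_idx = folds[k]
        train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
        yield train_idx, test_idx


def repeated_random_splits(n_samples, n_repeats, test_fraction=0.2, seed=0):
    """Yield n_repeats independent random (train_idx, test_idx) pairs."""
    rng = random.Random(seed)
    n_test = max(1, round(n_samples * test_fraction))
    for _ in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        # Assumption: sampling without replacement within each split.
        yield indices[n_test:], indices[:n_test]


if __name__ == "__main__":
    # KFold: the number of model fits equals the number of folds.
    print(sum(1 for _ in kfold_splits(100, 5)), "fits with 5-fold CV")
    # Repeated sampling: the number of fits is chosen freely.
    print(sum(1 for _ in repeated_random_splits(100, 50)), "fits with 50 repeats")
```

With KFold the number of fits is fixed by the number of folds, which suits expensive models. Repeated random sampling lets the number of fits grow as far as the budget allows, which suits a cheaper model such as XGBoost.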
