CHF benchmark using synthetic data, paper finalized fixes too #63

myerspat · 2024-08-22T13:42:18Z

PR Description

This PR adds the CHF benchmark, described in the notebook at docs/source/benchmarks/chf.ipynb and in the docstring of pyMAISE.datasets.load_chf(). It also includes some quality-of-life fixes made before paper publication, listed below.

Closes: #49
Closes: #51

What changes were made?

CHF benchmark, data, and load function at pyMAISE.datasets.load_chf()
Added tensorflow.keras.Input layer for building DNNs in pyMAISE.methods.nnHyperModel.build()
Added a notebook output clearing function, pyMAISE.utils.display._try_clear(), and used it throughout pyMAISE routines
Added progress printing using tqdm in pyMAISE.PostProcessor and pyMAISE.CVTuner
Fixed pyMAISE.PostProcessor.confusion_matrix() to report both counts and percentages; it now handles binary and multiclass cases
Removed MSE in pyMAISE.PostProcessor.metrics() and added mean absolute percentage error (MAPE) and root mean squared percentage error (RMSPE)
Changed pyMAISE.PostProcessor.validation_plot() to a scatter plot
Added pretty-printing function (pyMAISE.PostProcessor.print_model()) for readable NN hyperparameters
Moved the Keras dependency to tensorflow.keras only
Reran all notebooks
Added network plotting at pyMAISE.PostProcessor.nn_network_plot()
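
For reference, here is a minimal sketch of the two metrics added to pyMAISE.PostProcessor.metrics(), written with the standard library only. The function names mape and rmspe, the percent scaling by 100, and the rejection of zero-valued targets are assumptions made for illustration; the actual pyMAISE implementation may differ (for example in scaling or in how multiple outputs are averaged).

```python
import math


def _check(y_true, y_pred):
    # Both metrics divide by the true value, so zeros are rejected here.
    # (Assumption for this sketch; a library may handle zeros differently.)
    if len(y_true) == 0 or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    if any(t == 0 for t in y_true):
        raise ValueError("true values must be non-zero")


def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent."""
    _check(y_true, y_pred)
    total = sum(abs((t - p) / t) for t, p in zip(y_true, y_pred))
    return 100.0 * total / len(y_true)


def rmspe(y_true, y_pred):
    """Root mean squared percentage error, in percent."""
    _check(y_true, y_pred)
    total = sum(((t - p) / t) ** 2 for t, p in zip(y_true, y_pred))
    return 100.0 * math.sqrt(total / len(y_true))


# Illustrative values only, not taken from the CHF data set.
y_true = [100.0, 200.0, 400.0]
y_pred = [110.0, 180.0, 400.0]
print(mape(y_true, y_pred))   # about 6.67
print(rmspe(y_true, y_pred))  # about 8.16
```

Both metrics are relative, so they are comparable across outputs with different units, which MSE is not. RMSPE weights large relative errors more heavily than MAPE does.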

Reviewers: @mradaideh

mradaideh · 2024-08-22T15:58:18Z

Thanks, Patrick, for the work. I went through my list and yours, and they overlap on many points. I will repeat the points below since they are not explicitly mentioned in your report; apologies if they are already covered:

1- Install tensorflow[and-cuda] by default when pyMAISE is installed via pip or from GitHub.
2- For installing from source, add this option for a quicker install without having to clone: pip install git+https://github.com/myerspat/pyMAISE.git
3- Add model.summary() for Keras networks by default to show all layers and nodes. This will also show the total number of parameters to estimate in the network.
4- Add a Keras network plotter. This does not need to be the default and can be invoked as an option in the PostProcessor class.
5- You have already addressed the transform issue in xarray. I hope you can explain it somewhere in the API or in a notebook. Also, it might be worth adding to your development notes (I keep forgetting them) a notebook example that shows tuning on a smaller dataset and then postprocessing on the full data, which can be used to demonstrate these nuances.

Aside from that, I look forward to pip installing the new version.

mradaideh · 2024-08-22T16:13:28Z

I also forgot to add: for the printing of the NN structure, you may add the number of nodes for the input and output layers, and test the printer with LSTM and CNN to see if it is consistent.

myerspat added 6 commits August 12, 2024 19:45

Changed to have a specific input layer and added model print function

7d4c5ad

Input layer, print progress and clear output, confusion matrix update…

668eeda

…s, MAPE/RMSPE, scatter error plot

removing input layer, parallel classical hyperparameter tuning

0bc1e37

finialized pre-paper edits with CHF data/loader

fd0f0b5

final runs of benchmarks and CHF benchmark

b40532f

reinstate large file precommit

f03bf0e

myerspat requested a review from mradaideh August 22, 2024 13:42

myerspat self-assigned this Aug 22, 2024

myerspat added 10 commits August 22, 2024 10:06

added load_chf to API reference

838511a

add step to create your own benchmark

eadf03a

fixing tensorflow random seed issues

dd314c7

fixing latex degree issue and plt.show()

d85da69

added CHF to readme and added this step to create your own

f087513

fixing docs for load_chf

75e9c33

Merge branch 'paper-fixes' of github.com:myerspat/pyMAISE into paper-…

b40474d

…fixes

fixing output in initial blurb

889fd2c

fixing parameter in load_chf

7fa9979

make postprocessor testing reflect new metrics

bfb844a

myerspat linked an issue Aug 22, 2024 that may be closed by this pull request

Change dictionary key name for layers and optimizers to be the same as when you call the actual Keras constructor. #51

Closed

1 task

myerspat added 4 commits August 22, 2024 15:08

added network plot, summary, and some additional docs

f8d6820

fixing doc issues with nn_network_plot

06c0982

remove clipnorm

8f1af64

finalized benchmarks and networks

ea21341

myerspat merged commit efc4672 into develop Aug 23, 2024
3 checks passed
