Copyright (C) 2020 Juergen Koefinger, Max Planck Institute of Biophysics, Frankfurt am Main, Germany
With contributions from Klaus Reuter, Max Planck Computing and Data Facility, Garching, Germany.
Released under the MIT Licence, see the file LICENSE.txt.
Powerful statistical tests for ordered data
Juergen Koefinger and Gerhard Hummer
Preprint: https://doi.org/10.26434/chemrxiv.13373351 (2020)
- Python 3
- Jupyter (for Jupyter notebooks only)
- Python modules (can be installed with
pip
orconda
):- numpy
- scipy
- mpmath
- matplotlib (for Jupyter notebooks only)
The hplusminus package can be installed in the following ways:
To install via pip
please run:
pip install --user hplusminus
After downloading and unpacking the source tarball please run:
python setup.py install --user
Python 3 script to evaluate test statistics for normalized residuals. This script contains the central functionality.
See help for more information and usage:
python hplusminus_tests.py -h
python hplusminus_tests.py ./examples/alternative_model_normalized_residuals.txt
python hplusminus_tests.py ./examples/true_model_normalized_residuals.txt
Notebooks in the directory ./ipynb/ serve to explore the capabilties of our statistical tests. See notebooks themselves for more details on purpose and usage.
Evaluate statistical tests.
Addtionally to the functionality of hplusminus_tests.py, the notebook provides plots of normalized residuals and signs and a bar-plot to visually compare the p-values of the various tests.
Calculate statistical power for all tests and given model
Generate models for residuals, which can be used with the Python script hplusminus_tests.py and the Jupyter notebooks hplusminus_tests.ipyn and hplusminus_statistical_power.ipynb.
Python 3 module file containing functions for the convenient evaluation of the statistical tests.
Python 3 module file for input and output.
Python 3 module file for the calculation of the Shannon information (neg. log-probabilities) of all test statistics (rld for Run-Length Distribution). Required by Python script hplusminus_tests.py and Jupyter notebooks hplusminus_tests.ipynb and hplusminus_statistical_power.ipynb.
Python 3 module file for the calculation of p-values using the gamma distribution approximation of the cumulative Shannon information distributions (SID). Required by Python script hplusminus_tests.py and Jupyter notebooks hplusminus_tests.ipynb and hplusminus_statistical_power.ipynb.
Python module hplusminus
Examples for normalized residuals generated with generate_models_for_residuals.ipynb.
Numpy binary files containing B-spline parameters (knots and coefficients) for gamma distribution parameters for all tests. Information is read from these files. No need for user interaction.
Directory containing Jupyter notebooks.
Directory used by Jupyter notebooks for input/output.