[Laboratory for Multiscale Regenerative Technologies]
Analyzing the activity of proteases and their substrates is critical to defining the biological functions of these enzymes and to designing new diagnostics and therapeutics that target protease dysregulation in disease.
To facilitate protease research, we present Protease Activity Analysis (PAA). PAA is a Python software package with a collection of tools for analyzing protease activity data. PAA provides a modular framework for streamlined analysis across three major components:
- Database: query and input datasets of synthetic peptide substrates and their cleavage susceptibilities across a diverse set of proteases.
- Cleavage analysis: analyze and visualize enzyme-substrate activity measurements generated through in vitro screens against synthetic peptide substrates.
- In vivo sensors: deploy a set of modular machine learning functions to analyze in vivo protease activity signatures that are generated by activity-based sensors.
The repository accompanies the paper, Protease Activity Analysis: A Toolkit for Analyzing Enzyme Activity Data in Python, and is developed and maintained by the paper's authors. If you use code from the repository, please cite the follwoing paper: coming soon!.
- Install the following dependencies: conda 3.x
- Generate Github SSH keys. If you already have SSH keys you can first check to make sure.
- Add your SSH key to your Github account.
- Download this repository:
git clone git@github.com:apsoleimany/protease_activity_analysis.git
To use the protease activity analysis (paa) toolbox, first create the environment and then install the package:
cd protease_activity_analysis
conda env create -f environment.yml
conda activate paa
pip install -e .
To enter the protease environment: conda activate paa
Once inside the environment, the package can be directly imported and used in a Python shell/script:
>>> import protease_activity_analysis as paa
>>> paa.tests.test_installation()
To deactivate after you are done: conda deactivate
To get started using PAA, we recommend you select a tutorial in the tutorials
folder that is most aligned with your application of interest. We also provide three template analysis scripts in the root directory:
analyze_kinetic.py
: analysis of in vitro screening data;analyze_ms_data.py
: analaysis and visualization of in vivo activity sensor data;classify_ms_data.py
: machine learning classification of in vivo activity sensor data.
If you use the code from the repository, please cite the accompanying paper:
@article{Soleimany2022,
title = {Protease Activity Analysis: A Toolkit for Analyzing Enzyme Activity Data},
author = {Soleimany, Ava P and Martin Alonso, Carmen and Anahtar, Melodi N and Wang, Cathy S and Bhatia, Sangeeta N},
journal = {bioRxiv},
year = {2022}
}
Code should be written in compliance with the Google Python style guide with Google style docstrings.
Please direct correspondence to Ava Soleimany (avasoleimany@microsoft.com).