Genomics-based annotations help unveil the molecular composition of edible plants

Nutrition and wellbeing take a central role in today’s fast-paced life, but how much do we really know about the food we eat? Here, we harness existing metabolic knowledge encoded in the genomes of staple food ingredients to explore the composition of raw edible plants. We first show the value of examining genome-associated functional annotations on a wide scale. Next, we rely on new experimental data to develop a framework that helps reveal new, potentially bioactive compounds in staple food ingredients. This is significant in two ways: it extends current food composition knowledge, and it uncovers newly detected bioactive compounds, shedding light on the potential impacts of common food ingredients beyond the nutritional value described on food labels. Finally, we show that staple foods already included in our daily diets might have the potential to be ‘superfoods’ that contribute to our wellbeing.

Here we explore to what degree existing genome-associated metabolic annotations can offer a valuable resource for deepening knowledge of food composition. To do so while focusing on the edible parts of plants, we rely on metabologenomics, an approach that integrates genomics and metabolomics and has been used in the past to discover novel natural products.

Installation

To use the code in this repository, install the packages listed in requirements.txt:

pip install -r requirements.txt

Code and Data

Code

plant_analysis

General plant analysis

PathEnrichmentPerOrg_bonferroni.py:

Contains the code for the pathway enrichment analysis and Bonferroni correction for all plants.
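As a rough illustration of what a pathway enrichment test followed by a Bonferroni correction can look like, here is a minimal sketch using only the Python standard library. It assumes a one-sided hypergeometric test, which is a common choice but is not confirmed to be the test used in PathEnrichmentPerOrg_bonferroni.py. The function names, pathway names, and counts are all hypothetical toy values.

```python
from math import comb


def hypergeom_sf(k, M, n, N):
    """P(X >= k) when drawing N items without replacement from M items,
    n of which belong to the pathway of interest."""
    upper = min(n, N)
    total = comb(M, N)
    hits = sum(comb(n, i) * comb(M - n, N - i) for i in range(k, upper + 1))
    return hits / total


def bonferroni(p_values):
    """Multiply each p-value by the number of tests, capping at 1.0."""
    m = len(p_values)
    return {name: min(1.0, p * m) for name, p in p_values.items()}


# Hypothetical toy numbers, not taken from the repository data.
background_size = 100  # all annotated compounds across plants
organism_size = 20     # compounds annotated for one plant
pathways = {
    # name: (pathway size in background, pathway compounds found in the plant)
    "pathway_A": (10, 8),
    "pathway_B": (30, 6),
}

raw = {
    name: hypergeom_sf(found, background_size, size, organism_size)
    for name, (size, found) in pathways.items()
}
adjusted = bonferroni(raw)

for name in pathways:
    print(f"{name}: raw p = {raw[name]:.3g}, Bonferroni p = {adjusted[name]:.3g}")
```

Bonferroni is the most conservative of the common multiple-testing corrections, so a pathway that stays below the chosen threshold after adjustment is unlikely to be a chance finding.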

Corn specific analysis

corn_analysis.py:

Contains all the analysis components and figures for corn-related compounds, except for figure 5.

structure_analysisCorn.py:

Calculates the similarity between corn-related compounds and creates the clustermap presented in figure 5.
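To make the idea of a pairwise compound similarity matrix concrete, the sketch below computes Tanimoto (Jaccard) similarity between toy fingerprints. This is an assumption for illustration only: the README does not state which similarity measure or fingerprint structure_analysisCorn.py uses, and the compound names and bit sets here are hypothetical.

```python
def tanimoto(a, b):
    """Tanimoto similarity between two fingerprints given as sets of 'on' bits."""
    if not a and not b:
        # Convention chosen for this sketch to avoid dividing by zero.
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical fingerprints: each set lists the bit positions that are set.
fingerprints = {
    "compound_1": {1, 4, 7, 9},
    "compound_2": {1, 4, 8},
    "compound_3": {2, 3},
}

names = sorted(fingerprints)
matrix = [
    [tanimoto(fingerprints[row], fingerprints[col]) for col in names]
    for row in names
]

for name, row in zip(names, matrix):
    print(name, [round(value, 2) for value in row])
```

A symmetric matrix like this one is the kind of input a hierarchical clustering heatmap needs, with rows and columns reordered so that similar compounds sit next to each other.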

thermodynamic_feasibility_analysis

feasibleReactionsPerOrg.py:

Collects the reactions that have Gibbs free energy values and performs the scoring according to the thermodynamic feasibility analysis described in the paper.

kineticsPerformance_fb.py:

Calculates the performance metrics for the thermodynamic feasibility approach and creates some of the components for figure 6.
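One common way to measure how well a score separates experimentally detected compounds from undetected ones is the area under the ROC curve. Whether kineticsPerformance_fb.py reports this exact metric is an assumption; the sketch below only illustrates the idea using the rank-based (Mann-Whitney) formulation, with hypothetical scores and detection labels.

```python
def roc_auc(scores, labels):
    """Probability that a randomly chosen detected compound scores higher
    than a randomly chosen undetected one (ties count as half)."""
    positives = [s for s, y in zip(scores, labels) if y]
    negatives = [s for s, y in zip(scores, labels) if not y]
    if not positives or not negatives:
        raise ValueError("need at least one detected and one undetected compound")
    wins = sum(
        (p > n) + 0.5 * (p == n)
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Hypothetical feasibility scores and detection flags (1 = found in experiments).
scores = [0.9, 0.8, 0.4, 0.3, 0.1]
detected = [1, 1, 0, 1, 0]

auc = roc_auc(scores, detected)
print(f"ROC AUC = {auc:.3f}")
```

A value of 0.5 means the score ranks detected compounds no better than chance, while 1.0 means every detected compound outranks every undetected one.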

supporting_analysis

coconutdb_comparison.py:

Calculates and plots the comparison between coconutDB and the other databases presented in this work.

kegg_lipid_content.py: Counts the number of lipids in the KEGG database using the information found on the KEGG compound website.

pca_approch.ipynb: Calculates the comparison and the significance of overlap between the different data sources presented in this work.

plant_family.py: Creates the phylogenetic tree for the plants included in this analysis.

Data

All the data files are located in the data folder.

Metabolomics data (Metabolon_Data.xlsx) - Contains the results of the metabolomics experiments for the plants presented in the paper.

Metabolic annotations master table (plant_masterTable_100621_ds1.csv) - Contains all the annotations collected for the plants presented in the paper.

Table_s2 - A comparison of first-block InChIKeys from coconutDB, the compounds we predict to accumulate, and the experimentally detected compounds.

Thermodynamic feasibility score table (kinetics_all_plants_df_fb_avg.csv) - Summarizes the results of the thermodynamic feasibility approach: the score, whether the compound was found in the experiments, the first-block InChIKey, the group of full InChIKeys sharing that first block, the compound mass, and the number of reactions the compound participates in.
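For readers unfamiliar with first-block InChIKeys: a standard InChIKey has 27 characters in three hyphen-separated blocks, and the first 14-character block encodes the molecular connectivity, so stereoisomers of the same skeleton share it. The sketch below shows one way to group full InChIKeys by first block and to compare two sources at that level. The helper names and the split into "predicted" and "detected" lists are hypothetical and do not reflect the repository's actual code.

```python
from collections import defaultdict


def first_block(inchikey):
    """Return the 14-character connectivity block of an InChIKey."""
    return inchikey.strip().upper().split("-")[0]


def group_by_first_block(inchikeys):
    """Map each first block to the sorted list of full InChIKeys sharing it."""
    groups = defaultdict(set)
    for key in inchikeys:
        groups[first_block(key)].add(key.strip().upper())
    return {block: sorted(keys) for block, keys in groups.items()}


# Illustrative keys: the first two share a first block, the third does not.
predicted = [
    "WQZGKKKJIJFFOK-GASJEMHNSA-N",
    "WQZGKKKJIJFFOK-VFUOTHLCSA-N",
    "RYYVLZVUVIJVGH-UHFFFAOYSA-N",
]
detected = ["WQZGKKKJIJFFOK-GASJEMHNSA-N"]

groups = group_by_first_block(predicted)
shared = {first_block(k) for k in predicted} & {first_block(k) for k in detected}

print(groups)
print("First blocks found in both sources:", sorted(shared))
```

Comparing at the first-block level trades stereochemical detail for robustness, since different databases often record different stereoisomers or charge states of the same compound.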

Other files used for running the analysis, which can also serve as a demo, are in the util_files folder.

All data files are available at: https://zenodo.org/record/7860346#.ZEbk43bMK3C

For any questions or inquiries, please contact the Barabasi lab at https://www.barabasilab.com/
