Google Cloud Platform (GCP) for Bioinformatics

This repository shows how to use ☁️ Google Cloud Platform public cloud services to scale bioinformatics data analysis tasks using cloud best practices for GCP. This use cases featured as exampled are called any and all of the following: genomic-scale data workflows, pipelines, analysis or batch jobs.

This content is intended for researchers - in particular, this guide is for those who are NEW to working with GCP.
You have a number of options on how to use the materials provided in this course. A summary is shown below left.

This Repo includes content you can read, watch or run:

📗 READ - one page of this Repo (MD page)
📺 WATCH - linked YouTube screencasts
📙 RUN - Jupyter Notebook example
TRY - linked GitHub Repos
📘 EXPAND - linked (external) resources
🔍 SCAN - search a list in this Repo

NOTE: If you would like to learn more advanced concepts (including script examples and patterns) about working with Google Cloud Platform, see my Repo gcp-essentails --> link

TIP: If you are NEW to bioinformatics and have a computational background...

REVIEW my bioinformatics concepts tools and terms
- Designed for cloud practioners who are NEW to Bioinformatics
- The 'student notes repo' is named Team Teri - link to 'who is Teri?'
- This Repo includes links to explanations of bioinformatics concepts, tools and platforms - link

📺 Click below to WATCH 'Lynn's Welcome Video' (4 min) on YouTube

Why would I choose to use a public cloud vendor for bioinformatics?

⭐️ SAVE MONEY run (and pay for) scalable analysis jobs only when you need to run them
⭐️ SAVE TIME use vendor-managed infrastructure & best-practice patterns for fast repeatable research
📗 READ the FAQ for GCP bioinformatics for this Repo
📕 READ Nature article: "Cloud computing for genomic data analysis and collaboration"
📗 READ the top 4 most common use cases for using the public cloud for bioinformatics researchers

Contibutions

We love contributions! See this short style guide when making pull requests to this repo.

Name		Name	Last commit message	Last commit date
Latest commit History 979 Commits
0_Setup_GCP_account		0_Setup_GCP_account
1. Drug Discovery		1. Drug Discovery
1_Files_&_Data		1_Files_&_Data
2. bioPython Basics		2. bioPython Basics
2_Virtual_Machines_&_Docker_Containers		2_Virtual_Machines_&_Docker_Containers
3. Molecular Biology		3. Molecular Biology
3_Machine_Learning		3_Machine_Learning
4. Protein denaturing		4. Protein denaturing
4_Code_&_Cloud_Service_Tools		4_Code_&_Cloud_Service_Tools
5_Serverless_Compute_with_Functions		5_Serverless_Compute_with_Functions
6_Advanced_GCP_&_Scripts		6_Advanced_GCP_&_Scripts
images		images
.gitignore		.gitignore
12_DNA.ipynb		12_DNA.ipynb
1_FAQ.md		1_FAQ.md
2_TOPICS.md		2_TOPICS.md
3_USER-STORIES.md		3_USER-STORIES.md
4_FILE-TYPES.md		4_FILE-TYPES.md
5_GENOMIC-TOOLS.md		5_GENOMIC-TOOLS.md
6_ARCHITECTURE.md		6_ARCHITECTURE.md
7_CONTRIBUTING.md		7_CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Google Cloud Platform (GCP) for Bioinformatics

📺 Click below to WATCH 'Lynn's Welcome Video' (4 min) on YouTube

Why would I choose to use a public cloud vendor for bioinformatics?

Contibutions

About

Releases

Packages

Languages

License

Karthick-840/gcp-for-bioinformatics

Folders and files

Latest commit

History

Repository files navigation

Google Cloud Platform (GCP) for Bioinformatics

📺 Click below to WATCH 'Lynn's Welcome Video' (4 min) on YouTube

Why would I choose to use a public cloud vendor for bioinformatics?

Contibutions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages