Skip to content

Commit e99d4c0

Browse files
Update README.md
1 parent 909b88e commit e99d4c0

File tree

1 file changed

+10
-10
lines changed

1 file changed

+10
-10
lines changed

tutorials/README.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44
## Overview of Page Contents
55

66
+ [Biomedical Workflows on AWS](#bio)
7+
+ [Artificial Intelligence](#ai)
78
+ [Clinical Informatics](#ci)
89
+ [Download SRA Data](#sra)
910
+ [GWAS](#gwas)
@@ -17,7 +18,6 @@
1718
+ [Artificial Intelligence](#ai)
1819
+ [CryoEM](#cryoem)
1920
+ [Open Data](#open)
20-
+ [Generative AI on AWS](#genai)
2121

2222
---------------------------------
2323
## **Biomedical Workflows on AWS** <a name="bio"></a>
@@ -34,6 +34,15 @@ There are a lot of ways to run workflows on AWS. Here we list a few possibilitie
3434

3535
**Please also note, GPU machines cost more than most CPU machines, so be sure to shut these machines down after use, or apply an EC2 [lifecycle configuration](/docs/auto-shutdown-instance.md). You may also encounter service quotas to protect you from the accidental use of expensive machine types. If that happens, and you still want to use a certain instance type, follow these [instructions](/docs/service_quotas.md).**
3636

37+
## **Artificial Intelligence** <a name="ai"></a>
38+
Machine learning is a subfield of artificial intelligence that focuses on the development of algorithms and models that enable computers to learn from and make predictions or decisions based on data, without being explicitly programmed. Artificial intelligence and machine learning algorithms are being applied to a variety of biomedical research questions, ranging from image classification to genomic variant calling. AWS has a long list of AI/ML tutorials available and we have compiled a list here. Most recent development focuses on generative AI including use cases such as extracting information from text, transforming speech to text, and generating images from text. Sagemaker Studio allows the user to rapidly create, test, and train generative AI models and has ready to use models all contained with [JumpStart](https://docs.aws.amazon.com/sagemaker/latest/dg/studio-jumpstart.html). These models range from foundation models, fine-tunable models, and task-specific solutions.
39+
+ For examples of generative AI, look at [this AWS GitHub repo](https://github.com/aws-samples/amazon-sagemaker-generativeai).
40+
+ You can also view [our tutorials](https://github.com/STRIDES/NIHCloudLabAWS/tree/main/tutorials/notebooks/GenAI) using several AWS products
41+
+ For other AI use cases, we recommend you start with this comprehensive [on-demand workshop](https://catalog.workshops.aws/hcls-aiml/en-US/breast-cancer-classification) on how to use SageMaker Studio for a variety of AI/ML use cases including applying a classifier to RNAseq data, classifying tabular breast cancer data, buiding graph neural nets on HIV data, training a medical imaging model on chest scans, summarize scientific literature using foundation models, MLOps using gene expression data, and finally, performing antibody structure prediction.
42+
+ AWS has a very general tutorial [here](https://aws.amazon.com/getting-started/hands-on/build-train-deploy-machine-learning-model-sagemaker/) on how to build out an AI pipeline on SageMaker.
43+
+ These [general examples](https://github.com/aws/amazon-sagemaker-examples/tree/main/introduction_to_applying_machine_learning) will teach you how to use Sagemaker tools more broadly.
44+
+ You can also submit a training job to SageMaker, and have your final model uploaded to S3 using [PyTorch](https://sagemaker.readthedocs.io/en/stable/frameworks/pytorch/using_pytorch.html#train-a-model-with-pytorch), [Tensorflow](https://docs.aws.amazon.com/sagemaker/latest/dg/tf.html) or [Apache MXNet](https://docs.aws.amazon.com/sagemaker/latest/dg/mxnet.html).
45+
3746
## **Clinical Informatics** <a name="ci"></a>
3847
Clinical informatics, also known as healthcare informatics or medical informatics, is an interdisciplinary field that applies data science to healthcare data to improve patient care, enhance clinical processes, and facilitate medical research. It often involves integrating diverse data types including electronic health records, demographic, or environmental data. AWS offers two on demand workshops that walk you through AWS HealthLake for Population Health data analysis. [This first workshop](https://catalog.us-east-1.prod.workshops.aws/workshops/4849824d-084a-4a64-a237-f05027f54abc/en-US) shows you how to ingest data to HealthLake, query those data using Athena, visualize these data using QuickSight, then join FHIR data with environmental data and visualize the combined dataset. [The second workshop](https://catalog.us-east-1.prod.workshops.aws/workshops/498fdbc5-46e1-4cb0-97a0-f4ec3a30f26a/en-US/activity-streaming-data) also ingests data into HealthLake, then visualizes medical device data, uses AI to summarize clinical notes, and then transcribes clinical audio files and summarizes them.
3948

@@ -78,15 +87,6 @@ The [Accelerating Therapeutics for Opportunities in Medicine (ATOM) Consortium](
7887

7988
These notebooks were created to run in Google Colab, so if you run them in AWS, you will need to make a few modification. First, we recommend you use a [Sagemaker Studio Notebook](https://docs.aws.amazon.com/sagemaker/latest/dg/studio-updated.html) rather than a User-Managed notebook simply because it will have Tensorflow and other dependencies installed. Be sure to attach a GPU to your instance (T4 is fine). Also, you will need to comment out `%tensorflow_version 2.x` since that is a Colab-specific command. You will also need to `pip install` a few packages as needed. If you get errors with `deepchem`, try running `pip install --pre deepchem[tensorflow]` and/or `pip install --pre deepchem[torch]`. Also, some notebooks will require a Tensorflow kernel, while others require Pytorch. You may also run into a Pandas error, reach out to the ATOM GitHub developers for the best solution to this issue.
8089

81-
## **Artificial Intelligence** <a name="ai"></a>
82-
Machine learning is a subfield of artificial intelligence that focuses on the development of algorithms and models that enable computers to learn from and make predictions or decisions based on data, without being explicitly programmed. Artificial intelligence and machine learning algorithms are being applied to a variety of biomedical research questions, ranging from image classification to genomic variant calling. AWS has a long list of AI/ML tutorials available and we have compiled a list here. Most recent development focuses on generative AI including use cases such as extracting information from text, transforming speech to text, and generating images from text. Sagemaker Studio allows the user to rapidly create, test, and train generative AI models and has ready to use models all contained with [JumpStart](https://docs.aws.amazon.com/sagemaker/latest/dg/studio-jumpstart.html). These models range from foundation models, fine-tunable models, and task-specific solutions.
83-
+ For examples of generative AI, look at [this AWS GitHub repo](https://github.com/aws-samples/amazon-sagemaker-generativeai).
84-
+ For other AI use cases, we recommend you start with this comprehensive [on-demand workshop](https://catalog.workshops.aws/hcls-aiml/en-US/breast-cancer-classification) on how to use SageMaker Studio for a variety of AI/ML use cases including applying a classifier to RNAseq data, classifying tabular breast cancer data, buiding graph neural nets on HIV data, training a medical imaging model on chest scans, summarize scientific literature using foundation models, MLOps using gene expression data, and finally, performing antibody structure prediction.
85-
+ AWS has a very general tutorial [here](https://aws.amazon.com/getting-started/hands-on/build-train-deploy-machine-learning-model-sagemaker/) on how to build out an AI pipeline on SageMaker.
86-
+ These [general examples](https://github.com/aws/amazon-sagemaker-examples/tree/main/introduction_to_applying_machine_learning) will teach you how to use Sagemaker tools more broadly.
87-
+ You can also submit a training job to SageMaker, and have your final model uploaded to S3 using [PyTorch](https://sagemaker.readthedocs.io/en/stable/frameworks/pytorch/using_pytorch.html#train-a-model-with-pytorch), [Tensorflow](https://docs.aws.amazon.com/sagemaker/latest/dg/tf.html) or [Apache MXNet](https://docs.aws.amazon.com/sagemaker/latest/dg/mxnet.html).
88-
89-
9090
## **CryoEM** <a name="cryoem"></a>
9191
Cryo-Electron Microscopy (cryoEM), is a powerful imaging technique used in structural biology to visualize the structures of biological macromolecules, such as proteins, nucleic acids, and large molecular complexes, at near-atomic or even atomic resolution. It has revolutionized the field of structural biology by providing detailed three-dimensional structures of biomolecules, which is crucial for understanding their functions.
9292

0 commit comments

Comments
 (0)