Skip to content

Commit a408582

Browse files
authored
Merge pull request #440 from PNNL-CompBio/pancpdo-manifest
Updated manifest and README
2 parents a0781c3 + 1b70f7c commit a408582

File tree

2 files changed

+59
-874
lines changed

2 files changed

+59
-874
lines changed

build/pancpdo/README.md

Lines changed: 28 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,32 @@
1-
## HCMI Data
1+
## Pancreatic PDO Data
2+
3+
4+
Here we will store the scripts required to process the omics data from the
5+
Genomic Data Commons together with the drug response data.
6+
7+
The GDC hosts the panc pdo omcis data, so to update we need an
8+
up-to-date manifest, obtained as follows:
9+
10+
11+
1. Navigate to the [GDC Data
12+
Portal](https://portal.gdc.cancer.gov/analysis_page?app=Projects),
13+
and select 'ORGANOID-PANCREATIC'
14+
2. Click on the 'Cases' button, and select the download button where
15+
it lists the number of files.
16+
3. This will download the ENTIRE Manifest
17+
4. Filter the manifest for RNASeq, WGS mutations, and copy number
18+
(though i dont think thi dataset has copy number)
19+
calls using the following command:
20+
```
21+
cat ~gdc_manifest.2025-07-08.091940.txt | grep 'rna_seq\|md5'
22+
| 'grep counts\|md5' | grep 'txt\|maf\|tsv\|md5' > new_manifest.txt
23+
cp new_manifest.txt full_manifest.txt
24+
25+
```
26+
27+
The other data is stored [on synapse](https://www.synapse.org/Synapse:syn64597875).
28+
229

3-
Here we will store the scripts required to process the data from the [Human Cancer Models Initiative](https://ocg.cancer.gov/programs/HCMI)
430

531
## Build Docker
632

0 commit comments

Comments
 (0)