You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+17-17Lines changed: 17 additions & 17 deletions
Original file line number
Diff line number
Diff line change
@@ -1,12 +1,12 @@
1
-
# 1. MIRACUM-Pipe-docker
1
+
# MIRACUM-Pipe-docker
2
2
3
3
This repo offers a framework to easily work with the dockerized version of [MIRACUM-Pipe](https://github.com/AG-Boerries/MIRACUM-Pipe)
4
4
5
-
## 1.0. Disclaimer
5
+
## Disclaimer
6
6
7
7
MIRACUM-Pipe is intended for research use only!
8
8
9
-
## 1.1. Setup and installation
9
+
## Setup and installation
10
10
11
11
In order to run the miracum pipeline, one needs to setup tools and databases which we are not allowed to ship due to license issues.
12
12
We prepared this project in a way which allows you to easily add the specific components into the pipeline.
@@ -66,7 +66,7 @@ See `setup.sh -h` to list the available options. By default, we do not install t
66
66
67
67
- Move both created files to the annovar/humandb folder.
68
68
69
-
## 1.2. How to configure and run it
69
+
## How to configure and run it
70
70
71
71
The project structure is as follows:
72
72
@@ -92,7 +92,7 @@ There are three levels of configuration:
92
92
- `conf/custom.yaml` contains settings for the entire runtime environment and overwrites `default.yaml`'s values
93
93
- In each patient directory one a `patient.yaml` can be created in which every setting of the other two configs can be overwritten.
94
94
95
-
### 1.2.1. Setting up a patient
95
+
### Setting up a patient
96
96
97
97
It is intended to create a patient folder in `input` for each patient containing `patient.yaml`. Further, we recommend to define in it at least the following parameters:
98
98
@@ -103,7 +103,7 @@ annotation:
103
103
protocol: wes # possible values are either wes for whole exome sequencing, requires a tumor and matched germline sample, or panel for tNGS, only the tumor samples is necessary.
104
104
```
105
105
106
-
#### 1.2.1.1 Example for whole-exome sequencing; protocol parameter: wes
106
+
#### Example for whole-exome sequencing; protocol parameter: wes
107
107
108
108
Place the germline R1 and R2 files as well as the tumor files (R1 and R2) into the *input* folder. Either name them `germline_R{1/2}.fastqz.gz` and `tumor_R{1/2}.fastq.gz` or adjust your `patient.yaml` accordingly:
109
109
@@ -118,7 +118,7 @@ common:
118
118
protocol: wes
119
119
```
120
120
121
-
#### 1.2.1.2 Example for tNGS; protocol paramter: panel
121
+
#### Example for tNGS; protocol paramter: panel
122
122
123
123
Place the tumor files (R1 and R2) into the *input* folder. Adjust your `patient.yaml` accordingly:
124
124
@@ -140,7 +140,7 @@ tools:
140
140
flatReference: FlatReference_TruSight_Tumor.cnn
141
141
```
142
142
143
-
### 1.2.2. Setting up the environment
143
+
### Setting up the environment
144
144
145
145
The `costum.yaml`is intended to add parameters specifying the local environment. This could encompass the resources available, i.e. number of cores and memory, the processing author as well as the reference genome and / or capture region files.
146
146
Of course, all the settings could be set in the `patient.yaml`as well.
@@ -170,11 +170,11 @@ reference:
170
170
captureCorFactors : targetCapture_cor_factors.rda
171
171
```
172
172
173
-
### 1.2.3. Run the pipeline
173
+
### Run the pipeline
174
174
175
175
There are multiple possibilities to run the pipeline:
176
176
177
-
#### 1.2.3.1 Whole-exome sequencing
177
+
#### Whole-exome sequencing
178
178
179
179
Assumption: Patient folder name *Patient_example* within the *input* folder under assets/input.
180
180
@@ -202,7 +202,7 @@ For more information see at the help of the command by running:
202
202
./miracum_pipe.sh
203
203
```
204
204
205
-
#### 1.2.3.2 tNGS
205
+
#### tNGS
206
206
207
207
Assumption: Patient folder name *TST170_example* within the *input* folder under assets/input.
208
208
@@ -230,7 +230,7 @@ For more information see at the help of the command by running:
230
230
./miracum_pipe.sh
231
231
```
232
232
233
-
### 1.2.4. Parallel computation
233
+
### Parallel computation
234
234
235
235
The MIRACUM-Pipe consits of five major steps (tasks) of which several can be computed in parallel:
236
236
@@ -241,21 +241,21 @@ The MIRACUM-Pipe consits of five major steps (tasks) of which several can be com
241
241
After the pipeline finishes successfully, it creates the file `.processed` into the patient's direcotry. Per default processed patients are skipped.
242
242
The flag `-f` forces a recomputation and neglects that file. Furhtermore, sometimes it is required to rerun a single task. Therefore, use the flag `-t`.
243
243
244
-
## 1.3. Logging
244
+
## Logging
245
245
246
246
MIRACUM-pipe writes its logfiles into `output/<patient_name>/log`. For each task in the pipeline an own logfile is created. With the help of these logfiles one can monitor the current status of the pipeline process.
247
247
248
-
## 1.4. Parallell & sequential computing
248
+
## Parallell & sequential computing
249
249
250
250
In `conf/custom.yaml` one can setup ressource parameters as cpucores and memory. If not intentionally called the pipeline on as single thread (sequentially), several tasks compute in parallel. The ressources are divided, thus you can enter the real 100% ressource you want to offer the entire pipline processes. Single threaded is intended to be used incase of limited hardware ressources or very large input files.
251
251
252
252
**BEWARE**: if you set tmp to be a tempfs (into ram), please consider this, while deciding the process ressources.
253
253
254
-
## 1.5. Run the docker image behind a proxy
254
+
## Run the docker image behind a proxy
255
255
256
256
If you need a proxy to connect to the internet you need to change the command running the docker. We provided the needed parameters in the docker run command. You only need to uncomment the lines 86-93 in [miracum_pipe.sh](https://github.com/AG-Boerries/MIRACUM-Pipe-docker/blob/master/miracum_pipe.sh), add the proxy server address and port and comment or delete the lines 76-83.
257
257
258
-
## 1.6 Limitations
258
+
## Limitations
259
259
260
260
MIRACUM-Pipe is currently testfor the whole-exome protocol for the capture kits V5UTR and V6. The tool used for mutation signature analysis is currently only compatible with the following kits:
261
261
@@ -274,6 +274,6 @@ The name of the kit has to be supplied with the *captureRegionName* parameter. W
274
274
275
275
For the tNGS protocol MIRACUM-Pipe is tested for the Illumina TruSight Tumor 170 panel.
276
276
277
-
## 1.7. License
277
+
## License
278
278
279
279
This work is licensed under [GNU Affero General Public License version 3](https://opensource.org/licenses/AGPL-3.0).
0 commit comments