File Format: Mutation Data (Counts)

Mark Keller edited this page Jan 31, 2019 · 1 revision

Mutation count files are split by category type (e.g. SBS_96, DBS_78). Rows correspond to samples and columns correspond to mutation categories. The first column contains the Sample ID, and each subsequent column holds the number of mutations in one category for that sample.

An example SBS_96 counts file looks as follows:

A[C>A]A A[C>A]C A[C>A]G A[C>A]T ... T[T>G]T
TCGA-OR-A5J1-090 1 4 0 12 ... 3
TCGA-OR-A5J1-091 2 0 0 5 ... 0
... ... ... ... ... ... ...

The ordering of the category columns does not matter.
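
As an illustration of how such a file could be read, here is a minimal Python sketch. It is not part of any official tooling: the function name `read_counts` is hypothetical, and the tab delimiter is an assumption, since this page does not state which delimiter the files use. The sketch also accepts a header row with or without a label for the Sample ID column. The sample data reuses the two samples from the example above, truncated to the four categories that are shown.

```python
import csv
import io


def read_counts(handle, delimiter="\t"):
    """Parse a counts file into {sample_id: {category: count}}.

    Assumption: fields are tab-separated (pass another delimiter if not).
    """
    reader = csv.reader(handle, delimiter=delimiter)
    header = next(reader)
    counts = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        # The header may or may not carry a label for the Sample ID column.
        categories = header[1:] if len(header) == len(row) else header
        if len(categories) != len(row) - 1:
            raise ValueError(f"unexpected number of fields for {row[0]!r}")
        # Key counts by category name so column order is irrelevant.
        counts[row[0]] = {cat: int(val) for cat, val in zip(categories, row[1:])}
    return counts


# Truncated example: only the four categories visible in the sample above.
example = (
    "\tA[C>A]A\tA[C>A]C\tA[C>A]G\tA[C>A]T\n"
    "TCGA-OR-A5J1-090\t1\t4\t0\t12\n"
    "TCGA-OR-A5J1-091\t2\t0\t0\t5\n"
)
counts = read_counts(io.StringIO(example))
print(counts["TCGA-OR-A5J1-090"]["A[C>A]T"])
```

Because each count is looked up by its category name rather than by position, a file whose category columns appear in a different order yields the same result.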