Replies: 2 comments 6 replies
-
Hello Efstathios,
|
Beta Was this translation helpful? Give feedback.
-
Dear @PoisonAlien, unfortunately based on the subsetting of MAF objects I'm facing another error-in detail when I'm trying to subset the following MAF object into two separate MAF objects and concatenate them at a later step: TCGAmutations::tcga_available()
brca.maf <- TCGAmutations::tcga_load(study = "BRCA")
# possible approach to filter genes with overall low mutational frequency
dt <- brca.maf@data
dat.sel <- dt %>% distinct(Hugo_Symbol,Tumor_Sample_Barcode) %>% mutate(value=1) %>% pivot_wider(names_from = "Tumor_Sample_Barcode", values_from="value",values_fill=0)
to.plot <- data.table(mutation = dat.sel$Hugo_Symbol,
frequency = rowMeans(dat.sel[,-1],na.rm=T)) # perhaps filter based on the total frequency-for example, < than 2%
pass.freq <- to.plot[to.plot$frequency>=0.01,]
pass.freq.genes <- pass.freq$mutation
maf.brca.selgenes <- subsetMaf(brca.maf, genes = pass.freq.genes, mafObj = T, isTCGA = T)
brca.tp53.sel <- subsetMaf(maf.brca.selgenes,genes="TP53", query= "HGVSp_Short%in%c('p.R273H','p,R248W', 'p.R273C', 'p.R273L','p.R248Q','p.R175H')", mafObj = T, isTCGA = T)
all_samples = levels(getSampleSummary(x = maf.brca.selgenes)[,Tumor_Sample_Barcode])
TP53.hotspots <- levels(getSampleSummary(x = brca.tp53.sel)[,Tumor_Sample_Barcode])
TP53.mutants = genesToBarcodes(maf =maf.brca.selgenes, genes = "TP53", justNames = TRUE)
dt <- setdiff(TP53.mutants$TP53,TP53.hotspots)
dt2 <- setdiff(all_samples,dt)
braf_maf_other = subsetMaf(maf = maf.brca.selgenes, tsb = dt2, mafObj = T, isTCGA = T)
Error in subsetMaf(maf = maf.brca.selgenes, tsb = dt2, mafObj = T, isTCGA = T) :
Subsetting has resulted in zero non-synonymous variants!
sessionInfo()
R version 4.0.3 (2020-10-10)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 18363)
Matrix products: default
locale:
[1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252
[3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C
[5] LC_TIME=English_United States.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] forcats_0.5.1 stringr_1.4.0 dplyr_1.0.6 purrr_0.3.4
[5] readr_1.4.0 tidyr_1.1.3 tibble_3.1.2 ggplot2_3.3.4
[9] tidyverse_1.3.1 TCGAmutations_0.3.0 data.table_1.14.0 maftools_2.6.05
Thus, why this is returning an error? My ultimate goal was to combine the maf samples that had specific TP53 hotspot mutations, along with the rest of the samples that do not have any TP53 mutation, in order then to search for putative somatic interactions-however, I can't identify where the error occurs in which of the subset steps; Thank you in advance, Efstathios |
Beta Was this translation helpful? Give feedback.
-
Dear Anand,
I would like to ask a particular question concerning the robust utilization of the R packages maftools and TCGAmutations:
in particular, based on a current project analysis, we are aiming on identify putative co-mutation/occurrence patterns based on TP53 mutations in breast cancer and other putative cancer drivers, in order to prioritize possible co-dependencies which could sensitize breast cancer cell lines and validate putative experimental designs; on this premise, based on the available TCGA breast cancer cohort from TCGAmutations, my main methodological questions are the following:
somaticInteractions
function as illustrated here would suffice?http://bioconductor.org/packages/release/bioc/vignettes/maftools/inst/doc/maftools.html#9_Analysis
And then specifically check any significant p-values that include as one gene the TP53? and as event "co-occurrence"?
If some initial filters would be applied prior running the above function, for example excluding any genes with overall mutational frequency of less than 0.1, is this possible through maftools? In addition, to include only specific TP53 hotspot mutations, such as R273H ?
A parallel analysis based on your expertise, could be to separate the maf object into two distinct "maf" populations; that is TP53 mutated, and wild type TP53? and identify the top differentially mutated genes, and investigate also any mutational patterns based on oncoplots?
Thank you in advance,
Efstathios
Beta Was this translation helpful? Give feedback.
All reactions