-
Notifications
You must be signed in to change notification settings - Fork 0
/
chapt3.tex
executable file
·2720 lines (2465 loc) · 218 KB
/
chapt3.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
\chapter{Synthetic Lethal Analysis of Gene Expression Data}
\label{chap:SLIPT}
Having developed a statistical \gls{synthetic lethal} detection methodology, \gls{SLIPT}, it was next applied to publicly available cancer \gls{gene expression} datasets. The analysis presented in this Chapter focuses on breast cancer for which \gls{TCGA} \glslink{gene expression}{expression} data \citep{TCGA2012} from a patient cohort and \gls{siRNA} screen data, from experiments conducted in MCF10A cells \citep{Telford2015}, were available. Stomach cancer data \citep{TCGA2014GC} was used to replicate findings in an independent dataset, with this cancer chosen because it also occurs in syndromic \gls{HDGC} patients. The \gls{TCGA} data also has the advantages of having other clinical and molecular profiles, including \glslink{somatic}{somatic} \gls{mutation} across many of the same samples, in addition to a considerable sample size for \gls{RNA-Seq} \glslink{gene expression}{expression} data generated with common \gls{TCGA} procedures to minimise batch effects. %Some findings were replicated in the CCLE \citep{Barretina2012} for comparison to the cell line experiments.
Synthetic lethal candidate partners for \textit{CDH1} were identified at both the gene and \gls{pathway} level. \gls{SLIPT} gene candidates were analysed by cluster analysis for common \glslink{gene expression}{expression} profiles across samples and relationships with clinical factors and \glspl{mutation} in key breast cancer genes. These genes were also compared to the gene candidates from primary and secondary (validation) screens conducted by \citet{Telford2015} on isogenic cell lines. For comparison, the \gls{SLIPT} methodology was also applied using \gls{mutation} data for \textit{CDH1} against \glslink{gene expression}{expression} of candidate partners (as described in Section~\ref{methods:SLIPT}) which may better represent the null \glspl{mutation} in \gls{HDGC} patients and the experimental cell model \citep{Chen2014}. Pathways were analysed by over-representation analysis (with resampling for comparisons with \gls{siRNA} data) and supported by a \gls{metagene} analysis of \glspl{pathway} gene signatures. The \glspl{pathway} meta\gls{gene expression} profiles were used to replicate known relationships between clinical and molecular characteristics for breast cancer and to demonstrate application of \gls{SLIPT} directly on \glspl{metagene} to detect \gls{synthetic lethal} \glspl{pathway}.
\section{Synthetic Lethal Genes in Breast Cancer} \label{chapt3:exprSL_genes}
The \gls{SLIPT} methodology (as described in Section~\ref{methods:SLIPT}) was applied to the normalised \gls{TCGA} breast cancer \gls{gene expression} dataset ($n = 1168$). As shown in Table~\ref{tab:gene_SL}, the most significant \textcolor{black}{of the 5165} genes had strong evidence of \glslink{gene expression}{expression}-based association with \textit{CDH1} (high $\chi^2$ values) with fewer samples exhibiting low \glslink{gene expression}{expression} of both genes than expected statistically. Eukaryotic translation genes were among the highest scoring gene candidates, including initiation factors, elongation factors, and ribosomal proteins. These are clearly necessary for cancer cells to grow and proliferate, with sustained \gls{gene expression} needed to maintain growth signalling \glspl{pathway} and resist apoptosis or immune factors, translation genes may be subject to \gls{non-oncogene addiction} for \textit{CDH1}-deficient cells.
While these are among the strongest \gls{synthetic lethal} candidates, translational genes are \textcolor{black}{crucial} to the viability of healthy cells and dosing for a selective \gls{synthetic lethal} effect against these may be difficult compared to other biological functions which may also be supported among the \gls{SLIPT} candidate genes. Furthermore, few known biological functions of \textit{CDH1} were among the strongest SL candidates, so the remaining candidate genes may also be informative since they are likely to contain these expected functions in addition to novel relationships for \textit{CDH1}. Thus further \gls{pathway} level analyses were also conducted to examine biological functions over-represent\-ed among synthetic candidate genes and to identify \gls{synthetic lethal} \glspl{pathway}.
\begin{table*}[!ht]
\caption{Candidate \gls{synthetic lethal} gene partners of \textit{CDH1} from SLIPT}
\label{tab:gene_SL}
\centering
\resizebox{0.8 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{>{\em}sl^c^c^c^c^c}
\rowstyle{\bfseries}
\em{Gene} & Observed\tnote{*} & Expected\tnote{*} & $\chi^2$ value & p-value & p-value (\gls{FDR}) \\
\hline
\rowcolor{black!10}
TRIP10 & 62 & 130 & 162 & $5.65 \times 10^{-34}$ & $1.84 \times 10^{-31}$ \\
\rowcolor{black!5}
EEF1B2 & 56 & 130 & 158 & $3.10 \times 10^{-33}$ & $9.45 \times 10^{-31}$ \\
\rowcolor{black!10}
GBGT1 & 61 & 131 & 156 & $1.08 \times 10^{-32}$ & $3.14 \times 10^{-30}$ \\
\rowcolor{black!5}
ELN & 81 & 130 & 149 & $3.46 \times 10^{-31}$ & $8.82 \times 10^{-29}$ \\
\rowcolor{black!10}
TSPAN4 & 78 & 130 & 146 & $1.63 \times 10^{-30}$ & $3.79 \times 10^{-28}$ \\
\rowcolor{black!5}
GLIPR2 & 72 & 130 & 146 & $1.68 \times 10^{-30}$ & $3.86 \times 10^{-28}$ \\
\rowcolor{black!10}
RPS20 & 73 & 131 & 145 & $1.89 \times 10^{-30}$ & $4.28 \times 10^{-28}$ \\
\rowcolor{black!5}
RPS27A & 80 & 130 & 143 & $5.53 \times 10^{-30}$ & $1.18 \times 10^{-27}$ \\
\rowcolor{black!10}
EEF1A1P9 & 63 & 130 & 141 & $1.91 \times 10^{-29}$ & $3.74 \times 10^{-27}$ \\
\rowcolor{black!5}
C1R & 73 & 130 & 141 & $2.05 \times 10^{-29}$ & $3.97 \times 10^{-27}$ \\
\rowcolor{black!10}
LYL1 & 73 & 130 & 140 & $2.99 \times 10^{-29}$ & $5.74 \times 10^{-27}$ \\
\rowcolor{black!5}
RPLP2 & 71 & 130 & 139 & $4.88 \times 10^{-29}$ & $9.07 \times 10^{-27}$ \\
\rowcolor{black!10}
C10orf10 & 73 & 130 & 138 & $6.72 \times 10^{-29}$ & $1.20 \times 10^{-26}$ \\
\rowcolor{black!5}
DULLARD & 74 & 131 & 138 & $9.29 \times 10^{-29}$ & $1.61 \times 10^{-26}$ \\
\rowcolor{black!10}
PPM1F & 64 & 130 & 136 & $1.61 \times 10^{-28}$ & $2.65 \times 10^{-26}$ \\
\rowcolor{black!5}
OBFC2A & 69 & 130 & 136 & $2.49 \times 10^{-28}$ & $3.93 \times 10^{-26}$ \\
\rowcolor{black!10}
RPL11 & 70 & 130 & 136 & $2.56 \times 10^{-28}$ & $3.97 \times 10^{-26}$ \\
\rowcolor{black!5}
RPL18A & 70 & 130 & 135 & $3.08 \times 10^{-28}$ & $4.70 \times 10^{-26}$ \\
\rowcolor{black!10}
MFNG & 76 & 131 & 133 & $7.73 \times 10^{-28}$ & $1.12 \times 10^{-25}$ \\
\rowcolor{black!5}
RPS17 & 77 & 131 & 133 & $8.94 \times 10^{-28}$ & $1.29 \times 10^{-25}$ \\
\rowcolor{black!10}
MGAT1 & 73 & 130 & 132 & $1.44 \times 10^{-27}$ & $2.03 \times 10^{-25}$ \\
\rowcolor{black!5}
RPS12 & 72 & 130 & 128 & $8.57 \times 10^{-27}$ & $1.12 \times 10^{-24}$ \\
\rowcolor{black!10}
C10orf54 & 73 & 130 & 127 & $1.37 \times 10^{-26}$ & $1.75 \times 10^{-24}$ \\
\rowcolor{black!5}
LOC286367 & 72 & 130 & 126 & $2.20 \times 10^{-26}$ & $2.70 \times 10^{-24}$ \\
\rowcolor{black!10}
GMFG & 70 & 130 & 126 & $2.20 \times 10^{-26}$ & $2.70 \times 10^{-24}$ \\
\hline
\end{tabular}
\begin{tablenotes}
\raggedright %\small
Strongest candidate \gls{synthetic lethal} partners for \textit{CDH1} \textcolor{black}{of 5165 genes detected} by \gls{SLIPT} \textcolor{black}{(with FDR p-value $<0.05$)} in \gls{TCGA} breast cancer expression data
\item[*] Observed and expected numbers of samples which had low \glslink{gene expression}{expression} of both genes
\end{tablenotes}
\end{threeparttable}
}
\end{table*}
The modified \acrshort{mtSLIPT} methodology (as described in Section~\ref{methods:SLIPT}) was also applied to the normalised \gls{TCGA} breast cancer \gls{gene expression} dataset, against \glslink{somatic}{somatic} loss of function \glspl{mutation} in \textit{CDH1}. As shown in Table~\ref{tab:gene_mtSL}, the most significant genes also had strong evidence of \glslink{gene expression}{expression} associated with \textit{CDH1} \glspl{mutation} (high $\chi^2$ values) with fewer samples with \textit{CDH1} exhibiting low \glslink{gene expression}{expression} each candidate gene than expected statistically. These genes were not as strongly supported as the \glslink{gene expression}{expression} analysis (in Table~\ref{tab:gene_SL}), however, nor were as many genes detected. This is perhaps unsurprising due to the lower sample size with matching \glslink{somatic}{somatic} \gls{mutation} data and the lower frequency of \textit{CDH1} \glspl{mutation} compared to low \glslink{gene expression}{expression} defined by $\sfrac{1}{3}$ quantiles.
\begin{table*}[!ht]
\caption{Candidate synthetic lethal gene partners of \textit{CDH1} from mtSLIPT}
\label{tab:gene_mtSL}
\centering
\resizebox{0.8 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{>{\em}sl^c^c^c^c^c}
\rowstyle{\bfseries}
\em{Gene} & Observed\tnote{*} & Expected\tnote{*} & $\chi^2$ value & p-value & p-value (\gls{FDR}) \\
\hline
\rowcolor{black!10}
TFAP2B & 8 & 36.7 & 89.5 & $3.60 \times 10^{-20}$ & $8.37 \times 10^{-17}$ \\
\rowcolor{black!5}
ZNF423 & 15 & 36.7 & 78.8 & $7.89 \times 10^{-18}$ & $1.22 \times 10^{-14}$ \\
\rowcolor{black!10}
CALCOCO1 & 11 & 36.7 & 76.8 & $2.09 \times 10^{-17}$ & $2.59 \times 10^{-14}$ \\
\rowcolor{black!5}
RBM5 & 13 & 36.7 & 75.7 & $3.65 \times 10^{-17}$ & $4.00 \times 10^{-14}$ \\
\rowcolor{black!10}
BTG2 & 7 & 36.7 & 71.7 & $2.72 \times 10^{-16}$ & $1.81 \times 10^{-13}$ \\
\rowcolor{black!5}
RXRA & 6 & 36.7 & 70.5 & $5.00 \times 10^{-16}$ & $2.97 \times 10^{-13}$ \\
\rowcolor{black!10}
SLC27A1 & 11 & 36.7 & 70.3 & $5.42 \times 10^{-16}$ & $2.97 \times 10^{-13}$ \\
\rowcolor{black!5}
MEF2D & 12 & 36.7 & 69.6 & $7.86 \times 10^{-16}$ & $3.95 \times 10^{-13}$ \\
\rowcolor{black!10}
NISCH & 12 & 36.7 & 69.6 & $7.86 \times 10^{-16}$ & $3.95 \times 10^{-13}$ \\
\rowcolor{black!5}
AVPR2 & 9 & 36.7 & 69.2 & $9.36 \times 10^{-16}$ & $4.58 \times 10^{-13}$ \\
\rowcolor{black!10}
CRY2 & 13 & 36.7 & 68.9 & $1.07 \times 10^{-15}$ & $4.98 \times 10^{-13}$ \\
\rowcolor{black!5}
RAPGEF3 & 13 & 36.7 & 68.9 & $1.07 \times 10^{-15}$ & $4.98 \times 10^{-13}$ \\
\rowcolor{black!10}
NRIP2 & 10 & 36.7 & 68.2 & $1.58 \times 10^{-15}$ & $7.18 \times 10^{-13}$ \\
\rowcolor{black!5}
DARC & 12 & 36.7 & 66.4 & $3.76 \times 10^{-15}$ & $1.54 \times 10^{-12}$ \\
\rowcolor{black!10}
SFRS5 & 12 & 36.7 & 66.4 & $3.76 \times 10^{-15}$ & $1.54 \times 10^{-12}$ \\
\rowcolor{black!5}
NOSTRIN & 5 & 36.7 & 65.1 & $7.40 \times 10^{-15}$ & $2.70 \times 10^{-12}$ \\
\rowcolor{black!10}
KIF13B & 12 & 36.7 & 63.4 & $1.69 \times 10^{-14}$ & $5.16 \times 10^{-12}$ \\
\rowcolor{black!5}
TENC1 & 10 & 36.7 & 62.5 & $2.67 \times 10^{-14}$ & $7.40 \times 10^{-12}$ \\
\rowcolor{black!10}
MFAP4 & 12 & 36.7 & 60.5 & $7.17 \times 10^{-14}$ & $1.67 \times 10^{-11}$ \\
\rowcolor{black!5}
ELN & 13 & 36.7 & 59.7 & $1.07 \times 10^{-13}$ & $2.32 \times 10^{-11}$ \\
\rowcolor{black!10}
SGK223 & 14 & 36.7 & 59 & $1.51 \times 10^{-13}$ & $3.05 \times 10^{-11}$ \\
\rowcolor{black!5}
KIF12 & 11 & 36.7 & 58.8 & $1.74 \times 10^{-13}$ & $3.34 \times 10^{-11}$ \\
\rowcolor{black!10}
SELP & 11 & 36.7 & 58.8 & $1.74 \times 10^{-13}$ & $3.34 \times 10^{-11}$ \\
\rowcolor{black!5}
CIRBP & 9 & 36.7 & 58.7 & $1.83 \times 10^{-13}$ & $3.41 \times 10^{-11}$ \\
\rowcolor{black!10}
CTDSP1 & 9 & 36.7 & 58.7 & $1.83 \times 10^{-13}$ & $3.41 \times 10^{-11}$ \\
\hline
\end{tabular}
\begin{tablenotes}
\raggedright %\small
Strongest candidate \gls{synthetic lethal} partners for \textit{CDH1} \textcolor{black}{of 3743 genes detected} by \acrshort{mtSLIPT} \textcolor{black}{(with FDR p-value $<0.05$)} in \gls{TCGA} in breast cancer expression and mutation data
\item[*] Observed and expected numbers of \textit{CDH1} mutant \gls{TCGA} breast tumours with low expression of partner genes
\end{tablenotes}
\end{threeparttable}
}
\end{table*}
The \acrshort{mtSLIPT} candidates had more genes involved in cell and gene regulation, particularly \acrshort{DNA} and \acrshort{RNA} binding factors. The strongest candidates also included microtubule (\textit{KIF12}), microfibril (\textit{MFAP4}), and cell adhesion (\textit{TENC1}) genes consistent with the established cytoskeletal role of \textit{CDH1}. The elastin gene (\textit{ELN}) was notably strongly supported by both \glslink{gene expression}{expression} and \gls{mutation} \gls{SLIPT} analysis of \text{CDH1} supporting interactions with extracellular proteins and the tumour microenvironment.
%%appendix
%\label{tab:gene_mtSL}
\subsection{Synthetic Lethal Pathways in Breast Cancer} \label{chapt3:exprSL_pathways}
Translational \glspl{pathway} were strongly over-represented in \gls{SLIPT} partners, as shown in Table~\ref{tab:pathway_exprSL}. \textcolor{black}{The Reactome pathways analysed here include those which overlap as shown in Appendix Figure~\ref{fig:pathway_overlap}.} These \textcolor{black}{pathways} include ribosomal subunits, initiation, peptide elongation, and termination. Regulatory processes involving \acrshort{mRNA} including 3' untranslated region (UTR) binding, L13a-mediated translational silencing, and nonsense-mediated decay were also implicated. These are consistent with protein translation being subject to ``\gls{non-oncogene addiction}'' \citep{Luo2009}, as a core process that is dysregulated to sustain cancer proliferation and survival \citep{Gao2015}.
Immune \glspl{pathway}, including the adaptive immune system and responses to infectious diseases were also strongly implicated as \gls{synthetic lethal} with loss of \gls{E-cadherin}. This is consistent with the alterations of immune response being a hallmark of cancer \cite{Hanahan2000}, since evading the immune system is necessary for cancer survival. Either of these systems are potential means to target \textit{CDH1}-deficient cells, although these were not detected in an isolated cell line experimental screen \citep{Telford2015} and the differences between the findings in patient data are described in more detail in Section~\ref{chapt3:compare_pathway}.
\begin{table*}[!ht]
\caption{Pathways for \textit{CDH1} partners from SLIPT}
\label{tab:pathway_exprSL}
\centering
\resizebox{1 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{lccc}
\cellcolor{white} \textbf{Pathways Over-represented} & \textbf{Pathway Size} & \textbf{SL Genes} & \textbf{p-value (\gls{FDR})} \\
\hline
\rowcolor{black!10}
Eukaryotic Translation Elongation & 86 & 81 & $1.3 \times 10^{-207}$ \\
\rowcolor{black!5}
Peptide chain elongation & 83 & 78 & $5.6 \times 10^{-201}$ \\
\rowcolor{black!10}
Eukaryotic Translation Termination & 83 & 77 & $1.2 \times 10^{-196}$ \\
\rowcolor{black!5}
Viral \acrshort{mRNA} Translation & 81 & 76 & $1.2 \times 10^{-196}$ \\
\rowcolor{black!10}
Formation of a pool of free 40S subunits & 93 & 81 & $3.7 \times 10^{-194}$ \\
\rowcolor{black!5}
Nonsense Mediated Decay independent of the Exon Junction Complex & 88 & 77 & $5.3 \times 10^{-187}$ \\
\rowcolor{black!10}
L13a-mediated translational silencing of Ceruloplasmin \glslink{gene expression}{expression} & 103 & 82 & $9.6 \times 10^{-183}$ \\
\rowcolor{black!5}
3' -UTR-mediated translational regulation & 103 & 82 & $9.6 \times 10^{-183}$ \\
\rowcolor{black!10}
GTP hydrolysis and joining of the 60S ribosomal subunit & 104 & 82 & $1.9 \times 10^{-181}$ \\
\rowcolor{black!5}
Nonsense-Mediated Decay & 103 & 80 & $6.2 \times 10^{-176}$ \\
\rowcolor{black!10}
Nonsense Mediated Decay enhanced by the Exon Junction Complex & 103 & 80 & $6.2 \times 10^{-176}$ \\
\rowcolor{black!5}
Adaptive Immune System & 412 & 167 & $6.5 \times 10^{-174}$ \\
\rowcolor{black!10}
Eukaryotic Translation Initiation & 111 & 82 & $5.7 \times 10^{-173}$ \\
\rowcolor{black!5}
Cap-dependent Translation Initiation & 111 & 82 & $5.7 \times 10^{-173}$ \\
\rowcolor{black!10}
SRP-dependent cotranslational protein targeting to membrane & 104 & 79 & $2.0 \times 10^{-171}$ \\
\rowcolor{black!5}
Translation & 141 & 91 & $6.1 \times 10^{-170}$ \\
\rowcolor{black!10}
Infectious disease & 347 & 146 & $1.6 \times 10^{-166}$ \\
\rowcolor{black!5}
Influenza Infection & 117 & 81 & $1.9 \times 10^{-163}$ \\
\rowcolor{black!10}
Influenza Viral \acrshort{RNA} Transcription and Replication & 108 & 77 & $1.9 \times 10^{-160}$ \\
\rowcolor{black!5}
Influenza Life Cycle & 112 & 77 & $2.5 \times 10^{-156}$ \\
\hline
\end{tabular}
\begin{tablenotes}
\raggedright %\small
\textcolor{black}{The most significant pathways from g}ene set over-representation analysis (hypergeometric test) for Reactome \glspl{pathway} in \gls{SLIPT} partners for \textit{CDH1}.
\end{tablenotes}
\end{threeparttable}
}
\end{table*}
It is also notable that the \glspl{pathway} over-represented in \gls{SLIPT} candidate genes have strongly significant over-representation of Reactome \glspl{pathway} based on the hypergeometric test (as described in Section~\ref{methods:enrichment}). Even after adjusting stringently for multiple testing, biologically related \glspl{pathway} were supported together. These \glspl{pathway} are further supported by testing for \glspl{synthetic lethal} against \textit{CDH1} \glspl{mutation} (\acrshort{mtSLIPT}) with many of these \glspl{pathway} also among the most strongly supported in this analysis (shown in Appendix Table~\ref{tab:pathway_mtSL}). This \gls{mutation}-based analysis more closely represents the null \textit{CDH1} \glspl{mutation} in \gls{HDGC} \citep{Guilford1998} and the experimental MCF10A cell model \citep{Chen2014}. There was still support for translational and immune \glspl{pathway} not detected in the isolated experimental system. \glspl{GPCR} also among the most strongly supported \glspl{pathway}, supporting the experimental findings of \citet{Telford2015} for these intracellular signalling \glspl{pathway} already being targeted for other diseases.
%%appendix
%\label{tab:pathway_mtSL}
\FloatBarrier
\subsection{Expression Profiles of Synthetic Lethal Partners} \label{chapt3:exprSL_clusters}
Due to the sheer number of gene candidates \textcolor{black}{(e.g., 5165 for SLIPT)}, investigations proceeded into correlation structure and \glspl{pathway} over-represent\-ation. These analyses also examined \glslink{gene expression}{expression} patterns of \gls{synthetic lethal} gene candidates. This serves to explore the functional similarity of the \gls{synthetic lethal} partners of \textit{CDH1}, with the eventual aim to assess their utility as drug targets. As shown in Figure~\ref{fig:slipt_expr} (which clusters \textit{CDH1} lowly expressing samples separately), there were several large clusters of genes among the \glslink{gene expression}{expression} profiles of the \textit{CDH1} \gls{synthetic lethal} candidate partners. The clustering suggests co-regulation of genes or \glspl{pathway} correlation between partner gene candidates. A number of candidates from an experimental \gls{RNAi} screen study performed by \citet{Telford2015} were also identified by this approach. In addition, novel gene candidates were also identified, which had not been observed \textcolor{black}{to affect} viability in isogenic cell line experiments. %or in some cases, even opposite effects of selective cell death or toxicity to both isogenic cell lines.
\begin{figure*}[!htbp]
%\begin{mdframed}
\centering
\resizebox{0.99 \textwidth}{!}{
\includegraphics{CDH1_Heatmaps_Genes_Split_By_CDH1_z-trans_exprSL_cordistx_Pub.png}
}
\caption[Synthetic lethal \glslink{gene expression}{expression} profiles of analysed samples]{\small \textbf{Synthetic lethal \glslink{gene expression}{expression} profiles of analysed samples.} \Gls{gene expression} profile heatmap (correlation distance, complete linkage) of all samples (separated by the $\sfrac{1}{3}$ quantile of \textit{CDH1} \glslink{gene expression}{expression}) analysed in \gls{TCGA} breast cancer dataset for \gls{gene expression} of 5165 candidate partners of \gls{E-cadherin} (\textit{CDH1}) from \gls{SLIPT} prediction (with \gls{FDR} adjusted $p < 0.05$). Deeply clustered, inter-correlated genes form several main groups, each containing genes that were SL candidates or lethal in an \gls{siRNA} screen \citep{Telford2015}. Screen results for \gls{synthetic lethal} (SL), the reverse effect (RSL), or lethal cell viability are shown as reported by \citet{Telford2015}. Clusters had different sample groups highly expressing the \gls{synthetic lethal} candidates in \textit{CDH1} low samples, notably `normal-like', `basal-like', and \gls{ER} negative samples have elevated \glslink{gene expression}{expression} in one or more distinct clusters showing complexity and variation among candidate \gls{synthetic lethal} partners. \textit{CDH1} low samples also contained most of samples with \textit{CDH1} \glspl{mutation} (shown in black). Negative values for \gls{mutation} and screen data are shown in light grey, with missing data in white.
%This suggests that multiple targets may be needed to target \textit{CDH1} deficiency across genetic backgrounds and that combination therapy may be more effective.
}
\label{fig:slipt_expr}
%\end{mdframed}
\end{figure*}
In these \glslink{gene expression}{expression} profiles, a gene with a moderate or high signal across samples exhibiting low \textit{CDH1} \glslink{gene expression}{expression} would represent a potential drug target. However, it appears that several molecular subtypes of cancer have elevation of different clusters of \gls{synthetic lethal} candidates in samples with low \textit{CDH1}. This clustering \textcolor{black}{(shown by red correlated blocks of genes in Figure~\ref{fig:slipt_expr})} suggests that different targets (or combinations) could be effective in different patients, suggesting potential utility for stratification. In particular, \gls{ER} negative, basal-like subtype, and ``normal-like'' tumours \citep{Eroles2012, Parker2009, Dai2015} have elevation of genes specific to particular clusters, indicative of some \gls{synthetic lethal} interactions being specific to a particular molecular subtype or genetic background. Thus \gls{synthetic lethal} drug therapy against these subtypes may be ineffective if it were designed against genes in another cluster.
A similar correlation structure was observed among the candidates tested against \textit{CDH1} \gls{mutation} (\acrshort{mtSLIPT}), as shown in Appendix Figure~\ref{fig:slipt_expr_mtSL}. This clustering analysis similarly identified several major clusters of putative \gls{synthetic lethal} partner genes. In this case, many partner genes had consistently high \glslink{gene expression}{expression} across most of the (predominantly lobular subtype) \textit{CDH1} breast cancer samples. However, a major exception to this in the \textit{CDH1} \glslink{gene expression}{expression} analysis were the normal tissue samples which were excluded from the \gls{mutation} data (as they were not tested for tumour-specific genotypes). This supports \gls{synthetic lethal} interventions being more applicable to \textit{CDH1} \gls{mutant} tumours. There was still considerable correlation structure, particularly among \textit{CDH1} \gls{wild-type} samples, sufficient to distinguish gene clusters. \textcolor{black}{In contrast, the predominantly ductal \textit{CDH1} \gls{wild-type}} basal-like subtype and \gls{ER} negative samples had depleted \glslink{gene expression}{expression} among most candidate \gls{synthetic lethal} partners. This is consistent with \gls{synthetic lethal} interventions only being effective in lobular \gls{ER} positive breast cancers in which they are a more common, as recurrent (\glslink{driver mutation}{driver}) \gls{mutation}. However, the remaining samples are still informative for \gls{synthetic lethal} analysis (by \gls{SLIPT}) as it requires highly expressing \textit{CDH1} samples for comparison.
%\FloatBarrier
The \textit{CDH1} \gls{mutant} samples (as shown in Figure~\ref{fig:slipt_expr}) were predominantly among the low \textit{CDH1} expressing samples, clustering throughout them with similar expression profiles to other samples exhibiting low \textit{CDH1} expression. Thus the molecular profiles of \textit{CDH1} low samples were indistinguishable from \textit{CDH1} \gls{mutant} samples, with the exception of normal samples (that do not have \glslink{somatic}{somatic} \gls{mutation} data available). \textcolor{black}{Many} of the \textit{CDH1} \gls{mutant} samples (in Appendix Figure~\ref{fig:slipt_expr_mtSL}) had among the lowest \textit{CDH1} \glslink{gene expression}{expression}, and some of the \gls{synthetic lethal} partners were also highly expressed in low expressing \textit{CDH1} \gls{wild-type} samples, despite these not being considered as ``inactivated'' by \acrshort{mtSLIPT} analysis.
Together these results support the use of low \textit{CDH1} \glslink{gene expression}{expression} as a strategy for detecting \textit{CDH1} inactivation. This has the benefit of increasing sample size (including samples such as normal tissue which do not have \glslink{somatic}{somatic} \gls{mutation} data available) and increasing the expected number of mutually inactive (low-low) samples for the directional criteria of (mt)\gls{SLIPT} which enables it to better distinguish significant deviations below this (as discussed in Section~\ref{chapt5:compare_methods}). This also circumvents the assumption that all (detected) \glspl{mutation} are inactivating (although synonymous \glspl{mutation} were excluded from the analysis), which may not be the case for several highly expressing \textit{CDH1} \gls{mutant} samples that do not cluster together in Figure~\ref{fig:slipt_expr} or Appendix Figure~\ref{fig:slipt_expr_mtSL}. One of these exhibits among the lowest \glslink{gene expression}{expression} for many predicted \gls{synthetic lethal} partners and would not be vulnerable to inactivation of these genes. As such, correctly genotyping inactivating \glspl{mutation} will be \gls{essential} in clinical practice for \gls{synthetic lethal} targeting of \gls{tumour suppressor} genes, particularly for other genes such as \textit{TP53} where oncogenic and \gls{tumour suppressor} \glspl{mutation} (with different molecular consequences) are both \textcolor{black}{common}. Using \glslink{gene expression}{expression} as a measure of gene function also avoids the \textcolor{black}{assumption} that \glspl{mutation} are \glslink{somatic}{somatic}, rather than \glslink{germline}{germline}, and that gene inactivation occurs by detectable \glspl{mutation}, rather than other mechanisms\textcolor{black}{,} such as epigenetic changes. These \textcolor{black}{factors} may also account for some of the lowly expressing \textit{CDH1} \gls{wild-type} samples clustering with similar profiles to \gls{mutant} samples.
%%appendix
%\label{fig:slipt_expr_mtSL}
%Figure 3. Heatmap of \gls{RNA-Seq} \gls{gene expression} in predicted SL partners of \textit{CDH1} showing distinct subgroups of SL partners and \glslink{edge}{links} between SL partner \glslink{gene expression}{expression} and clinical variables.
\FloatBarrier
\subsubsection{Subgroup Pathway Analysis}
%Table 5. Gene set enrichment results for subgroups of \textit{CDH1} SL partners shows functional variation.
\begin{table*}[!hp]
\caption{Pathways for clusters of \textit{CDH1} partners from SLIPT}
\label{tab:pathway_clusters}
\centering
%\begin{tiny}
%\makebox[\textwidth][c]{
\resizebox{1 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{lccc}
%\caption{Pathway composition for clusters of \textit{CDH1} partners from SLIPT}
%\label{tab:pathway_clusters}
\large{\textbf{Pathways Over-represented in Cluster 1}} & \large{\textbf{Pathway Size}} & \large{\textbf{Cluster Genes}} & \large{\textbf{p-value (\gls{FDR})}} \\ %(833 genes)
\hline
\rowcolor{Cluster_Blue!20}
Collagen formation & 67 & 10 & $4.0 \times 10^{-11}$ \\
\rowcolor{Cluster_Blue!15}
Extracellular matrix organisation & 238 & 21 & $1.8 \times 10^{-9}$ \\
\rowcolor{Cluster_Blue!20}
Collagen biosynthesis and modifying enzymes & 56 & 8 & $1.8 \times 10^{-9}$ \\
\rowcolor{Cluster_Blue!15}
Uptake and actions of bacterial toxins & 22 & 5 & $9.5 \times 10^{-9}$ \\
\rowcolor{Cluster_Blue!20}
Elastic fibre formation & 37 & 6 & $1.9 \times 10^{-8}$ \\
\rowcolor{Cluster_Blue!15}
Muscle contraction & 62 & 7 & $2.4 \times 10^{-7}$ \\
\rowcolor{Cluster_Blue!20}
Fatty acid, triacylglycerol, and ketone body metabolism & 117 & 10 & $4.9 \times 10^{-7}$ \\
\rowcolor{Cluster_Blue!15}
XBP1(S) activates chaperone genes & 51 & 6 & $6.6 \times 10^{-7}$ \\
\rowcolor{Cluster_Blue!20}
IRE1alpha activates chaperones & 54 & 6 & $1.2 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!15}
Neurotoxicity of clostridium toxins & 10 & 3 & $1.3 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!20}
Retrograde neurotrophin signalling & 10 & 3 & $1.3 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!15}
Assembly of collagen fibrils and other multimeric structures & 40 & 5 & $1.9 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!20}
Collagen degradation & 58 & 6 & $2.0 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!15}
Arachidonic acid metabolism & 41 & 5 & $2.1 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!20}
Synthesis of PA & 26 & 4 & $3.0 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!15}
Signalling by NOTCH & 80 & 7 & $3.3 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!20}
Signalling to RAS & 27 & 4 & $3.7 \times 10^{-6}$ \\
\rowcolor{Cluster_Blue!15}
Integrin cell surface interactions & 82 & 7 & $4.2 \times 10^{-6}$ \\
% \rowcolor{Cluster_Blue!20}
% Smooth Muscle Contraction & 28 & 4 & $4.4 \times 10^{-6}$ \\
% \rowcolor{Cluster_Blue!15}
% ECM proteoglycans & 66 & 6 & $6.3 \times 10^{-6}$ \\
\hline
\\
\cellcolor{white} \large{\textbf{Pathways Over-represented in Cluster 2}} & \large{\textbf{Pathway Size}} & \large{\textbf{Cluster Genes}} & \large{\textbf{p-value (\gls{FDR})}} \\ %(833 genes)
\hline
\rowcolor{Cluster_Green!20}
Eukaryotic Translation Elongation & 86 & 75 & $1.1 \times 10^{-181}$ \\
\rowcolor{Cluster_Green!15}
Viral \acrshort{mRNA} Translation & 81 & 72 & $9.8 \times 10^{-179}$ \\
\rowcolor{Cluster_Green!20}
Peptide chain elongation & 83 & 72 & $1.9 \times 10^{-175}$ \\
\rowcolor{Cluster_Green!15}
Eukaryotic Translation Termination & 83 & 72 & $1.9 \times 10^{-175}$ \\
\rowcolor{Cluster_Green!20}
Formation of a pool of free 40S subunits & 93 & 75 & $1.9 \times 10^{-171}$ \\
\rowcolor{Cluster_Green!15}
Nonsense Mediated Decay independent of the Exon Junction Complex & 88 & 72 & $9.9 \times 10^{-168}$ \\
\rowcolor{Cluster_Green!20}
L13a-mediated translational silencing of Ceruloplasmin \glslink{gene expression}{expression} & 103 & 75 & $3.0 \times 10^{-159}$ \\
\rowcolor{Cluster_Green!15}
3' -UTR-mediated translational regulation & 103 & 75 & $3.0 \times 10^{-159}$ \\
\rowcolor{Cluster_Green!20}
Nonsense-Mediated Decay & 103 & 75 & $3.0 \times 10^{-159}$ \\
\rowcolor{Cluster_Green!15}
Nonsense Mediated Decay enhanced by the Exon Junction Complex & 103 & 75 & $3.0 \times 10^{-159}$ \\
\rowcolor{Cluster_Green!20}
SRP-dependent cotranslational protein targeting to membrane & 104 & 75 & $3.2 \times 10^{-158}$ \\
\rowcolor{Cluster_Green!15}
GTP hydrolysis and joining of the 60S ribosomal subunit & 104 & 75 & $3.2 \times 10^{-158}$ \\
\rowcolor{Cluster_Green!20}
Eukaryotic Translation Initiation & 111 & 75 & $4.5 \times 10^{-151}$ \\
\rowcolor{Cluster_Green!15}
Cap-dependent Translation Initiation & 111 & 75 & $4.5 \times 10^{-151}$ \\
\rowcolor{Cluster_Green!20}
Influenza Infection & 117 & 75 & $1.4 \times 10^{-145}$ \\
\rowcolor{Cluster_Green!15}
Influenza Viral \acrshort{RNA} Transcription and Replication & 108 & 72 & $5.7 \times 10^{-145}$ \\
\rowcolor{Cluster_Green!20}
Translation & 141 & 81 & $8.0 \times 10^{-143}$ \\
\rowcolor{Cluster_Green!15}
Influenza Life Cycle & 112 & 72 & $2.3 \times 10^{-141}$ \\
% \rowcolor{Cluster_Green!20}
% Infectious disease & 347 & 103 & $2.2 \times 10^{-95}$ \\
% \rowcolor{Cluster_Green!15}
% Formation of the ternary complex, and subsequently, the 43S complex & 47 & 33 & $6.8 \times 10^{-80}$ \\
\hline
\\
\cellcolor{white} \large{\textbf{Pathways Over-represented in Cluster 3}} & \large{\textbf{Pathway Size}} & \large{\textbf{Cluster Genes}} & \large{\textbf{p-value (\gls{FDR})}} \\ %(833 genes)
\hline
\rowcolor{Cluster_Orange!30}
Adaptive Immune System & 412 & 90 & $6.1 \times 10^{-61}$ \\
\rowcolor{Cluster_Orange!20}
Chemokine receptors bind chemokines & 52 & 27 & $6.7 \times 10^{-56}$ \\
\rowcolor{Cluster_Orange!30}
Generation of second messenger molecules & 29 & 21 & $6.5 \times 10^{-55}$ \\
\rowcolor{Cluster_Orange!20}
Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell & 64 & 29 & $6.5 \times 10^{-55}$ \\
\rowcolor{Cluster_Orange!30}
TCR signalling & 62 & 27 & $8.9 \times 10^{-51}$ \\
\rowcolor{Cluster_Orange!20}
Peptide ligand-binding receptors & 161 & 40 & $1.5 \times 10^{-45}$ \\
\rowcolor{Cluster_Orange!30}
Translocation of ZAP-70 to Immunological synapse & 16 & 14 & $3.1 \times 10^{-43}$ \\
\rowcolor{Cluster_Orange!20}
Costimulation by the CD28 family & 51 & 22 & $4.0 \times 10^{-43}$ \\
\rowcolor{Cluster_Orange!30}
PD-1 signalling & 21 & 15 & $4.0 \times 10^{-41}$ \\
\rowcolor{Cluster_Orange!20}
Class A/1 (Rhodopsin-like receptors) & 258 & 50 & $6.7 \times 10^{-41}$ \\
\rowcolor{Cluster_Orange!30}
Phosphorylation of CD3 and TCR zeta chains & 18 & 14 & $1.3 \times 10^{-40}$ \\
\rowcolor{Cluster_Orange!20}
Interferon gamma signalling & 74 & 24 & $5.0 \times 10^{-39}$ \\
\rowcolor{Cluster_Orange!30}
GPCR ligand binding & 326 & 57 & $1.8 \times 10^{-38}$ \\
\rowcolor{Cluster_Orange!20}
Cytokine Signalling in Immune system & 268 & 48 & $8.9 \times 10^{-37}$ \\
\rowcolor{Cluster_Orange!30}
Downstream TCR signalling & 45 & 18 & $1.8 \times 10^{-35}$ \\
\rowcolor{Cluster_Orange!20}
G$_{\alpha i}$ signalling events & 167 & 33 & $2.2 \times 10^{-33}$ \\
\rowcolor{Cluster_Orange!30}
Cell surface interactions at the vascular wall & 99 & 21 & $1.3 \times 10^{-26}$ \\
\rowcolor{Cluster_Orange!20}
Interferon Signalling & 164 & 28 & $1.7 \times 10^{-26}$ \\
% \rowcolor{Cluster_Orange!30}
% Extracellular matrix organisation & 238 & 35 & $2.7 \times 10^{-25}$ \\
% \rowcolor{Cluster_Orange!20}
% Antigen activates B Cell Receptor leading to generation of second messengers & 32 & 12 & $7.2 \times 10^{-25}$ \\
\hline
\end{tabular}
\begin{tablenotes}
\raggedright %\small
%Pathway over-representation analysis for Reactome \glspl{pathway} with the number of genes in each \glspl{pathway} (Pathway Size), number of genes within the \glspl{pathway} identified (Cluster Genes), and the \gls{pathway} over-representation p-value (adjusted by \gls{FDR}) from the hypergeometric test.
\end{tablenotes}
\end{threeparttable}
}
\end{table*}
\begin{table*}[!ht]
%\caption{Pathways for clusters of \textit{CDH1} partners from SLIPT}
Table~\ref{tab:pathway_clusters}: Pathways for clusters of \textit{CDH1} partners from SLIPT
%\label{tab:pathway_clusters}
\centering
%\begin{tiny}
%\makebox[\textwidth][c]{
\resizebox{1 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{lccc}
\\
\hline
\cellcolor{white} \large{\textbf{Pathways Over-represented in Cluster 4}} & \large{\textbf{Pathway Size}} & \large{\textbf{Cluster Genes}} & \large{\textbf{p-value (\gls{FDR})}} \\ %(833 genes)
\hline
\rowcolor{Cluster_Red!20}
Extracellular matrix organisation & 238 & 48 & $8.0 \times 10^{-41}$ \\
\rowcolor{Cluster_Red!15}
Class A/1 (Rhodopsin-like receptors) & 258 & 47 & $2.8 \times 10^{-36}$ \\
\rowcolor{Cluster_Red!20}
GPCR ligand binding & 326 & 54 & $2.1 \times 10^{-34}$ \\
\rowcolor{Cluster_Red!15}
G$_{\alpha s}$ signalling events & 83 & 22 & $1.4 \times 10^{-31}$ \\
\rowcolor{Cluster_Red!20}
GPCR downstream signalling & 472 & 68 & $1.1 \times 10^{-29}$ \\
\rowcolor{Cluster_Red!15}
Haemostasis & 423 & 61 & $3.3 \times 10^{-29}$ \\
\rowcolor{Cluster_Red!20}
Platelet activation, signalling and aggregation & 180 & 31 & $7.1 \times 10^{-28}$ \\
\rowcolor{Cluster_Red!15}
Binding and Uptake of Ligands by Scavenger Receptors & 40 & 14 & $9.9 \times 10^{-27}$ \\
\rowcolor{Cluster_Red!20}
RA biosynthesis \glspl{pathway} & 22 & 11 & $2.5 \times 10^{-26}$ \\
\rowcolor{Cluster_Red!15}
Response to elevated platelet cytosolic Ca$^{2+}$ & 82 & 19 & $3.0 \times 10^{-26}$ \\
\rowcolor{Cluster_Red!20}
Developmental Biology & 420 & 57 & $3.5 \times 10^{-26}$ \\
\rowcolor{Cluster_Red!15}
G$_{\alpha i}$ signalling events & 167 & 28 & $7.3 \times 10^{-26}$ \\
\rowcolor{Cluster_Red!20}
Platelet degranulation & 77 & 18 & $1.6 \times 10^{-25}$ \\
\rowcolor{Cluster_Red!15}
Gastrin-CREB signalling \glspl{pathway} via PKC and MAPK & 171 & 28 & $2.5 \times 10^{-25}$ \\
\rowcolor{Cluster_Red!20}
Muscle contraction & 62 & 16 & $4.7 \times 10^{-25}$ \\
\rowcolor{Cluster_Red!15}
G$_{\alpha q}$ signalling events & 150 & 25 & $3.2 \times 10^{-24}$ \\
\rowcolor{Cluster_Red!20}
Retinoid metabolism and transport & 34 & 12 & $5.0 \times 10^{-24}$ \\
\rowcolor{Cluster_Red!15}
Phase 1 - Functionalisation of compounds & 67 & 16 & $6.5 \times 10^{-24}$ \\
% \rowcolor{Cluster_Red!20}
% Signalling by Retinoic Acid & 42 & 13 & $6.7 \times 10^{-24}$ \\
% \rowcolor{Cluster_Red!15}
% Degradation of the extracellular matrix & 102 & 19 & $1.4 \times 10^{-22}$ \\
\hline
\end{tabular}
\begin{tablenotes}
\raggedright %\small
Pathway over-representation analysis for Reactome \glspl{pathway} with the number of genes in each \glspl{pathway} (Pathway Size), number of genes within the \glspl{pathway} identified (Cluster Genes), and the \gls{pathway} over-representation p-value (adjusted by \gls{FDR}) from the hypergeometric test.
\end{tablenotes}
\end{threeparttable}
}
\end{table*}
%%committee
Synthetic lethal gene candidates for \textit{CDH1} from \gls{SLIPT} analysis of \gls{RNA-Seq} \gls{gene expression} data were also used for \gls{pathway} over-representation analyses (as described in Section~\ref{methods:enrichment}). The correlation structure in the \glslink{gene expression}{expression} of candidates \gls{synthetic lethal} genes in \textit{CDH1} low tumours (lowest $\sfrac{1}{3}$\textsuperscript{rd} quantile of \glslink{gene expression}{expression}) was examined for distinct biological \glspl{pathway} in subgroups of genes elevated in different clusters of samples. These genes were highly expressed in different samples with their clinical factors including \gls{ER} status and \glspl{intrinsic subtype}, from the \gls{PAM50} procedure \citep{Parker2009} shown in Figure~\ref{fig:slipt_expr}.
%%paper
As shown by the most over-represented \glspl{pathway} in Table~\ref{tab:pathway_clusters}, each correlated cluster of candidate \gls{synthetic lethal} partners of \textit{CDH1} contains functionally different genes. %Each correlated subgroup of \gls{synthetic lethal} candidate genes has markedly different biological functions.
Cluster 1 contains genes with less evidence of over-represented \glspl{pathway} than other clusters, corresponding to less correlation between genes within the cluster, and to it being a relatively small group. While there is some indication that collagen biosynthesis, microfibril elastic fibres, extracellular matrix, and metabolic \glspl{pathway} may be over-represent\-ed in Cluster 1, these results are mainly based on small \glspl{pathway} containing few \gls{synthetic lethal} genes. Genes in Cluster 2 exhibited low \glslink{gene expression}{expression} in normal tissue samples compared to tumour samples (as shown in Figure~\ref{fig:slipt_expr}) and show compelling evidence of over-represent\-ation of post-transcriptional gene regulation and protein translation processes. Similarly, Cluster 3 has over-represent\-ation of immune signalling \glspl{pathway} (including chemokines, secondary messenger, and TCR signalling) and downstream intracellular signalling cascades such as \gls{GPCR} and G$_{\alpha i}$ signalling events. While \glspl{pathway} over-represent\-ation was weaker among genes in Cluster 4, they contained intracellular signalling \glspl{pathway} and were highly expressed in normal samples (in contrast to Cluster 2). Cluster 4 also involved extracellular factors and stimuli such as extracellular matrix, platelet activation, ligand receptors, and retinoic acid signalling.
Based on these results, potential \gls{synthetic lethal} partners of \textit{CDH1} include processes known to be dysregulated in cancer, such as translational, cytoskeletal, and immune processes. Intracellular signalling cascades such as the \glspl{GPCR} and extracellular stimuli for these \glspl{pathway} were also implicated in potential \glspl{synthetic lethal} with \textit{CDH1}.
Similar translational, cytoskeletal, and immune processes were identified among \gls{SLIPT} partners with respect to \textit{CDH1} \gls{mutation}, shown in Appendix Table~\ref{tab:pathway_clusters_mtSL}. While \gls{GPCR} signalling was replicated in \acrshort{mtSLIPT} analysis, there was also stronger over-representation for NOTCH, ERBB2, and PI3K/AKT signalling in \gls{mutation} analysis consistent with these signals being important for proliferation of \textit{CDH1}-deficient tumours. The GCPR and PI3K/AKT \glspl{pathway} are of particular interest as \glspl{pathway} with oncogenic \glspl{mutation} that can be targeted and downstream effects on translation (a strongly supported process across analyses). Extracellular matrix \glspl{pathway} (e.g., elastic fibre formation) were also supported across analyses (in Table~\ref{tab:pathway_clusters} and Appendix Table~\ref{tab:pathway_clusters_mtSL}) consistent with the established cell-cell signalling role of \textit{CDH1} and the importance of the tumour microenvironment for cancer proliferation.
%%appendix
%\label{tab:pathway_clusters_mtSL}
\FloatBarrier
\section{Comparing Synthetic Lethal Gene Candidates} \label{chapt3:compare_SL_genes}
%\subsection{Comparison with differential \glslink{gene expression}{expression}} \label{chapt3:compare_differential_expression}
%A \gls{transcriptome} experiment has been conducted by the Cancer Genetics Laboratory to test their \textit{CDH1}$^{-/-}$ null MCF10A cell lines compared to an otherwise isogenic \gls{wild-type} \citep{Chen2014}. While differential \glslink{gene expression}{expression} analysis was inconclusive due to few technical replicates, this data was also useful to determine genes which were not detectable in MCF10A cell lines which would not be expected to detect \glspl{synthetic lethal} in \gls{siRNA} screen data even if they were predicted to be \gls{synthetic lethal} in \glslink{gene expression}{expression} data.
\subsection{Primary siRNA Screen Candidates} \label{chapt3:primary_screen}
Gene candidates were compared between computational (\gls{SLIPT} in \gls{TCGA} breast cancer data) and experimental (the primary \gls{siRNA} screen performed by \citet{Telford2015}) approaches in Figure~\ref{fig:Venn_allgenes}. The number of genes detected by both methods did not produce a significant overlap but these may be difficult to compare due to vast differences between the detection methods. There were similar issues in the comparison of \acrshort{mtSLIPT} genes tested against \textit{CDH1} \glspl{mutation} (in Appendix Figure~\ref{fig:Venn_allgenes_mtSL}), despite excluding genes not tested by both methods in either test. However, these intersecting genes may still be functionally informative or amenable to drug triage as they were replicated across both methods and \glspl{pathway} over-represent\-ation differed between the sections of the Venn diagram (as shown in Figure~\ref{fig:Venn_allgenes}).
\begin{figure}[!p]
%\begin{mdframed}
\centering
\resizebox{0.6 \columnwidth}{!}{
\includegraphics{Venn_exprSL_siRNA_allgenes_reduced_Pub.png}
}
\caption[Comparison of SLIPT with siRNA]{\small \textbf{Comparison of \gls{SLIPT} with \gls{siRNA}.} Testing the overlap of gene candidates for \gls{E-cadherin} \gls{synthetic lethal} partners between computational (\gls{SLIPT}) and experimental screening (\gls{siRNA}) approaches. The $\chi^2$ test suggests that the overlap is no more than would be expected by chance ($p = 0.281$). Only genes tested by both methods were included. %A Venn diagram of all 16298 genes tested by both approaches.
}
\label{fig:Venn_allgenes}
%\end{mdframed}
\end{figure}
%%appendix
%\label{fig:Venn_allgenes_mtSL}
%\FloatBarrier
\begin{figure*}[!p]
%\begin{mdframed}
\begin{center}
\resizebox{1.0 \textwidth}{!}{
\includegraphics{exprSLIPT_siRNA_vs_Correlation_with_CDH1_nlp.pdf}
}
\end{center}
\caption[Comparison of SLIPT and siRNA genes with correlation]{\small \textbf{Comparison of \gls{SLIPT} and \gls{siRNA} genes with correlation.} The $\chi^2$ p-values for genes tested by \gls{SLIPT} (in \gls{TCGA} breast cancer) \glslink{gene expression}{expression} analysis were compared against Pearson correlation of \gls{gene expression} with \textit{CDH1}. Genes detected by \gls{SLIPT} or \gls{siRNA} are coloured according to the legend.
}
\label{fig:compare_points_correlation_SL}
%\end{mdframed}
\end{figure*}
\subsection{Comparison with Correlation} \label{chapt3:compare_correlation}
Another potential means to triage drug target candidates is by correlation of \glslink{gene expression}{expression} profiles with \textit{CDH1}. Correlation with \textit{CDH1} was compared to \gls{SLIPT} and \gls{siRNA} results in Figure~\ref{fig:compare_points_correlation_SL}. As expected, the genes \textcolor{black}{not detected by \gls{SLIPT} (including \gls{siRNA} candidates)} were distributed around a correlation of zero. Genes with higher correlation with \textit{CDH1} (either direction) \textcolor{black}{were more significant.} The majority of \gls{SLIPT} candidates had negative correlations, particularly genes detected by both approaches, although these were typically weak correlations and are unlikely to be sufficient to detect such genes on their own. This is supported by simulation results in Section~\ref{chapt5:compare_methods}.
%\label{fig:compare_points_correlation_SL}
There were not \textcolor{black}{many} strong positive correlations with \textit{CDH1} among \gls{siRNA} candidates, consistent with previous findings that co-expression was not predictive of \glspl{synthetic lethal} \citep{Jerby2014, Lu2015}. Negative correlation may not be indicative of \glspl{synthetic lethal} either as many \gls{siRNA} candidates also had positive \textcolor{black}{correlation with \textit{CDH1}}. The \gls{SLIPT} methodology has \textcolor{black}{therefore been} shown to detect genes with both positive and negative correlations, although it does appear to preferentially detect negatively correlated genes to some extent. These findings were replicated with the \acrshort{mtSLIPT} approach against \textit{CDH1} \gls{mutation} (in Appendix Figure~\ref{fig:compare_points_correlation_mtSL}), although the range of the $\chi^2$ p-values differs due to lower sample size for \gls{mutation} analysis.
\FloatBarrier
The apparent tendency for genes detected by \gls{SLIPT} or \gls{siRNA} to have negative correlations with \textit{CDH1} \glslink{gene expression}{expression} was not due to the smaller number of genes in these groups. The distribution of \textit{CDH1} correlations differed across these gene groups (as shown by Figure~\ref{fig:compare_correlation_SL} and Appendix Figure~\ref{fig:compare_correlation_mtSL}) and tended to be lower in \gls{SLIPT} candidates (as supported by \gls{ANOVA} in Table~\ref{tab:compare_correlation_SL}). However, these are relatively weak correlations and further triage of gene candidates by correlation is not suitable. The genes detected both \gls{SLIPT} and \gls{siRNA} did not differ from \gls{SLIPT} genes and the number of positively correlated \gls{SLIPT} genes was very small.The use of correlation itself is also less effective than \gls{SLIPT} to predict \gls{synthetic lethal} partners in the first place (as shown in Section~\ref{chapt5:compare_correlation}).
\begin{figure*}[!htp]
%\begin{mdframed}
\begin{center}
\resizebox{0.75 \textwidth}{!}{
\includegraphics{vioplotx_exprSLIPT_siRNA_vs_CDH1_Correlation_with_CDH1.pdf}
}
\end{center}
\caption[Comparison of SLIPT and siRNA genes with correlation]{\small \textbf{Comparison of \gls{SLIPT} and \gls{siRNA} genes with correlation.} Genes detected as candidate \gls{synthetic lethal} partners by \gls{SLIPT} (in \gls{TCGA} breast cancer) \glslink{gene expression}{expression} analysis and experimental screening (with \gls{siRNA}) were compared against Pearson correlation of \gls{gene expression} with \textit{CDH1}. There were significant differences in correlation between gene groups (as shown in Table~\ref{tab:compare_correlation_SL}).
}
\label{fig:compare_correlation_SL}
%\end{mdframed}
\end{figure*}
\begin{table*}[!htb]
\caption{\acrshort{ANOVA} for synthetic lethality and correlation with \textit{CDH1}}
\label{tab:compare_correlation_SL}
\noindent\makebox[\textwidth][c]{% %centering
\resizebox{0.75 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{lccccc}
\hline
& DF & Sum Squares & Mean Squares & F-value & p-value \\
\hline
\rowcolor{black!10}
siRNA & 1 & 0.027 & 0.027 & 2.8209 & 0.09306 \\
\rowcolor{black!5}
SLIPT & 1 & 134.603 & 134.603 & 14115.9824 & $<$0.0001 \\
\rowcolor{black!10}
siRNA$\times$SLIPT & 1 & $7.14 \times 10^{-5}$ & $9.54 \times 10^{-3}$ & 0.0073 & 0.93212 \\
\hline
\end{tabular}
\begin{tablenotes}
\raggedright \small
Analysis of variance for correlation with \textit{CDH1} against \gls{synthetic lethal} detection approaches (with an interaction term). Only genes tested by both methods were included in this analysis.
\end{tablenotes}
\end{threeparttable}
}
}
\end{table*}
\FloatBarrier
\subsection{Comparison with Primary Screen Viability} \label{chapt3:compare_viability}
A similar comparison of \gls{SLIPT} results was made with the viability ratio (\textit{CDH1}$^{-/-}$ \gls{mutant} to \gls{wild-type}) of MCF10A cells in the primary \gls{siRNA} screen performed by \citet{Telford2015}. The significance and viability thresholds used for \gls{SLIPT} and \gls{siRNA} detection of \gls{synthetic lethal} candidate partners of \textit{CDH1} are shown in Figure~\ref{fig:compare_points_viability_SL}. Not all of the genes below the viability thresholds were necessarily selected to be candidate partners, however, as additional criteria were used in each case: directional criteria as for \gls{SLIPT} (in Section~\ref{methods:SLIPT}) and minimum \gls{wild-type} viability for \gls{siRNA} \citep{Telford2015}.
\begin{figure*}[!htbp]
%\begin{mdframed}
\begin{center}
\resizebox{1.0 \textwidth}{!}{
\includegraphics{exprSLIPT_siRNA_vs_Viability_Ratio_with_CDH1_nlp.pdf}
}
\end{center}
\caption[Comparison of SLIPT and siRNA genes with screen viability]{\small \textbf{Comparison of \gls{SLIPT} and \gls{siRNA} genes with screen viability.} The $\chi^2$ p-values (log-scale) for genes tested by \gls{SLIPT} (in \gls{TCGA} breast cancer) were compared against the viability ratio of \textit{CDH1} \gls{mutant} and \gls{wild-type} cells in the primary \gls{siRNA} screen. Genes detected by \gls{SLIPT} or \gls{siRNA} are coloured according to the legend. Lines show the thresholds of significance with \gls{SLIPT} \textcolor{black}{($p<0.05$)} and of viability \textcolor{black}{(ratio $<0.75$ or $<0.85$)} used by \citet{Telford2015}. % with a grey line for $p=0.05$.
}
\label{fig:compare_points_viability_SL}
%\end{mdframed}
%\end{figure*}
%\begin{figure*}[!htbp]
%\begin{mdframed}
\begin{center}
\resizebox{0.6 \textwidth}{!}{
\includegraphics{vioplotx_exprSLIPT_siRNA_vs_Viability_Ratio_with_CDH1.pdf}
}
\end{center}
\caption[Comparison of SLIPT genes with siRNA screen viability]{\small \textbf{Comparison of \gls{SLIPT} genes with \gls{siRNA} screen viability.} Genes detected as candidate \gls{synthetic lethal} partners by \gls{SLIPT} (in \gls{TCGA} breast cancer) \glslink{gene expression}{expression} analysis were compared against the viability ratio of \textit{CDH1} \gls{mutant} and \gls{wild-type} cells in the primary \gls{siRNA} screen. %There were clear no differences in viability between genes detected by \gls{SLIPT} and those not detected.
\textcolor{black}{The genes identified by \gls{SLIPT} had a higher viability ratio (by t-test: $t=2.1553$, $p=0.03117$), although the effect size was relatively small (mean SLIPT$-$ 1.029, mean SLIPT$+$ 1.037).} % with the differences being primarily due to viability thresholds being used to detect \glspl{synthetic lethal} by \citet{Telford2015}.
}
\label{fig:compare_viability_SL}
%\end{mdframed}
\end{figure*}
There does not appear to be a clear relationship between \gls{SLIPT} and \gls{siRNA} candidates. The genes detected by one approach but not the other were numerous in Figure~\ref{fig:Venn_allgenes} and Appendix Figure~\ref{fig:Venn_allgenes_mtSL}. \textcolor{black}{The} genes detected by one approach are not necessarily near the thresholds for the other. In this respect, the \gls{SLIPT} approach with patient data and the \gls{siRNA} cell line experiments are independent means to identify \gls{synthetic lethal} candidates. While genes detected by both approaches were not necessarily more strongly supported by either, the genes with a viability closer to 1 (no \gls{synthetic lethal} effect) in \gls{siRNA} included those with more significant \gls{SLIPT} p-values, whereas more extreme viability ratios tended to be less significant (as shown by Figure~\ref{fig:compare_points_viability_SL}). However, it should be noted that genes with more moderate viability ratios were more common and \gls{SLIPT} was capable (despite adjusting for multiple testing) of detecting significant genes with extreme viability ratios, particularly those considerably lower than 1.
%
Lower viability ratios were used by \citet{Telford2015} to detect \gls{synthetic lethal} candidates in the primary screen. However, there was little support for \gls{SLIPT} candidates differing with respect to viability ratio (as shown in Figures~\ref{fig:compare_viability_SL} and~\ref{fig:compare_viability_mtSL}) %However, the genes identified by \gls{SLIPT} had a higher mean viability ratio (by t-test: $t=2.1553$, $p=0.03117$). However, the effect size was small (mean SLIPT$-$ 1.029, mean SLIPT$+$ 1.037)
and the vast majority of \gls{SLIPT} candidate genes did not have different viability in the primary screen to genes not identified by \gls{SLIPT}.
\FloatBarrier
\subsection{Comparison with Secondary siRNA Screen Validation}
\label{chapt3:secondary_screen}
It should be noted that genes with a lower viability ratio were not necessarily the most strongly supported by experimental screening. The primary screen (with 4 pooled \glspl{siRNA} for each gene) has been used for the majority of comparisons in this thesis because the \gls{genome}-wide panel of target genes screened enables a large number of genes to be compared with \gls{SLIPT} results from \gls{gene expression} and \glslink{somatic}{somatic} \gls{mutation} analysis. A secondary screen was also performed by \citet{Telford2015} on the isogenic MCF10A breast cell lines to validate the individual (i.e., non-pooled) \glspl{siRNA} separately, with the strongest candidates being those exhibiting \gls{synthetic lethal} viability ratios replicated across independently targeting \glspl{siRNA}. The strongest candidates from the primary screen were subject to a further secondary screen for validation by independent replication with 4 gene knockdowns with different targeting \glspl{siRNA}. This was performed for the top 500 candidates (with the lowest viability ratio) from the primary screen: 482 of these genes were also tested by \gls{SLIPT} in breast cancer.% (and the 486 genes tested by \gls{SLIPT} in stomach cancer).
The secondary screen results show that \gls{SLIPT} candidate genes were \textcolor{black}{significantly} ($p=7.49 \times 10^{-3}$ by Fisher's exact test) more likely to be validated with detection by more independently targeting \glspl{siRNA} in the secondary screen. Gene detected by \gls{SLIPT} are thus informative of more robust partner genes, in addition to providing support that these interactions are consistent with \glslink{gene expression}{expression} profiles from heterogeneous patient samples across genetic backgrounds. As shown in Table~\ref{tab:secondary_screen}, there is significant %($p=7.49 \times 10^{-3}$ by Fisher's exact test) %$8.67 \times 10^{-3} by \chi^2 (4 df)
association between \gls{SLIPT} candidates and stronger validations of \gls{siRNA} candidates. Since there were more SLIPT$-$ genes among those not validated and more SLIPT$+$ genes among those validated with several \glspl{siRNA}, this supports the use of SLIPT as a \gls{synthetic lethal} discovery procedure which may augment such screening experiments.
\begin{table*}[!ht]
\caption{Comparison of \gls{SLIPT} genes against secondary \gls{siRNA} screen} % in breast cancer}
\label{tab:secondary_screen}
\begin{center}
\resizebox{0.6 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{>{\cellcolor{white}}rrcccccl}
& & \multicolumn{5}{c}{\bfseries Secondary Screen} & \\ \cline{3-7}
\rowcolor{black!10}
& \multicolumn{1}{r|}{\cellcolor{white} \textbf{siRNAs}\tnote{*}} & 0/4 & 1/4 & 2/4 & 3/4 & \multicolumn{1}{c|}{4/4} & \cellcolor{white} \textbf{Total} \\ \cline{2-8}
\rowcolor{black!5}
\multicolumn{1}{r|}{\cellcolor{white}} & \multicolumn{1}{r|}{Observed} & 70 & 46 & 31 & 8 & \multicolumn{1}{c|}{2} & \multicolumn{1}{l|}{} \\
\rowcolor{black!10}
\multicolumn{1}{r|}{\cellcolor{white} \multirow{-2}{*}{\bfseries SLIPT$+$}} & \multicolumn{1}{r|}{Expected} & 85 & 44 & 10 & 4 & \multicolumn{1}{c|}{2} & \multicolumn{1}{l|}{\multirow{-2}{*}{157}} \\ \cline{2-8}
\rowcolor{black!5}
\multicolumn{1}{r|}{\cellcolor{white}} & \multicolumn{1}{r|}{Observed} & 190 & 90 & 31 & 10 & \multicolumn{1}{c|}{4} & \multicolumn{1}{l|}{} \\
\rowcolor{black!10}
\multicolumn{1}{r|}{\cellcolor{white}\multirow{-2}{*}{\bfseries SLIPT$-$}} & \multicolumn{1}{r|}{Expected} & 175 & 91 & 42 & 12 & \multicolumn{1}{c|}{4} & \multicolumn{1}{l|}{\multirow{-2}{*}{325}} \\ \cline{2-8}
\rowcolor{black!5}
\cellcolor{white} & \multicolumn{1}{r|}{\cellcolor{white} \bfseries Total} & \multicolumn{1}{c}{280} & \multicolumn{1}{c}{136} & \multicolumn{1}{c}{52} & \multicolumn{1}{c}{18} & \multicolumn{1}{c|}{6} & \multicolumn{1}{l|}{482} \\ \cline{3-8}
\end{tabular}
\begin{tablenotes}
\raggedright %\small
\item[*] Number of \glspl{siRNA} (targeting the same gene) to successfully reproduce \glspl{synthetic lethal} in MCF10A cells \citep{Telford2015}
\end{tablenotes}
\end{threeparttable}
}
\end{center}
\end{table*}
While the individual genes detected by either approach do not necessarily match (and are potentially false-positives), the biological functions important in \textit{CDH1}-deficient cancers and potential mechanisms for specific targeting of them can be further supported by \glspl{pathway} analysis of the \textcolor{black}{genes} detected by either method. The genes detected by both approaches may therefore be more informative at the \gls{pathway} level, where it is less likely for a \gls{pathway} to be consistently detected by chance. As the \gls{SLIPT} candidates differ from the \gls{siRNA} candidates (in addition to those detected by both approaches which were more likely to be validated), they can provide information about additional mechanisms by which \textit{CDH1}-deficient cancers proliferate, and vulnerabilities that may be exploited against them by using the \gls{synthetic lethal} \glspl{pathway}.
\FloatBarrier
\subsection{Comparison to Primary Screen at Pathway Level} \label{chapt3:compare_pathway}
\begin{table*}[!hp]
\caption{Pathways for \textit{CDH1} partners from SLIPT and \gls{siRNA}}
\label{tab:Venn_over-representation}
\centering
\resizebox{0.8 \textwidth}{!}{
\begin{tabular}{sl^c^c^c}
\rowstyle{\bfseries}
Predicted only by \gls{SLIPT} (4025 genes) & Pathway Size & Genes Identified & p-value (\gls{FDR}) \\
\hline
\rowcolor{Cluster_Red!20}
Eukaryotic Translation Elongation & 80 & 75 & $1.5 \times 10^{-182}$ \\
\rowcolor{Cluster_Red!15}
Peptide chain elongation & 77 & 72 & $2.9 \times 10^{-176}$ \\
\rowcolor{Cluster_Red!20}
Viral \acrshort{mRNA} Translation & 75 & 70 & $4.9 \times 10^{-172}$ \\
\rowcolor{Cluster_Red!15}
Eukaryotic Translation Termination & 76 & 70 & $5.9 \times 10^{-170}$ \\
\rowcolor{Cluster_Red!20}
Formation of a pool of free 40S subunits & 87 & 74 & $9.5 \times 10^{-166}$ \\
\rowcolor{Cluster_Red!15}
Nonsense Mediated Decay independent of the Exon Junction Complex & 81 & 70 & $1.2 \times 10^{-160}$ \\
\rowcolor{Cluster_Red!20}
L13a-mediated translational silencing of Ceruloplasmin \glslink{gene expression}{expression} & 97 & 75 & $3.8 \times 10^{-155}$ \\
\rowcolor{Cluster_Red!15}
3' -UTR-mediated translational regulation & 97 & 75 & $3.8 \times 10^{-155}$ \\
\rowcolor{Cluster_Red!20}
GTP hydrolysis and joining of the 60S ribosomal subunit & 98 & 75 & $6.0 \times 10^{-154}$ \\
\rowcolor{Cluster_Red!15}
Nonsense-Mediated Decay & 96 & 73 & $5.2 \times 10^{-150}$ \\
\rowcolor{Cluster_Red!20}
Nonsense Mediated Decay enhanced by the Exon Junction Complex & 96 & 73 & $5.2 \times 10^{-150}$ \\
\rowcolor{Cluster_Red!15}
SRP-dependent cotranslational protein targeting to membrane & 97 & 73 & $7.8 \times 10^{-149}$ \\
\rowcolor{Cluster_Red!20}
Eukaryotic Translation Initiation & 105 & 75 & $4.7 \times 10^{-146}$ \\
\rowcolor{Cluster_Red!15}
Cap-dependent Translation Initiation & 105 & 75 & $4.7 \times 10^{-146}$ \\
\rowcolor{Cluster_Red!20}
Translation & 133 & 83 & $4.0 \times 10^{-142}$ \\
\rowcolor{Cluster_Red!15}
Influenza Viral \acrshort{RNA} Transcription and Replication & 102 & 71 & $2.9 \times 10^{-137}$ \\
\rowcolor{Cluster_Red!20}
Influenza Infection & 111 & 74 & $3.7 \times 10^{-137}$ \\
\rowcolor{Cluster_Red!15}
Influenza Life Cycle & 106 & 71 & $2.3 \times 10^{-133}$ \\
\rowcolor{Cluster_Red!20}
Infectious disease & 326 & 125 & $4.2 \times 10^{-120}$ \\
\rowcolor{Cluster_Red!15}
Extracellular matrix organisation & 189 & 77 & $5.4 \times 10^{-95}$ \\
\hline
\\
\rowstyle{\bfseries}
Detected only by \gls{siRNA} screen (1599 genes) & Pathway Size & Genes Identified & p-value (\gls{FDR}) \\
\hline
\rowcolor{Cluster_Blue!20}
Class A/1 (Rhodopsin-like receptors) & 282 & 44 & $1.3 \times 10^{-27}$ \\
\rowcolor{Cluster_Blue!15}
GPCR ligand binding & 363 & 52 & $5.8 \times 10^{-26}$ \\
\rowcolor{Cluster_Blue!20}
G$_{\alpha q}$ signalling events & 159 & 26 & $6.7 \times 10^{-23}$ \\
\rowcolor{Cluster_Blue!15}
Gastrin-CREB signalling \glspl{pathway} via PKC and MAPK & 180 & 27 & $2.0 \times 10^{-21}$ \\
\rowcolor{Cluster_Blue!20}
G$_{\alpha i}$ signalling events & 184 & 27 & $5.3 \times 10^{-21}$ \\
\rowcolor{Cluster_Blue!15}
Downstream signal transduction & 146 & 23 & $7.6 \times 10^{-21}$ \\
\rowcolor{Cluster_Blue!20}
Signalling by PDGF & 172 & 25 & $4.0 \times 10^{-20}$ \\
\rowcolor{Cluster_Blue!15}
Peptide ligand-binding receptors & 175 & 25 & $8.5 \times 10^{-20}$ \\
\rowcolor{Cluster_Blue!20}
Signalling by ERBB2 & 146 & 22 & $1.3 \times 10^{-19}$ \\
\rowcolor{Cluster_Blue!15}
DAP12 interactions & 159 & 23 & $2.6 \times 10^{-19}$ \\
\rowcolor{Cluster_Blue!20}
DAP12 signalling & 149 & 22 & $2.7 \times 10^{-19}$ \\
\rowcolor{Cluster_Blue!15}
Organelle biogenesis and maintenance & 264 & 33 & $5.5 \times 10^{-19}$ \\
\rowcolor{Cluster_Blue!20}
Signalling by NGF & 266 & 33 & $8.2 \times 10^{-19}$ \\
\rowcolor{Cluster_Blue!15}
Downstream signalling of activated FGFR1 & 134 & 20 & $1.1 \times 10^{-18}$ \\
\rowcolor{Cluster_Blue!20}
Downstream signalling of activated FGFR2 & 134 & 20 & $1.1 \times 10^{-18}$ \\
\rowcolor{Cluster_Blue!15}
Downstream signalling of activated FGFR3 & 134 & 20 & $1.1 \times 10^{-18}$ \\
\rowcolor{Cluster_Blue!20}
Downstream signalling of activated FGFR4 & 134 & 20 & $1.1 \times 10^{-18}$ \\
\rowcolor{Cluster_Blue!15}
Signalling by FGFR & 146 & 21 & $1.3 \times 10^{-18}$ \\
\rowcolor{Cluster_Blue!20}
Signalling by FGFR1 & 146 & 21 & $1.3 \times 10^{-18}$ \\
\rowcolor{Cluster_Blue!15}
Signalling by FGFR2 & 146 & 21 & $1.3 \times 10^{-18}$ \\
\hline
\\
\rowstyle{\bfseries}
Intersection of \gls{SLIPT} and \gls{siRNA} screen (604 genes) & Pathway Size & Genes Identified & p-value (\gls{FDR}) \\
\hline
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Visual phototransduction & 54 & 9 & $6.9 \times 10^{-10}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
G$_{\alpha s}$ signalling events & 48 & 7 & $1.6 \times 10^{-7}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Retinoid metabolism and transport & 24 & 5 & $1.7 \times 10^{-7}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Acyl chain remodelling of PS & 10 & 3 & $6.5 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Transcriptional regulation of white adipocyte differentiation & 51 & 6 & $6.5 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Chemokine receptors bind chemokines & 22 & 4 & $6.5 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Signalling by NOTCH4 & 11 & 3 & $6.9 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Defective EXT2 causes exostoses 2 & 11 & 3 & $6.9 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Defective EXT1 causes exostoses 1, TRPS2 and CHDS & 11 & 3 & $6.9 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Platelet activation, signalling and aggregation & 146 & 12 & $6.9 \times 10^{-6}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Phase 1 - Functionalisation of compounds & 41 & 5 & $1.3 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Amine ligand-binding receptors & 13 & 3 & $1.7 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Acyl chain remodelling of PE & 14 & 3 & $2.4 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Signalling by GPCR & 300 & 23 & $2.4 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Molecules associated with elastic fibres & 29 & 4 & $2.6 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
DAP12 interactions & 128 & 10 & $2.6 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Cytochrome P$_{450}$ - arranged by substrate type & 30 & 4 & $3.2 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
GPCR ligand binding & 147 & 11 & $3.8 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20!Cluster_Blue!20}
Acyl chain remodelling of PC & 16 & 3 & $4.0 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15!Cluster_Blue!15}
Response to elevated platelet cytosolic Ca$^{2+}$ & 66 & 6 & $4.2 \times 10^{-5}$ \\
\hline
\end{tabular}
}
\end{table*}
These \gls{pathway} over-representation analyses (performed as described in Section~\ref{methods:enrichment}) correspond to genes separated into \gls{SLIPT} or \gls{siRNA} screen candidates unique to either method, or detected by both (Table~\ref{tab:Venn_over-representation}). The \gls{SLIPT}-specific gene candidates were involved most strongly with translational and immune regulatory \glspl{pathway}, although extracellular matrix \glspl{pathway} were also supported. These \glspl{pathway} were largely consistent with those identified in Table~\ref{tab:pathway_exprSL} and in the clustering analysis (Table~\ref{tab:pathway_clusters}). The genes detected only by the \gls{siRNA} screen had over-represent\-ation of cell signalling \glspl{pathway}, including many containing genes known to be involved in cancer (e.g., MAPK, PDGF, ERBB2, and FGFR), with the detection of Class A GPCRs supporting the independent analyses by \citet{Telford2015}. The intersection of computational and experimental \gls{synthetic lethal} partners of \textit{CDH1} had stronger evidence for over-represent\-ation of \gls{GPCR} \glspl{pathway} and more specific subclasses, such as visual phototransduction ($p=6.9 \times 10^{-10}$) and G$_{\alpha s}$ signalling events ($p=1.7 \times 10^{-7}$), than other signalling \glspl{pathway}.
The \glspl{pathway} analysis for \acrshort{mtSLIPT} against \textit{CDH1} \glspl{mutation} (in Table~\ref{tab:Venn_over-representation_mtSL}) had similar results to \gls{SLIPT}, particularly for \acrshort{mtSLIPT}-specific \glspl{pathway}. The specific \glspl{pathway} composition of the intersection of these analyses differed from \gls{SLIPT} against low \textit{CDH1} \glslink{gene expression}{expression}. However, signalling \glspl{pathway} were also detected, including \glspl{GPCR}, NOTCH, EERB2, PDGF, and SCF-KIT. These findings indicate the signalling \glspl{pathway} are among the most suitable vulnerability to exploit in targeting \textit{CDH1}-deficient tumours as they can be detected in both a patient cohort (with \gls{TCGA} \glslink{gene expression}{expression} data) and experimentally tested by inhibition with \gls{siRNA} or drugs \citep{Telford2015}. However, it is possible that the \gls{siRNA} screen, that was conducted in an isolated experimental system of MCF10A cell lines, was pre-disposed to preferentially detect kinase signalling \glspl{pathway} (which are amenable to pharmacological inhibition and clinical application). Nevertheless, the other \glspl{pathway} identified by \gls{SLIPT} may still be informative of the role of \textit{CDH1} loss of function in cancers or mechanisms by which further gene loss leads to specific inviability.
%%appendix
%\label{tab:Venn_over-representation_mtSL}
\FloatBarrier
\subsubsection{Resampling Genes for Pathway Enrichment} \label{chapt3:compare_pathway_perm}
A high number of significantly over-represented \glspl{pathway} were detected between \gls{SLIPT} in \gls{TCGA} \glslink{gene expression}{expression} data and \gls{siRNA} genes despite relatively few genes being detected by both approaches. These strongly supported \glspl{pathway} are not unexpected, since \gls{synthetic lethal} \glspl{pathway} are more robustly conserved \citep{Dixon2008} and the computational approach using patient samples from complex tumour micro-environment has considerably different strengths to an experimental screen \citep{Telford2015} based on genetically homogenous cell line models in an isolated laboratory environment. For instance, it is unlikely for immune signalling to be detected in an isolated cell culture system.
While many \glspl{pathway} were highly over-represented in the genes detected by both \gls{SLIPT} and \gls{siRNA}, some of these \glspl{pathway} were also highly over-represented in the \gls{siRNA} candidate genes and these may not reflect the results of \gls{SLIPT} in expression data. A resampling approach (as described in Section~\ref{methods:permutation}) was used to assess whether \gls{SLIPT} and the \gls{siRNA} candidate genes had more frequently over-represented \glspl{pathway} than expected by chance. This resampling procedure tests whether \glspl{pathway} were over-represented in the genes detected by both approaches beyond that expected from any subset of \gls{siRNA} candidates. Thus resampling can determine whether \gls{SLIPT} independently supports these putative \gls{synthetic lethal} \glspl{pathway} (without assuming an underlying test statistic distribution).
A resampling approach is also applicable to testing whether the number of genes detected by each approach significantly intersected. As shown in Figure~\ref{fig:perm_sample}, resampling did not find evidence of significant depletion or over-represent\-ation for experimental \gls{synthetic lethal} candidate genes in the computationally predicted \gls{synthetic lethal} partners of \textit{CDH1}, and thus the observed overlap may be due to chance. This is consistent with previous findings (as shown in Figure~\ref{fig:Venn_allgenes}) and does not preclude \glspl{pathway} relationships being supported by resampling.
\begin{figure}[!ht]
%\begin{mdframed}
\centering
\resizebox{0.66 \columnwidth}{!}{
\includegraphics{sample_size_dist_exprSL_1M_Pub.png}
}
\caption[Resampled intersection of SLIPT and siRNA candidate genes]{\small \textbf{Resampled intersection of \gls{SLIPT} and \gls{siRNA} candidate genes.} Resampling analysis of intersect size from genes detected by \gls{SLIPT} and \gls{siRNA} screening approaches over 1 million replicates. The proportion of expected intersection sizes for random samples below or above the observed intersection size respectively, lacking significant over-represent\-ation or depletion of \gls{siRNA} screen candidates within the \gls{SLIPT} predictions for \textit{CDH1}.
%However, the \glspl{pathway} composition of this intersect may still be informative. %%covered in text
}
\label{fig:perm_sample}
%\end{mdframed}
\end{figure}
A permutation analysis was performed to resample the genes tested by both approaches to investigate whether the observed \glspl{pathway} over-represent\-ation could have occurred in a randomly selected sample of genes from the experimental candidates, that is, whether the \glspl{pathway} predictions from \gls{SLIPT} could be expected by chance (as described in Sections~\ref{methods:venn_analysis} and~\ref{methods:permutation}).
%The observed 604 genes detected by both approaches (as shown in Figure~\ref{fig:Venn_allgenes}) was not a significantly over-represent\-ation ($p = 0.12982$) or depletion ($p = 0.85841$) of computationally predicted \gls{synthetic lethal} partners of \textit{CDH1} among experimental \gls{synthetic lethal} candidates (as shown in Figure~\ref{fig:perm_sample}). This reinforces the results of the $\chi^2$ analysis,
While the number of \gls{siRNA} candidate genes also detected by \gls{SLIPT} was not statistically significant ($p=0.281$), this may be due to the vastly different limitations of the approaches and the correlation structure of \gls{gene expression} not being independent (as assumed for multiple testing procedures). The intersection may still be functionally relevant to \textit{CDH1}-deficient cancers, such as the \glspl{pathway} data in Table~\ref{tab:Venn_over-representation}. The resampling analysis for \glspl{pathway} was compared to the \glspl{pathway} over-represent\-ation for \gls{SLIPT} predicted \gls{synthetic lethal} partners in Table~\ref{tab:pathway_perm}. Similarly, the \glspl{pathway} resampling for intersection between \gls{SLIPT} predictions and experimental screen candidates was compared to \glspl{pathway} over-represent\-ation in Table~\ref{tab:pathway_perm_overlap} for intersection with \gls{siRNA} data.
\begin{table*}[!b]
\caption{Pathways for \textit{CDH1} partners from SLIPT}
\label{tab:pathway_perm}
\centering
\resizebox{1 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{sl^c^c}
\rowstyle{\bfseries}
Reactome Pathway & Over-representation & Permutation \\
\hline
\rowcolor{Cluster_Red!20}
\textbf{Eukaryotic Translation Elongation} & $1.3 \times 10^{-207}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Peptide chain elongation} & $5.6 \times 10^{-201}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{Viral \acrshort{mRNA} Translation} & $1.2 \times 10^{-196}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Eukaryotic Translation Termination} & $1.2 \times 10^{-196}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{Formation of a pool of free 40S subunits} & $3.7 \times 10^{-194}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Nonsense Mediated Decay independent of the Exon Junction Complex} & $5.3 \times 10^{-187}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{L13a-mediated translational silencing of Ceruloplasmin \glslink{gene expression}{expression}} & $9.6 \times 10^{-183}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{3' -UTR-mediated translational regulation} & $9.6 \times 10^{-183}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{GTP hydrolysis and joining of the 60S ribosomal subunit} & $1.9 \times 10^{-181}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Nonsense-Mediated Decay} & $6.2 \times 10^{-176}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{Nonsense Mediated Decay enhanced by the Exon Junction Complex} & $6.2 \times 10^{-176}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
Adaptive Immune System & $6.5 \times 10^{-174}$ & $0.15753$ \\
\rowcolor{Cluster_Red!20}
\textbf{Eukaryotic Translation Initiation} & $5.7 \times 10^{-173}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Cap-dependent Translation Initiation} & $5.7 \times 10^{-173}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{SRP-dependent cotranslational protein targeting to membrane} & $2.0 \times 10^{-171}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Translation} & $6.1 \times 10^{-170}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
Infectious disease & $1.6 \times 10^{-166}$ & $0.23231$ \\
\rowcolor{Cluster_Red!15}
\textbf{Influenza Infection} & $1.9 \times 10^{-163}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textbf{Influenza Viral \acrshort{RNA} Transcription and Replication} & $1.9 \times 10^{-160}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
\textbf{Influenza Life Cycle} & $2.5 \times 10^{-156}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\textit{Extracellular matrix organisation} & $1.1 \times 10^{-152}$ & $0.071761$ \\
\rowcolor{Cluster_Red!15}
GPCR ligand binding & $1.1 \times 10^{-143}$ & $0.55801$ \\
\rowcolor{Cluster_Red!20}
Class A/1 (Rhodopsin-like receptors) & $1.5 \times 10^{-142}$ & $0.58901$ \\
\rowcolor{Cluster_Red!15}
\textit{GPCR downstream signalling} & $7.6 \times 10^{-140}$ & $0.098357$ \\
\rowcolor{Cluster_Red!20}
Haemostasis & $1.9 \times 10^{-134}$ & $0.27059$ \\
\rowcolor{Cluster_Red!15}
Developmental Biology & $2.0 \times 10^{-123}$ & $0.52737$ \\
\rowcolor{Cluster_Red!20}
Metabolism of lipids and lipoproteins & $3.3 \times 10^{-120}$ & $0.724$ \\
\rowcolor{Cluster_Red!15}
Cytokine Signalling in Immune system & $2.6 \times 10^{-119}$ & $0.39661$ \\
\rowcolor{Cluster_Red!20}
Peptide ligand-binding receptors & $3.7 \times 10^{-109}$ & $0.61102$ \\
\rowcolor{Cluster_Red!15}
\textbf{G$_{\alpha i}$ signalling events} & $8.9 \times 10^{-100}$ & $< 1.241 \times 10^{-5}$ \\
\iffalse
\rowcolor{Cluster_Red!20}
Axon guidance & $1.4 \times 10^{-96}$ & $0.66232$ \\
\rowcolor{Cluster_Red!15}
Platelet activation, signalling and aggregation & $3.7 \times 10^{-94}$ & $0.29662$ \\
\rowcolor{Cluster_Red!20}
Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell & $1.4 \times 10^{-93}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
Formation of the ternary complex, and subsequently, the 43S complex & $7.0 \times 10^{-91}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
Translation initiation complex formation & $9.6 \times 10^{-87}$ & $6.8667 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
Ribosomal scanning and start codon recognition & $9.6 \times 10^{-87}$ & $6.8667 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
\begin{tabular}[c]{@{}l@{}}Activation of the \acrshort{mRNA} upon binding of the cap-binding complex and eIFs,\\and subsequent binding to 43S \end{tabular} & $8.7 \times 10^{-86}$ & $6.8667 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!15}
Chemokine receptors bind chemokines & $5.1 \times 10^{-82}$ & $< 1.241 \times 10^{-5}$ \\
\rowcolor{Cluster_Red!20}
Signalling by NGF & $1.2 \times 10^{-81}$ & $0.37142$ \\
\rowcolor{Cluster_Red!15}
Toll-Like Receptors Cascades & $5.3 \times 10^{-80}$ & $0.63013$ \\
\rowcolor{Cluster_Red!20}
Interferon gamma signalling & $6.3 \times 10^{-80}$ & $0.61493$ \\
\rowcolor{Cluster_Red!15}
Transmembrane transport of small molecules & $5.3 \times 10^{-78}$ & $0.21216$ \\
\rowcolor{Cluster_Red!20}
Signalling by Rho GTPases & $1.1 \times 10^{-77}$ & $0.078314$ \\
\rowcolor{Cluster_Red!15}
Degradation of the extracellular matrix & $7.3 \times 10^{-77}$ & $0.769$ \\
\rowcolor{Cluster_Red!20}
Interferon Signalling & $1.1 \times 10^{-76}$ & $0.18211$ \\
\rowcolor{Cluster_Red!15}
NGF signalling via TRKA from the plasma membrane & $1.4 \times 10^{-74}$ & $0.60076$ \\
\rowcolor{Cluster_Red!20}
Gastrin-CREB signalling \glspl{pathway} via PKC and MAPK & $3.1 \times 10^{-74}$ & $0.93109$ \\
\rowcolor{Cluster_Red!15}
Rho GTPase cycle & $3.2 \times 10^{-73}$ & $0.11446$ \\
\rowcolor{Cluster_Red!20}
DAP12 interactions & $2.0 \times 10^{-71}$ & $0.57671$ \\
\rowcolor{Cluster_Red!15}
Cell surface interactions at the vascular wall & $3.3 \times 10^{-71}$ & $0.66232$ \\
\fi
\hline
\end{tabular}
\begin{tablenotes}
\raggedright %\small
Over-representation (hypergeometric test) and Permutation p-values adjusted for multiple tests across \glspl{pathway} (\gls{FDR}). Significant \glspl{pathway} are marked in bold (\gls{FDR} $ < 0.05$) and italics (\gls{FDR} $ < 0.1$).
\end{tablenotes}
\end{threeparttable}
}
\end{table*}
\begin{table*}[!htp]
\caption{Pathways for \textit{CDH1} partners from SLIPT and \gls{siRNA} primary screen}
\label{tab:pathway_perm_overlap}
\centering
\resizebox{0.9 \textwidth}{!}{
\begin{threeparttable}
\begin{tabular}{sl^c^c}
\rowstyle{\bfseries}
Reactome Pathway & Over-representation & Permutation \\
\hline
\rowcolor{Cluster_Red!20!Cluster_Blue!20}