Cancer Res Treat.  2019 Apr;51(2):672-684. 10.4143/crt.2018.137.

Prediction of Acquired Taxane Resistance Using a Personalized Pathway-Based Machine Learning Method

Affiliations
  • 1Department of Biochemistry, Konkuk University School of Medicine, Seoul, Korea. palelamp@kku.ac.kr

Abstract

PURPOSE
This study was conducted to develop and validate an individualized prediction model for automated detection of acquired taxane resistance (ATR).
MATERIALS AND METHODS
Penalized regression, combinedwith an individualized pathway score algorithm,was applied to construct a predictive model using publically available genomic cohorts of ATR and intrinsic taxane resistance (ITR). To develop a model with enhanced generalizability, we merged multiple ATR studies then updated the learning parameter via robust cross-study validation.
RESULTS
For internal cross-study validation, the ATR model produced a perfect performance with an overall area under the receiver operating curve (AUROC) of 1.000 with an area under the precision-recall curve (AUPRC) of 1.000, a Brier score of 0.007, a sensitivity and a specificity of 100%. The model showed an excellent performance on two independent blind ATR cohorts (overall AUROC of 0.940, AUPRC of 0.940, a Brier score of 0.127). When we applied our algorithm to two large-scale pharmacogenomic resources for ITR, the Cancer Genome Project (CGP) and the Cancer Cell Line Encyclopedia (CCLE), an overall ITR cross-study AUROC was 0.70, which is a far better accuracy than an almost random level reported by previous studies. Furthermore, this model had a high transferability on blind ATR cohorts with an AUROC of 0.69, suggesting that general predictive features may be at work across both ITR and ATR.
CONCLUSION
We successfully constructed a multi-study-derived personalized prediction model for ATR with excellent accuracy, generalizability, and transferability.

Keyword

Taxoids; Paclitaxel; Docetaxel; Drug resistance; Molecular diagnosis; Machine learning

MeSH Terms

Cell Line
Cohort Studies
Drug Resistance
Genome
Humans
Learning
Machine Learning*
Methods*
Paclitaxel
Sensitivity and Specificity
Taxoids
Paclitaxel
Taxoids

Figure

  • Fig. 1. Flow diagram describing the selection process of genomic studies for acquired taxane resistance. GEO, Gene Expression Omnibus; AE, ArrayExpress; PM, PubMed.

  • Fig. 2. Workflow for the development and validation of machine learning model for predicting acquired taxane resistance (ATR). The pipeline consists of three main parts: cross-study normalization, transformation into pathway information and model construction. The study cohort was preprocessed and splited into an internal development and validation cohort and an external blind validation cohort. An empirical Bayes approach (Combat) method was used for cross-study normalization. Transforming gene expression level information into pathway-level score for each individual sample was conducted using three curated pathway databases (Kyoto Encyclopedia of Genes and Genomes [KEGG], Pathway Interaction Database [PID], and BioCarta). Using these pathway-level score matrix, penalized regression model was constructed. Parameter optimization of the prediction model was conducted using leave-one-out cross validation (LOOCV) with Efficient Parameter Selection via Global Optimization (EPSGO) algorithm. QC, quality control; CGP, Cancer Genome Project; PTX, paclitaxel; DTX, docetaxel; CCLE, Cancer Cell Line Encyclopedia; EM, Empirical Bayes Method; PDS, pathway dysregulation scores; PC, principal component; AUROC, area under the receiver operating curve; AUPRC, area under the precision-recall curve; ACC, accuracy.

  • Fig. 3. Multi-study–derived, individualized pathway learning model for predicting acquired taxane resistance (ATR). (A) Pathway deregulation score (PDS) matrix for the three development cohorts (GSE36135, GSE28784, GSE23779). Each row (744 pathway features from 11,520 input gene features) represents the z‐score-normalized PDS for each individual sample in each cohort. The color bars in the bottom indicate drug sunsitivity status, type of taxane and study cohort. (B) An example of principal curve of the pathway. The principal curve is individually learned with each pathways of the development cohorts. The data points and the principal curve are projected onto the three principal components (PCs). The principal curve goes through the cloud of samples and is directed so that control samples (sensitive to taxane) are near the beginning of the curve. (C) Hyperparameter optimization for elastic-net with Efficient Parameter Selection via Global Optimization (EPSGO). Cross-study validation deviance as a function of both tuning hyperparameters α and λ is shown. α controls the tradeoff between the ridge and lasso penalties, whereas λ controls the overall amount of penalization. The red arrow highlights the final EPSGO solution where the deviance is within 1SE of the minimum (α=0.682 and λ=0.004). (D) Heatmap of the pathways with non-zero coefficient. From 744 input pathways, 39 pathways with non-zero coefficients were selected. The names of the final pathways are labelled on the right side of PDS matrix shown in panel A. S, sensitive; DTX, docetaxel; PTX, paclitaxel.

  • Fig. 4. Acquired taxane resistance (ATR)–trained model performances on internal and external validation ATR cohorts. Receiver operating characteristic (ROC) and precision-recall curve are used to show ability to predict. (A) Model performances on internal cross-validation ATR cohorts. (B) Model performances on external blind ATR cohorts. DTX, docetaxel; PTX, paclitaxel; AUC, area under the curve.

  • Fig. 5. Intrinsic taxane resistance (ITR)‒trained model performances on internal (ITR) and external validation (acquired taxane resistance [ATR]) cohorts. Receiver operating characteristic (ROC) and precision-recall curve are used to show ability to predict. (A) Model performances on internal validation of ITR cohorts (CCLE-PTX, CGP-DTX, and CGP-PTX). (B) Model performances on external ATR cohorts. AUC, area under the curve; CCLE, Cancer Cell Line Encyclopedia; PTX, paclitaxel; CGP, Cancer Genome Project; DTX, docetaxel.


Reference

References

1. Nussbaumer S, Bonnabry P, Veuthey JL, Fleury-Souverain S. Analysis of anticancer drugs: a review. Talanta. 2011; 85:2265–89.
Article
2. Haibe-Kains B, El-Hachem N, Birkbak NJ, Jin AC, Beck AH, Aerts HJ, et al. Inconsistency in large pharmacogenomic studies. Nature. 2013; 504:389–93.
Article
3. Bouhaddou M, DiStefano MS, Riesel EA, Carrasco E, Holzapfel HY, Jones DC, et al. Drug response consistency in CCLE and CGP. Nature. 2016; 540:E9–10.
Article
4. Domingo-Domenech J, Vidal SJ, Rodriguez-Bravo V, Castillo-Martin M, Quinn SA, Rodriguez-Barrueco R, et al. Suppression of acquired docetaxel resistance in prostate cancer through depletion of notch- and hedgehog-dependent tumorinitiating cells. Cancer Cell. 2012; 22:373–88.
Article
5. Zwart A, Rechache N. Gene expression data of sensitive, Docetaxel resistant and paclitaxel resistant MDA-MB-231 cells. GEO DataSets. Series GSE28784 [Internet]. National Center for Biotechnology Information; 2011 [cited 2018 May 2]. Available from: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE28784.
6. Landen CN Jr, Goodman B, Katre AA, Steg AD, Nick AM, Stone RL, et al. Targeting aldehyde dehydrogenase cancer stem cells in ovarian cancer. Mol Cancer Ther. 2010; 9:3186–99.
Article
7. Luo W, Schork NJ, Marschke KB, Ng SC, Hermann TW, Zhang J, et al. Identification of polymorphisms associated with hypertriglyceridemia and prolonged survival induced by bexarotene in treating non-small cell lung cancer. Anticancer Res. 2011; 31:2303–11.
8. Marin-Aguilera M, Codony-Servat J, Kalko SG, Fernandez PL, Bermudo R, Buxo E, et al. Identification of docetaxel resistance genes in castration-resistant prostate cancer. Mol Cancer Ther. 2012; 11:329–39.
Article
9. Hughey JJ, Butte AJ. Robust meta-analysis of gene expression using the elastic net. Nucleic Acids Res. 2015; 43:e79.
Article
10. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003; 4:249–64.
Article
11. Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, et al. Missing value estimation methods for DNA microarrays. Bioinformatics. 2001; 17:520–5.
Article
12. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007; 8:118–27.
Article
13. Drier Y, Sheffer M, Domany E. Pathway-based personalized analysis of cancer. Proc Natl Acad Sci U S A. 2013; 110:6388–93.
Article
14. Hastie T, Stuetzle W. Principal curves. J Am Stat Assoc. 1989; 84:502–16.
Article
15. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010; 33:1–22.
Article
16. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Series B Stat Methodol. 2005; 67:301–20.
Article
17. Sill M, Hielscher T, Becker N, Zucknick M. c060: extended inference with Lasso and elastic-net regularized Cox and generalized linear models. J Stat Softw. 2014; 62:1–22.
Article
18. Kuhn M. Building predictive models in R using the caret package. J Stat Softw. 2008; 28:1–26.
19. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000; 28:27–30.
Article
20. Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, Hannay T, et al. PID: the pathway interaction database. Nucleic Acids Res. 2009; 37:D674–9.
Article
21. Nishimura D. BioCarta. Biotech Softw Internet Rep. 2001; 2:117–20.
Article
22. Dong Z, Zhang N, Li C, Wang H, Fang Y, Wang J, et al. Anticancer drug sensitivity prediction in cell lines from baseline gene expression through recursive feature selection. BMC Cancer. 2015; 15:489.
Article
23. Greenberger LM, Sampath D. Resistance to taxanes. In : Teicher BA, editor. Cancer drug resistance. Totowa, NJ: Humana Press;2006. p. 329–58.
24. Zeng L, Kizaka-Kondoh S, Itasaka S, Xie X, Inoue M, Tanimoto K, et al. Hypoxia inducible factor-1 influences sensitivity to paclitaxel of human lung cancer cell lines under normoxic conditions. Cancer Sci. 2007; 98:1394–401.
Article
25. Szakacs G, Paterson JK, Ludwig JA, Booth-Genthe C, Gottesman MM. Targeting multidrug resistance in cancer. Nat Rev Drug Discov. 2006; 5:219–34.
Article
26. Seguin L, Desgrosellier JS, Weis SM, Cheresh DA. Integrins and cancer: regulators of cancer stemness, metastasis, and drug resistance. Trends Cell Biol. 2015; 25:234–40.
Article
Full Text Links
  • CRT
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr