MFmap : a semi-supervised generative model matching cell lines to tumours and cancer subtypes
PloS One. Bd. 16. H. 12. San Francisco, CA: Public Library of Science 2021 e026118
Erscheinungsjahr: 2021
ISBN/ISSN: 1932-6203
Publikationstyp: Zeitschriftenaufsatz (Forschungsbericht)
Sprache: Englisch
Doi/URN: 10.1371/journal.pone.0261183
Inhaltszusammenfassung
Translating in vitro results from experiments with cancer cell lines to clinical applications requires the selection of appropriate cell line models. Here we present MFmap (model fidelity map), a machine learning model to simultaneously predict the cancer subtype of a cell line and its similarity to an individual tumour sample. The MFmap is a semi-supervised generative model, which compresses high dimensional gene expression, copy number variation and mutation data into cancer subtype informe...Translating in vitro results from experiments with cancer cell lines to clinical applications requires the selection of appropriate cell line models. Here we present MFmap (model fidelity map), a machine learning model to simultaneously predict the cancer subtype of a cell line and its similarity to an individual tumour sample. The MFmap is a semi-supervised generative model, which compresses high dimensional gene expression, copy number variation and mutation data into cancer subtype informed low dimensional latent representations. The accuracy (test set F1 score > 90%) of the MFmap subtype prediction is validated in ten different cancer datasets. We use breast cancer and glioblastoma cohorts as examples to show how subtype specific drug sensitivity can be translated to individual tumour samples. The low dimensional latent representations extracted by MFmap explain known and novel subtype specific features and enable the analysis of cell-state transformations between different subtypes. From a methodological perspective, we report that MFmap is a semi-supervised method which simultaneously achieves good generative and predictive performance and thus opens opportunities in other areas of computational biology.» weiterlesen» einklappen
Autoren
Klassifikation
DFG Fachgebiet:
Medizin
DDC Sachgruppe:
Informatik