Identifying Top Gene Contributors to Triple Negative Breast Cancer Health Disparities Among African American Women: A Machine Learning Approach

Date

2019-03-05

Authors

Liu, Jin
Radler, Charlene
Hayatshahi, Hamed
Morid, Mohammad Amin
Green, Amyia
Fluker, Kenneth Jr.
Ahuactzin, Emilio

ORCID

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Purpose: Triple Negative Breast Cancer (TNBC) is a breast cancer subtype which multiple studies have shown to be disproportionately prevalent among premenopausal African American women. The factors contributing to the TNBC health disparities remain unclear. Methods: Here, we developed a highly accurate, reproducible machine learning classification model that used patient gene expression values as predictor attributes to classify 100 TNBC patients as either African American or non-African American. Results: By using weighting methods and comparison of classification performance at varying levels of attributes, our study identified a subset of genes able to accurately classify TNBC patients by race. Intriguingly, the top genes of this subset are linked to diabetes, indicating that diabetes may associate with the TNBC health disparities. Conclusions: Our study demonstrated the factors contributing the TNBC health disparities and provided a subset of genes that may be targetable for precision medicine development to address disparity of TNBC among the African American female population.

Description

Keywords

Citation

Collections