To (1) use All of Us (AoU) data to validate a previously published single-center model predicting need for surgery among individuals with glaucoma, (2) train new models using AoU data, and (3) share insights regarding this novel data source for ophthalmic research.
Development and evaluation of machine learning models.
Electronic health record data were extracted from AoU for 1231 adults diagnosed with primary open-angle glaucoma. The single-center model was applied to AoU data for external validation. AoU data were then used to train new models for predicting need for glaucoma surgery using multivariable logistic regression, artificial neural networks, and random forests. Five-fold cross-validation was performed. Model performance was evaluated based on area under the receiver operating characteristic curve (AUC), accuracy, precision and recall.
The mean (standard deviation) age of the All of Us cohort was 69.1 (10.5) years, with 57.3% women and 33.5% Black, significantly exceeding representation in the single-center cohort (p=0.04 and p<0.001, respectively). Of 1231 participants, 286 (23.2%) needed glaucoma surgery. When applying the single-center model to AoU data, accuracy was 0.69, and AUC was only 0.49. Using AoU data to train new models resulted in superior performance: AUCs ranged from 0.80 (logistic regression) to 0.99 (random forests).
Models trained with national AoU data achieved superior performance compared to using single-center data. Although AoU does not currently include ophthalmic imaging, it offers several strengths over similar big-data sources such as claims data. AoU is a promising new data source for ophthalmic research.