Our objectives were to design and validate methods to identify relapse and hematopoietic stem cell transplantation (HSCT) in children with acute lymphoblastic leukemia (ALL) using administrative data representing hospitalizations at US pediatric institutions.
We developed daily billing and ICD-9 code definitions to identify relapses and HSCTs within a cohort of children with newly diagnosed ALL between January 1, 2004, and December 31, 2013, previously assembled from the Pediatric Health Information System (PHIS) database. Chart review for children with ALL at the Children’s Hospital of Philadelphia (CHOP) and Texas Children’s Hospital (TCH) was performed to establish relapse and HSCT gold standards for sensitivity and positive predictive value (PPV) calculations. We estimated incidences of relapse and HSCT in the PHIS ALL cohort.
We identified 362 CHOP and 314 TCH ALL patients in PHIS and established true positives by chart review. Sensitivity and PPV for identifying both relapse and HSCT in PHIS were > 90% at both hospitals. Five-year relapse incidence in the 10 150-patient PHIS cohort was 10.3% (95% CI 9.8%-10.9%) with 7.1% (6.6%-7.6%) of children underwent HSCTs. Patients in higher-risk demographic groups had higher relapse and HSCT rates. Our analysis also identified differences in incidences of relapse and HSCT by race, ethnicity, and insurance status.
Administrative data can be used to identify relapse and HSCT accurately in children with ALL whether they occur on- or off-therapy, in contrast with published approaches. This method has wide potential applicability for estimating these incidences in pediatric ALL, including patients not enrolled on clinical trials.

© 2020 Wiley Periodicals, Inc.

Author