Tuberculosis is the leading cause of death from infectious disease, surpassing even the human immunodeficiency virus (HIV). Loss to follow-up during treatment and irregular medication use contribute to persistent morbidity and mortality. These factors also increase bacillus drug resistance and hinder disease control.
This study aims to develop a computational model that predicts loss to follow-up in tuberculosis treatment, with the goal of increasing treatment adherence and cure, reducing the effort spent on treatment relapses, and decreasing disease spread.
This is a case-control study. The data set comprised 103,846 tuberculosis cases from the state of São Paulo, recorded from 2006 to 2016 in TBWEB, an information system used to monitor tuberculosis treatment. The set was then resampled into 6 segments with a 1:1 ratio, which was used to avoid bias during model construction.
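The abstract does not describe how the 1:1 segments were built. As a rough illustration only, the sketch below assumes that the smaller outcome class is reused in full and paired with disjoint, equally sized random chunks of the larger class. The function name `balanced_segments`, the toy records, and the fixed seed are hypothetical and are not taken from the study.

```python
import random


def balanced_segments(minority, majority, n_segments, seed=0):
    """Build n_segments data sets, each with a 1:1 class ratio.

    Assumption (not from the study): every segment contains all minority
    records plus a disjoint random chunk of majority records of equal size.
    """
    k = len(minority)
    if k == 0 or n_segments * k > len(majority):
        raise ValueError("not enough majority records for disjoint 1:1 segments")
    rng = random.Random(seed)
    shuffled = list(majority)
    rng.shuffle(shuffled)
    segments = []
    for i in range(n_segments):
        # Slice a chunk of the shuffled majority class that no other segment uses.
        chunk = shuffled[i * k:(i + 1) * k]
        segments.append(list(minority) + chunk)
    return segments


# Toy records standing in for patient rows: (outcome, identifier).
lost = [("lost", i) for i in range(5)]
completed = [("completed", i) for i in range(30)]
segments = balanced_segments(lost, completed, n_segments=6)
```

Each resulting segment holds as many "lost" as "completed" records, so a classifier trained on it is not dominated by the more frequent outcome.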
Classification and Regression Trees (CART) were used as the prediction model. The training and test sets comprised 70% and 30% of the tuberculosis cases, respectively. The model achieved an accuracy of 0.76, an F-measure of 0.77, a sensitivity of 0.80 and a specificity of 0.71. The model highlights the relationships among several variables that previous studies had identified as related to patient cure or loss to follow-up in tuberculosis treatment.
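For readers less familiar with these measures, the sketch below shows how accuracy, F-measure, sensitivity and specificity follow from the four cells of a binary confusion matrix. The counts are hypothetical, chosen so that sensitivity and specificity equal the reported 0.80 and 0.71 on a balanced set of 200 cases. They are not the study's confusion matrix, and the function name `binary_metrics` is illustrative.

```python
def binary_metrics(tp, fp, tn, fn):
    """Compute common binary-classification metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # share of actual positives that were detected
    specificity = tn / (tn + fp)   # share of actual negatives that were detected
    precision = tp / (tp + fp)     # share of predicted positives that were correct
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    # F-measure (F1): harmonic mean of precision and sensitivity (recall).
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "f_measure": f_measure,
        "sensitivity": sensitivity,
        "specificity": specificity,
    }


# Hypothetical counts for illustration only (100 positives, 100 negatives).
metrics = binary_metrics(tp=80, fp=29, tn=71, fn=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Which outcome is treated as the positive class changes which figure is called sensitivity and which is called specificity, so the choice should be stated when reporting such results.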
It was possible to construct a predictive model for loss to follow-up in tuberculosis treatment using Classification and Regression Trees. Although ideal predictive ability was not achieved, it seems reasonable to propose the use of Classification and Regression Trees models to predict the likelihood of treatment follow-up, supporting healthcare professionals in minimising loss to follow-up.

Copyright © 2020 Elsevier B.V. All rights reserved.

Author