Radiomics features may predict outcome in diffuse large B-cell lymphoma (DLBCL). Currently, multiple segmentation methods are used to calculate metabolic tumor volume (MTV). We assessed the influence of segmentation method on the discriminative power of radiomics features in DLBCL for patient level and for the largest lesion. 50 baseline F-fluorodeoxyglucose positron emission tomography computed tomography (PET/CT) scans of DLBCL patients who progressed or relapsed within 2 years after diagnosis were matched on uptake time and reconstruction method with 50 baseline PET/CT scans of DLBCL patients without progression. Scans were analysed using 6 semi-automatic segmentation methods (standardized uptake value (SUV)4.0, SUV2.5, 41% of the maximum SUV, 50% of the SUVpeak, majority vote (MV)2 and MV3, respectively). Based on these segmentations, 490 radiomics features were extracted at patient level and 486 features for the largest lesion. To quantify the agreement between features extracted from different segmentation methods, the intra-class correlation (ICC) agreement was calculated for each method compared to SUV4.0. The feature space was reduced by deleting features that had high Pearson correlations (≥0.7) with the previously established predictors MTV and/or SUVpeak. Model performance was assessed using stratified repeated cross-validation with 5 folds and 2000 repeats yielding the mean receiver-operating characteristics curve integral (CV-AUC) for all segmentation methods using logistic regression with backward feature selection. The percentage of features yielding an ICC ≥0.75 compared to the SUV4.0 segmentation was lowest for A50P both at patient level and for the largest lesion, with 77.3% and 66.7% of the features yielding an ICC ≥0.75, respectively. Features were not highly correlated with MTV, with at least 435 features at patient level and 409 features for the largest lesion for all segmentation methods with a correlation coefficient <0.7. Features were highly correlated with SUVpeak (at least 190 and 134 were uncorrelated, respectively). CV-AUCs ranged between 0.69±0.11 and 0.84±0.09 for patient level, and between 0.69±0.11 and 0.73±0.10 for lesion level. Even though there are differences in the actual radiomics feature values derived and selected features between segmentation methods, there is no substantial difference in the discriminative power of radiomics features between segmentation methods.
Copyright © 2021 by the Society of Nuclear Medicine and Molecular Imaging, Inc.