Jian-Dong Yin, Li-Rong Song, He-Cheng Lu, Xu Zheng
Abstract
Key words: Rectal cancer; Diffusion weighted imaging; Apparent diffusion coefficient;Texture analysis
Colorectal cancer is the third leading cause of cancer worldwide, and rectal cancer accounts for approximately 30%-35% of colorectal cancer cases[1,2]. Advances in surgical techniques, chemotherapy and radiation therapy regimens have led to decreased local recurrence rates and mortality[3,4]. Available treatment options vary by tumor stage. The extent of tumor invasion into the bowel wall (pathological T stage)and the number of lymph nodes affected by the lesions (pathological N stage) are important prognostic factors for local recurrence and overall survival[5]. Rectal cancer guidelines from the National Comprehensive Cancer Network recommend neoadjuvant chemoradiotherapy (NAT) for patients with lymph node involvement before surgery[6]. Compared with surgery alone, the use of NAT followed by surgical resection for locally advanced rectal carcinoma (stage T3-4 and/or N1-2) has been shown to be associated with a 50%-61% reduction in the risk of local recurrence[7,8].Hence, an accurate evaluation of T and N stage is essential for treatment planning.
High-resolution magnetic resonance imaging (MRI) has been widely recommended for local staging of primary rectal cancer before treatment, but has some limitations[9].It is difficult to distinguish T2 from the T3 stage of tumors as the peritumoral inflammatory reaction is similar to tumor penetration through the muscular rectal wall[10]. In addition, the detection of tissue edema, fibrosis and inflammation can be less accurate after NAT[11]. Also, preoperative detection of nodal involvement with morphological criteria such as short-axis diameter, shape, border smoothness and signal heterogeneity is another challenge[12,13]. Thus, improved techniques for T and N staging are of great importance.
Diffusion-weighted imaging (DWI) is a functional MRI technique for detecting the movement of water molecules in the extracellular space, which could reflect the varying cellularity of a tumor. Previous studies have indicated that the apparent diffusion coefficient (ADC) calculated from DWI could be a valuable imaging biomarker of tumor property[14]. Earlier research findings also suggested that ADC values were helpful in the detection of rectal cancers as well as the prediction of pathological complete response after NAT[15,16]. However, the ADC calculation only described the average, maximum, or minimum signal intensity within the tumors,failing to reflect their heterogeneity. Due to the intrinsic chaotic environment of the tumor, ADC values are not sufficiently sensitive to small changes or the precise status of tumors[17].
Texture analysis, which has emerged as one of the “radiomics” approaches for interpretation of medical imaging, is a tool for extracting quantitative features by measuring the spatial variation of gray levels on a pixel-by-pixel basis within given images[18]. It provides a more objective method to characterize tissue heterogeneity within the lesion that is closely correlated with tumor grading and staging. Some textures extracted from MRI or computed tomography (CT) have shown potential in several aspects, such as distinguishing benign from malignant lesions, staging preoperative cancer (N or T stage), predicting gene expression type and forecasting the result of tumor treatment[19-21]. It has been suggested that skewness and kurtosis differ significantly in cervical cancer between positivevsnegative pelvic lymph node metastatic status[22]. In addition, contrast and difference in variance showed higher values for nonresponders than partial responders to chemotherapy[23]. It has also been shown that lower entropy, higher uniformity and lower standard deviation were associated with poorer 5-year overall survival in patients with colorectal cancer[24].Moreover, one previous study demonstrated that higher heterogeneity of the tumor was a powerful predictor of pathologic regional lymph node metastasis in esophageal cancer[25].
The heterogeneity within the tumor originating from intratumoral spatial variation in the cellularity, angiogenesis, extravascular extracellular matrix, and necrosis could be captured by DWI images and ADC maps with texture analysis. Therefore, the aim of this study was to investigate whether texture features derived from DWI images and ADC maps were associated with the pathological stage T1-2vsT3-4 and pathological stage N0vsN1-2 in rectal cancer. To our best knowledge, texture analysis of DWI images combined with ADC maps for preoperative staging of rectal cancer has not been documented.
The study was approved by the Ethics Review Board of Shengjing Hospital of China Medical University (2020PS011K), and written informed consent was obtained from each patient.
All cases (n= 362) with rectal MRIs were browsed using the Picture Archiving and Communication System in our institution between September 2018 and November 2019. Inclusion criteria were as follows: (1) Patients with preoperative DWI; (2)Patients who underwent radical resection within 1 month after high-resolution MRI scanning; (3) Patients with rectal adenocarcinoma confirmed by surgical specimen;and (4) Patients with only one lesion identified for subsequent analysis. Initially, the population comprised 168 rectal cancer patients. Fifty-three patients were excluded for the following reasons: (1) Patients underwent NAT or endoscopic biopsy before MRI scanning; (2) Image quality was poor due to apparent motion artifacts on the DWI sequence; (3) Pathological data were incomplete (such as lack of N- or T-staging information); and (4) Patients had pathologically proven mucinous adenocarcinoma which tended to have a low cellular density resulting in high ADC values that might bias results. Finally, 115 eligible patients were selected for subsequent analyses. The clinical characteristics of these patients are shown in Table 1. The flowchart of this study is displayed in Figure 1.
All MRI examinations were performed on a 3.0-Tesla (T) scanner (Ingenia 3.0, Philips Medical System, Best, The Netherlands) with an eight-channel phased-array surface coil in the supine position. There was no bowel preparation or intravenous antispasmodic agents administered. An axial DWI sequence was performed for all patients. The acquisition parameters were as follows: Repetition time/echo time,6000/76 ms; flip angle, 90°; matrix size, 288 × 288; field of view, 450 mm; slices, 48;slice thickness, 5 mm; spacing between slices, 1 mm;bvalues, 0 and 1000 s/mm2.
Table 1 Clinical and pathological characteristics of the patients
ADC maps were generated with MATLAB 2018a (Mathworks, Natick, MA, United States) according to loaded DWI images using the following formula: ADC = (lnSI0-lnSI)/(b-b0) where SI0and SI represent signal intensity atbvalues of 0 and 1000 s/mm2, respectively.
Region of interest (ROI) segmentation was performed independently by two radiologists with 10 years of experience in interpreting pelvic MRIs who were blinded to the pathological results. The ADC maps were imported into image processing software (ImageJ, National Institutes of Health, Bethesda, MD, United States) for segmentation of ROIs. Rectal cancer was determined as a local mass or abnormal wall thickening that showed intermediate intensity of signals on T2WI, hyperintensity on DWI and hypointensity on the ADC map. ROIs were manually delineated along the border of the low signal area on the single slice of the ADC map, which showed the largest tumor diameter with reference to T2WI and DWI. Obvious necrosis, gas and lumen content areas were avoided to minimize bias. The contours of ROIs on ADC maps were copied to the exact same location of the corresponding DWI images (b= 0 andb= 1000).
Texture parameters were extracted from ADC maps, DWIb=0images and DWIb=1000images using in-house software programmed with MATLAB 2018a. Twelve texture features were calculated based on three images for each patient from the gray level cooccurrence matrix, the gray level run-length matrix and wavelet. The mean ADC values, minimum and maximum ADC values (ADCmean, ADCmin, ADCmax) were also calculated. Therefore, a total of 39 features were measured for each patient. A detailed description of these features is provided in Table 2.
Figure 1 Flow chart adopted in this study. MRI: Magnetic resonance imaging; DWI: Diffusion-weighted imaging; NAT: Neoadjuvant chemoradiotherapy; ADC:Apparent diffusion coefficient; ROI: Region of interest.
A Kolmogorov-Smirnov test for each feature was first performed to confirm that the samples followed a normal distribution. If the distribution was normal (P≥ 0.05), an independent samplet-test was used to compare parameters between T1-2 and T3-4 stages, and between N0 and N1-2 stages. Otherwise, the Mann-WhitneyUtest was used[26]. Those significantly different parameters were selected for subsequent analysis. Multivariate logistic regression analysis was performed with the entry of variables to identify independent factors for T3-4 and N1-2 tumors. In addition,Spearman correlation analysis was performed to assess the correlation between features and tumor stages. Receiver operating characteristic (ROC) curve analysis was conducted to evaluate the diagnostic performance of the established logistic models for prediction of T3-4 and N1-2 tumors by calculating the area under the ROC curve(AUC), which was drawn with the professional statistics software MedCalc (version 14.10.20, http://www.medcalc.org/). The corresponding sensitivity and specificity were also calculated. Interobserver variability of texture features extracted between the two radiologists was evaluated using intraclass correlation coefficients (0-0.4, poor agreement; 0.41-0.6, moderate agreement; 0.61-0.8, good agreement; 0.81-1, excellent agreement). All statistical analyses were performed using SPSS 22.0 (IBM, Armonk,NY, United States).P< 0.05 was considered statistically significant.
A randomly selected case was used to illustrate the ROI segmentation results, as shown in Figure 2. The results of texture analysis for identifying T- and N- stage are described as follows:
Table 2 Features measured by different methods
All significantly different texture features between stage T1-2vsT3-4 groups are summarized in Table 3. No significant difference was observed with respect to any of the ADC measurements. The logistic regression model that incorporated features from DWIb=0images that were significantly different between stage T1-2 and T3-4 tumors reached an AUC of 0.710 (sensitivity, 65.48%; specificity, 72.86%; accuracy,70.87%; 95% confidence interval (CI): 0.618-0.791). The logistic regression model that incorporated significantly different features from DWIb=1000images achieved an AUC of 0.688 (sensitivity, 61.90%; specificity, 70.97%; accuracy, 68.52%; 95%CI: 0.595-0.771).In addition, the logistic regression model that incorporated significantly different features from ADC maps achieved an AUC of 0.657 (sensitivity, 45.24%; specificity,83.87%; accuracy, 73.46; 95%CI: 0.563-0.743). The above significantly different features were used as input variables for the multivariate logistic regression analysis. For DWIb=0features, higher dissimilarity, higher sum average, higher information correlation and lower run-length nonuniformity were found to be independent predictors of local invasion (stage T3-4). For DWIb=1000features, higher gray level nonuniformity, higher run percentage and lower run-length nonuniformity were found to be independent predictors. For ADC map features, lower dissimilarity and higher run percentage were determined to be independent predictors. Using the logistic regression model that incorporated these nine features for differentiating stage T3-4 from T1-2 tumors, an AUC of 0.793 was achieved with a sensitivity of 78.57% and a specificity of 74.19%, and the accuracy was 75.37% with 95%CI of 0.707-0.863. The ROC curves are shown in Figure 3. The Spearman correlation coefficients of the predictors for T stage are listed in Table 4. Among these features, run-length nonuniformity from DWIb=1000images showed the highest correlation with T stage (Rs= 0.246,P= 0.008).
The significantly different texture features and ADC values between stage N0 and N1-2 tumors are summarized in Table 5. With regard to the features from ADC maps,ADCmaxand ADCmeanshowed statistically significant differences between the groups(P= 0.011, and 0.001, respectively). The logistic regression model that incorporated statistically significant features extracted from DWIb=0images achieved an AUC of 0.623 (sensitivity, 79.65%; specificity, 42.86%; accuracy, 63.01%; 95%CI: 0.528-0.711).Using the logistic regression model that incorporated significantly different features derived from DWIb=1000images, the AUC reached 0.635 (sensitivity, 57.69%; specificity,66.67%; accuracy, 61.75%; 95%CI: 0.540-0.723). Using the logistic regression model that incorporated significantly different features obtained from ADC maps, the AUC was 0.714 (sensitivity, 75.02%; specificity, 65.08%; accuracy, 70.52%; 95%CI: 0.622-0.794). In addition, multivariate logistic regression analysis showed that lower sum average,lower gray level nonuniformity and higher SymletH from DWIb=0images were independent predictors of nodal involvement (N1-2); from DWIb=1000images, lower sum average, lower information correlation, lower long run low gray level emphasis and higher SymletH were independent predictors; and for ADC maps, the independent predictors were lower ADCmax, lower ADCmeanand lower information correlation. The logistic regression model that incorporated these ten features for distinguishing stage N0 and N1-2 tumors achieved an AUC of 0.802 with a sensitivity of 80.77% and a specificity of 68.25%, and the accuracy was 75.11% with 95%CI of 0.718-0.871. The Spearman correlation coefficients for the predictors of N stage are shown in Table 6. Of these features, ADCmeanshowed the strongest correlation with N stage (Rs= -0.273,P= 0.003). The ROC curves are shown in Figure 4.
Figure 2 The region of interest segmentation results for a randomly selected case. A-C: Represent diffusion-weighted imaging (DWI)b=0, DWIb=1000 and the apparent diffusion coefficient image on the same slice, respectively; D-F: With regard to T2WI and DWI, the lesion region of interest was drawn on the apparent diffusion coefficient map (F) and copied onto the DWIb=0 (D) and DWIb=1000 (E) images.
There was excellent agreement between the ADC measurements and texture features derived from the two sets of ROIs independently delineated by two radiologists based on DWIb0/b1000images and ADC maps. The intraclass correlation coefficients ranged from 0.844 to 0.960.
In the present study, texture analysis was performed based on DWI images and ADC maps, and the correlation between texture features and T/N stage of rectal cancer was investigated. The results demonstrated the potential of texture features for staging rectal cancer.
Extramural invasion and nodal involvement are the main indications for the use of NAT in patients with rectal cancer[8]. Currently, the accuracy of preoperative staging by rectal MRI is still unsatisfactory[27,28]. Therefore, improved techniques for T and N staging may play an important role in determining the best treatment options for patients. Texture analysis provides a method for quantifying the intratumoral heterogeneity based on the distribution of gray level values and spatial arrangement of the pixels. In particular, texture analysis has been acknowledged as a promising tool for distinguishing benign from malignant tumors and in the staging of kidney and cervical cancer[29,30]. However, few studies on rectal MRI using texture analysis have been conducted to identify noninvasive independent predictors of high T stage and positive nodal status[31]. Furthermore, the independent predictors identified in these studies were derived from conventional morphological images, while our study mainly concentrated on functional DWI images (b= 0/b= 1000) and ADC maps.
In this study, eight texture features (run-length nonuniformity, information
correlation, SymletH, long run low gray level emphasis, sum average, gray level nonuniformity, dissimilarity and run percentage) presented statistically significant differences between groups and were employed for predicting T/N stage of rectal cancer. Theoretically, run-length nonuniformity means similarity of the length of runs derived from gray level run-length matrix, information correlation means nonlinear gray level dependence derived from gray level co-occurrence matrix, SymletH means horizontal components of symlet transform derived from Wavelet, long run low gray level emphasis means distribution of long runs and high gray level derived from gray level run-length matrix, sum average means over brightness derived from gray level co-occurrence matrix, gray level nonuniformity means similarity of the gray level value derived from gray level run-length matrix, dissimilarity means local contrast derived from gray level co-occurrence matrix, and run percentage means the homogeneity and the distribution derived from gray level run-length matrix,respectively.
Table 3 Comparison of extracted features between T1-2 and T3-4 stage groups of rectal cancers
In the present study, we found that six texture features from DWIb=0images, three features from DWIb=1000images and three features from ADC maps were significantly different between stage T1-2 and T3-4 tumors. Moreover, it was found that higher dissimilarity, higher sum average, higher information correlation and lower runlength nonuniformity from DWIb=0images were independent predictors of local invasion, as were higher gray level nonuniformity, higher run percentage and lower run-length nonuniformity from DWIb=1000images, while lower dissimilarity and higher run percentage from ADC maps were also independent predictors. The performance (AUC = 0.793) of the model that contained these independent predictors derived by logistic regression analysis was more favorable than the performance of the models that contained significant features independently extracted from the two types of DWI images and ADC maps. Furthermore, there were no significant differences between ADC measurements (ADCmean, ADCminor ADCmax) for the T1-2 and T3-4 groups in our study. These findings are consistent with earlier reports by Liuet al[31]and Attenbergeret al[32]. In fact, routine measurements of ADC are just conducted by calculating the intensities within a ROI, and the heterogenous intensities of different areas could offset each other. That means ADC quantification may mask some useful information about the tumor, while texture analysis can capture the spatial distribution of intensities, which rendered it as a more informative analysis method and a complement to routine ADC measurements. On the other hand, the lack of statistical significance might be because of the relatively small patient population.
Nodal involvement may be an indication for preoperative NAT in rectal cancer.Numerous previous studies have used size as the criterion for evaluating nodal metastases, but the size cutoff values for distinguishing benign from malignant nodes are inconsistent. In the present study, we found that three texture features from DWIb=0images, four texture features from DWIb=1000images and one texture feature from ADC maps were significantly different between stage N0 and N1-2 groups. In addition, ADC values (ADCmaxand ADCmean) were also significant parameters with discrimination value. This is consistent with a recent study by Vignatiet al[33], in which the correlation between ADC texture features and grading of prostate cancer was investigated. However, our findings were in conflict with the study by Liet al[34],which concluded that none of the ADCs showed any significant difference in predicting N stage. Such contradiction might be induced by the way tumors were identified. In our study, the lesions were extracted on a single ADC slice with the largest tumor diameter, while in the study by Liet al[34], the features were derived from the whole-lesion volume. Our experimental results also proved that lower sum average, lower gray level nonuniformity and higher SymletH from DWIb=0images,lower sum average, lower information correlation, lower long run low gray level emphasis and higher SymletH from DWIb=1000images, and lower ADCmax, lower ADCmeanand lower information correlation from ADC maps appeared to be independent predictors of nodal involvement. By using the logistic regression model that factored these independent predictors, the performance (AUC = 0.802) for predicting N stage was better than that obtained from three other models that included significant features derived from the two types of DWI images and ADC maps, respectively. Therefore, it may be valuable to predict nodal status using the texture features based on medical images of rectal cancer. Huanget al[35]reported the performance of texture analysis in determining N stage based on CT images. Their proposed method showed slightly lower efficiency (AUC = 0.736) than our method(AUC = 0.802). As pointed out by Lubneret al[36], CT acquisition parameters that influence attenuation or pixel relationships may affect texture measures. In addition to the absence of ionizing radiation, MRI is capable of multiparametric imaging, and can provide not only morphological but also functional images. MRI signal intensity is related to many factors, such as strength and uniformity of the main magnetic field,the sequence used, and the imaging parameters used (repetition time/echo time,trigger angle, and others). Thus, the application of MRI has been thought to be complicated by many issues, which brings high soft-tissue contrast and non-invasive assessment of the microcirculation of tumor. Previous studies demonstrated that in comparison with CT, MRI can provide more valuable data for radiomics through high-throughput extraction of quantitative image features. Thus, relative to CT, MRI undoubtedly has greater advantages in reflecting tumor heterogeneity and primary tumor stage for rectal cancer diagnosis, and is strongly recommended by the American Society of Colon and Rectal Surgeons to be performed before treatment[37].
Figure 3 Receiver operating characteristic curves obtained with different discriminatory models for predicting T1-2 and T3-4 stage tumors. AUC: Area under the receiver operating characteristic curve; ADC:Apparent diffusion coefficient.
In addition, interobserver variability for the calculation of ADC values and texture features based on the single-slice method between two radiologists was also evaluated. The results indicated excellent agreement with intraclass correlation coefficients ranging from 0.844 to 0.960. The variability mainly originated from slice selection and ROI delineation. Thus, it will be important to standardize strategies for ROI definition.
There are some limitations in this study. Firstly, it was a retrospective study with a relatively small sample size which impedes the generalizability of the findings.Secondly, texture analysis was performed based on a single-slice image which showed the largest diameter of the tumor, rather than the whole tumor volume. As rectal cancer usually grows along the rectal wall and forms an irregular shape, ROI delineation with a single-slice method may not accurately represent the actual shape.Thirdly, the findings may not apply to advanced rectal cancer as we only enrolled patients who underwent surgical resection directly rather than those who first received NAT. Finally, the calculated features are sensitive to the appliedb-values.We only analyzed DWI ofb= 0 andb= 1000 images and the corresponding ADC maps. More choice inb-values should be considered in the future.
In conclusion, texture features extracted from DWI images and ADC maps are useful clues for predicting pathological T and N stages in rectal cancer. This method may help radiologists perform accurate staging and therefore help improve individualized treatment planning.
Table 4 Spearman correlation coefficients for independent predictors of T-stage
Table 5 Comparison of extracted features between N0 and N1-2 stage groups of rectal cancers
1Independent samples t-test, data are means ± SD.2Mann-Whitney U test, data are medians ± interquartile range. DWI: Diffusion-weighted imaging; ADC: Apparent diffusion coefficient.
Table 6 Spearman correlation coefficients for independent predictors of N-stage
Figure 4 Receiver operating characteristic curves obtained with different discriminatory models for predicting N0 and N1-2 stage tumors. AUC: Area under the receiver operating characteristic curve; ADC: Apparent diffusion coefficient.
Colorectal cancer is the third leading cause of cancer worldwide, and rectal cancer accounts for approximately 30%-35% of colorectal cancer cases. An accurate evaluation of T and N stage in rectal cancer is essential for treatment planning. Heterogeneity within the tumor is a powerful predictor of pathological stage, which can be captured by diffusion-weighted imaging (DWI)images and apparent diffusion coefficient (ADC) maps with texture analysis.
A search of PubMed database indicates that texture analysis of DWI images combined with ADC maps for preoperative staging of rectal cancer has not been reported.
The aim of this study was to investigate whether texture features derived from DWI images and ADC maps were associated with the pathological stage T1-2vsT3-4 and pathological stage N0vsN1-2 in rectal cancer.
One hundred and fifteen eligible patients were selected for analyses. Lesion segmentation was performed manually. Twelve texture features were calculated from DWI images and ADC maps,including the gray level co-occurrence matrix parameters, the gray level run-length matrix parameters and wavelet parameters. Moreover, ADC values were measured from the lesion area.An independent samplet-test or Mann-WhitneyUtest was used to compare parameters between T1-2 and T3-4 stages, and between N0 and N1-2 stages. Multivariate logistic regression analysis was performed with the entry of variables to identify independent factors for T3-4 and N1-2 tumors. Receiver operating characteristic analysis was conducted to evaluate the diagnostic performance of the established logistic models for prediction of T3-4 and N1-2 tumors by calculating the area under the receiver operating curve (AUC).
Dissimilarity, sum average, information correlation and run-length nonuniformity from DWIb=0images, gray level nonuniformity, run percentage and run-length nonuniformity from DWIb=1000images, and dissimilarity and run percentage from ADC maps were found to be independent predictors of local invasion (stage T3-4). The AUC of the model reached 0.793 with a sensitivity of 78.57% and a specificity of 74.19%. Sum average, gray level nonuniformity and the horizontal components of symlet transform from DWIb=0images, sum average, information correlation,long run low gray level emphasis and horizontal components of symlet transform from DWIb=1000images, and ADCmax, ADCmeanand information correlation from ADC maps were identified as independent predictors of nodal involvement. The AUC of the model reached 0.802 with a sensitivity of 80.77% and a specificity of 68.25%.
The results indicated that texture features derived from preoperative DWI images combined with ADC maps were significantly associated with T and N stage in rectal cancer. These findings may be of value for the selection of treatment strategies.
In this project, we evaluated the role of ADC maps and DWI images in determining pathological stage, and the results revealed that texture features could be considered as novel biomarkers to predict the N and T stage in rectal cancer. In a subsequent study, a randomized multi-center prospective trial could be conducted to further validate our findings.
We thank the Shengjing Hospital of China Medical University for providing data and support.
World Journal of Gastroenterology2020年17期