Ultrasound Based Computer-Aided Diagnosis of Kidneys for Pediatric Hydronephrosis Juan J. Cerrolaza*a, Craig A. Petersa,b, Aaron D. Martina,b, Emmarie Myersa, Nabile Safdara,c,d, Marius George Lingurarua,d a Sheikh Zayed Institute for Pediatric Surgical Innovation, Children’s National Medical Center, Washington, DC, USA; bDept. of Pediatric Urology, Children’s National Medical Center, Washington, DC, USA; cDept. of Diagnosis Imaging and Radiology, Children’s National Medical Center, Washington, DC, USA; dDept. of Radiology and Pediatrics, School of Medicine and Health Sciences, George Washington University, Washington, DC, USA. ABSTRACT Ultrasound is the mainstay of imaging for pediatric hydronephrosis, though its potential as diagnostic tool is limited by its subjective assessment, and lack of correlation with renal function. Therefore, all cases showing signs of hydronephrosis undergo further invasive studies, like diuretic renogram, in order to assess the actual renal function. Under the hypothesis that renal morphology is correlated with renal function, a new ultrasound based computer-aided diagnosis (CAD) tool for pediatric hydronephrosis is presented. From 2D ultrasound, a novel set of morphological features of the renal collecting systems and the parenchyma, is automatically extracted using image analysis techniques. From the original set of features, including size, geometric and curvature descriptors, a subset of ten features are selected as predictive variables, combining a feature selection technique and area under the curve filtering. Using the washout half time (T1/2) as indicative of renal obstruction, two groups are defined. Those cases whose T 1/2 is above 30 minutes are considered to be severe, while the rest would be in the safety zone, where diuretic renography could be avoided. Two different classification techniques are evaluated (logistic regression, and support vector machines). Adjusting the probability decision thresholds to operate at the point of maximum sensitivity, i.e., preventing any severe case be misclassified, specificities of 53%, and 75% are achieved, for the logistic regression and the support vector machine classifier, respectively. The proposed CAD system allows to establish a link between non-invasive non-ionizing imaging techniques and renal function, limiting the need for invasive and ionizing diuretic renography. Keywords: Hydronephrosis, kidney, ultrasound imaging, computer-aided diagnosis, machine learning.

INTRODUCTION Hydronephrosis is one of the most common finding in pediatric urology, affecting 2-2.5% of children [1]. The accumulation of urine in the renal pelvis as a result of obstruction to outflow causes the dilation of the renal collecting system and the distortion of the renal parenchyma [2]. Based on these morphological evidences, ultrasound (US) imaging is the mainstay for early diagnosis of hydronephrosis, yet is limited by its subjective assessment, and lack of correlation with functional imaging modalities, like diuretic renogram (MAG-3 scan [3]). Based on simple visual inspection, the commonly used Society for Fetal Urology (SFU) grading system for hydronephrosis [4] is subjective, and the categorization obtained does not necessarily reflects the actual renal function of the kidney, as shown in Fig. 1. As a result, the SFU system is not universally accepted as useful [5] and functional renal imaging has emerged as the gold standard for assessing hydronephrosis. A renal MAG-3 scan provides valuable and objective information on renal function, though its invasive nature, involving bladder catheterization and ionizing radiation exposure, makes it especially undesirable for children. The main goal of this work is to create a new US based computer-aided diagnostic framework for pediatric hydronephrosis. The underlying hypothesis of this study is that hydronephrosis can be objectively quantified using modern image analysis of US images, and that there is a correlation between the degree of hydronephrosis and the severity of the obstruction as measured by diuretic renography. *[email protected]; phone 1 202 476 1272; innovationinstitute.childrensnational.org

Medical Imaging 2014: Computer-Aided Diagnosis, edited by Stephen Aylward, Lubomir M. Hadjiiski, Proc. of SPIE Vol. 9035, 90352T · © 2014 SPIE · CCC code: 1605-7422/14/$18 · doi: 10.1117/12.2043072

Proc. of SPIE Vol. 9035 90352T-1

Figure 1. Lack of correlation between SFU grading system and renal function. According to the SFU grading system, the upper image would be labeled as a moderate hydronephrotic case, while the lower would correspond to a more severe case. However, functional information reveals that the former has a greater obstruction.

+1120

110

90

washout half time Tin (min) 70

46

36

26

16

12

- 0.9 0.9 0.8

(42, 0.8)

Ú

e =

n C

0.8

-0.7Ç

(25, 0.8)

G^ su

,= o 0.7 >

c

(18,0.5)

0.6-

U

- 0.3 0.5

(7, 0.5)

- 0.2 0.45

12

24

36

48

60

age (months)

72

84

96

+98 1

Figure 2. Cumulative frequency distribution of the population. The ages considered in the data set ranges from newborns (< 5 months) to 14 years of ages. The washout half times goes from < 6 min (minor obstruction) to more than 120 min.

From the manual segmentation of renal parenchyma and collecting system in 2DUS images, an optimal set of morphological descriptors of the kidney are automatically extracted and used as inputs features of a machine learning algorithm. This classifier is able to predict the degree of hydronephrosis of the renal unit, identifying with maximum sensitivity the severe cases that require immediate medical attention, while maximizing the number of non-critical cases where diuretic renography can be safely avoided.

METHODOLOGY The data set considered in this study consists of 50 hydronephrotic cases belonging to 45 children (31 male and 14 female; studies of both kidneys are available from some patients) of different ages, from newborns to 14 years of age. The cumulative histogram of the age distribution is depicted in Fig. 2. For each case, concurrent renal 2D ultrasound imaging and diuretic renography are available, allowing to study the correlation between renal function and morphology. The degree of obstruction of the kidney is described by means of the washout half time (T1/2): the time required to clear half of the accumulated tracer from the renal pelvis after reaching its peak and being stimulated by administration of flurosemide [2] Fig. 2. also illustrates the distribution of T 1/2 in the data set. Using this parameter as a reference, thresholds of functional significance are defined to determine those patients where diuretic renography could be safely avoided, and which would benefit from invasive functional imaging. In particular, a threshold of 30 minutes is considered in this study.

Proc. of SPIE Vol. 9035 90352T-2

From the sequence of 2DUS scans, a 2DUS slice containing a whole longitudinal section of the kidney and its collecting system is selected and manually segmented in consensus by two experienced clinicians, a radiologist and an urologist, minimizing the subjectivity of the delineation process in US images. Though the development of accurate and automatic segmentation tools in US is especially challenging, the promising results reported by some recent works [2, 7, 8] suggest the possibility of automating this preliminary stage in the near future. Fig. 3 (b) shows the consensus masks obtained for the case depicted in Fig. 3 (a).

Figure 3. 2DUS slice selection and segmentation. a) Selected slice containing the whole longitudinal section of the kidney. b) Consensus manual delineation of renal parenchyma and collecting system.

Figure 4. Morphological descriptors. Sample of some of the morphological descriptors considered as potential predictive features of hydronephrosis severity: eccentricity of the circumscribed ellipse ((1-b2/a2)1/2), maximum and minimum parenchyma thickness (Pmax, Pmin), curvature statisticis of the collecting system (Ccs) and kidney surface (Ck) (i.e., mean, maximum, minimum, kurtosis, etc), point of maximum curvature of the kidney medial axis KMA, (PKMA max).

The binary masks of the kidney and its collecting system, are processed by means of image analysis techniques to automatically extract a set of morphological parameters of the kidney. These quantitative shape descriptors can be intuitively grouped in the following three categories. Size descriptors: Parameters describing the size of the collecting system and renal parenchyma, such as relative area, perimeter, or parenchymal thickness. Geometric shape descriptors: Generic geometric parameters as the circularity ratio, eccentricity of the circumscribed ellipse, or convexity. The Hu set

Proc. of SPIE Vol. 9035 90352T-3

of seven invariant moments [9] of both, the parenchyma and the collecting system, are also considered here. These features, independent of position, size and orientation, have been extensively applied to image registration and reconstruction, and are of great interest for image pattern recognition. We have also considered other shape descriptors as the kidney medial axis [10], and other derived features such as the asymmetry index (i.e. ratio between the segments that joint the two extremes of the kidney medial axis and the point of maximum curvature of this axis). Curvature descriptors: Including the traditional extrinsic curvature of planar curves, and the more recent local area integral invariants [11]. The potential utility of such morphological descriptors for assessing hydronephrosis has been stressed by several authors, attempting to analyze US imaging of hydronephrosis in more objective ways. The “hydronephrosis index” defined by Saphiro et al. [12] as the ratio between the area of the renal parenchyma and the total area of the kidney, has been shown to correlate with the SFU. Instead of a single feature, more than 130 quantitative shape descriptors are considered in this study (including the aforementioned “hydronephrosis index”), some of them are graphically illustrated in Fig. 4. These quantitative shape descriptors are used as potential predictive variables to evaluate the diagnostic performance of two different supervised classification models: logistic regression analysis, and support vector machine with radial basis function kernel. Both approaches have been widely used in the field of computer-aided-diagnosis with satisfactory results. Logistic regression (LOG) is a binary classifier that performs probability estimation using a logistic formula. On the other hand, support vector machines (SVM) operate by finding an hyperplane leaving the largest possible fraction of points of the same class on the same side, while maximizing the distance of either class from the hyperplane. Prior to evaluating the classification capabilities of both methods, the high dimensional space of predictive features is reduced by finding the subset of relevant morphological descriptors that facilitate the classification process. By means of the feature selection technique proposed by Cai et al. [13], optimal subsets of 9 variables are obtained, using the area under the curve (AUC) criteria to select the final predictive set (Table 1). A detailed description of the resulting set is provided in the next section. The parameters optimization is performed via grid search, and the leave-one-out strategy is employed for training and testing purposes. Receiver operating characteristic (ROC) curve analysis is used to identify the operating points with the highest accuracy, sensitivity and specificity of each classifier. The ultimate goal is to identify those probability thresholds that maximize the sensitivity of detecting severe cases of hydronephrosis; that is, no case with a washout time above the defined threshold is misclassified. Thus, diuretic renography could be safely avoided in those cases classified as non-severe, reducing the number of unnecessary ionizing imaging studies. Table 1. Selection of morphological and geometrical descriptors used as predictive variables.

Predictor P1 P2 P3 P4 P5 P6 P7 P8 P9 *

Descriptor Size Size Geometric Geometric Geometric Curvature Curvature Curvature Curvature

Explanation Minimum parenchymal thickness.* Pmin/a Kurtosis of the parenchymal thickness.* Kurt(P/a) Asymmetry index of the kidney medial axis. (P1KMA-PKMA max)/( P2KMA-PKMA max) 1st Hu invariant moment of the renal parenchyma. 3rdHu invariant moment of the renal parenchyma. Minimum curvature of the kidney contour. min(CK) Entropy of the curvature of the kidney contour. Ent(CK) Minimum curvature of the collecting system. min(CCS) Entropy of the parenchymal local area integral. Ent(LAI(P))

The parenchymal thickness has been normalized by the major axis of the ellipse circumscribing the kidney (see Fig. 4).

RESULTS Following the procedure described above, a set of 10 features is selected, including the following morphological descriptors: first and third invariant moment of the parenchyma; inner curvature of the kidney; minimum and entropy of the curvature of the kidney; minimum curvature of the collecting system; entropy of the local area integral information of the collecting system; minimum, and kurtosis of parenchyma thickness; kurtosis of the distance between the kidney and the collecting system. Fig. 5 shows the ROC curves for the two classifiers under study. The AUC for SVM is 0.89, and 0.93 for LOG. The performance of each one of the classifiers is evaluated in terms of its specificity, sensitivity, and

Proc. of SPIE Vol. 9035 90352T-4

accuracy, whose numerical values are listed in Table 2. The accuracy, sensitivity and specificity for SVM and LOG are 0.86, 0.57, and 0.97, and 0.82, 0.71, and 0.86, respectively. The ROC shown in Fig. 5 allows to identify the optimal operating points that provide 100% sensitivity while maximizing the specificity of the system (i.e., minimizing the false positive rate). The performance of the binary classifiers achieved after modifying the probability decision thresholds to operate at these points of maximum sensitivity is also shown in Table 1, where the best case in terms of specificity (percentage of patients in the safe zone) is marked with (*). In particular, it is the LOG classifier that provides the best result with a specificity of 0.75, while the specificity for the SVM is 0.53. Receiver Operating Characteristics Graphs

0.9 -

:°. 0.5

0

Point of Max. Sensitivity

AUC

d

ÿ

0.424-4 k

SVM 0.89

(0.47, 1)

0.93

(0.25, 1)

LOG 0.2"-

0.1 0

0

0.4

0.2

0.6

0.8

false positive rate

Figure 5. Receiver operating characteristic graph. ROC curves obtained for the two machine learning methods under study, logistic regression and support vector machines. The graph also shows the area under the curve obtained for each case and the point of maximum sensitivity

Table 2. Area under the curve, sensitivity, specificity, and accuracy obtained for the two classifiers studied, SVM, and LOG. After modifying the decision threshold to operate at the point of maximum sensitivity, the best result in terms of specificity is marked with (*) Original Values SVM LOG

Max. Sensitivity

AUC

Sensitivity

Specificity

Accuracy

Sensitivity

Specificity

Accuracy

0.89 0.93

0.57 0.71

0.97 0.86

0.86 0.82

1.00 1.00

0.53 * 0.75

0.66 0..82

CONCLUSIONS A new US based CAD tool for pediatric hydronephrosis is presented in this study. Alternatively to the subjective SFU grading system based on simple visual inspection, we propose a novel objective and reproducible scoring system for hydronephrosis able to relate measurable morphological findings from US imaging with renal function information provided by MAG-3 scans. Using the washout half time as clinical indicative of renal obstruction, the population is divided into two groups of patients: those with severe obstruction and those with satisfactory drainage where further invasive studies can be safely avoided. Beyond the typical parameters based on the area and size of the kidney, more than 130 anatomical features are automatically extracted from the 2DUS images via image analysis techniques. Many of these features are novel and tailored to capture the relation between renal parenchyma and collecting system. Based on the washout half time, a selection of these features are used as apredictive variables to classify hydronephrotic patients as severe, where a further invasive imaging would be warranted, or non-critical, where diuretic renography could be safely avoided. Two different machine learning algorithms are tested: logistic regression, and support vector machines. The results obtained from a dataset of 50 cases with different degree of renal obstruction are very promising. Operating at the

Proc. of SPIE Vol. 9035 90352T-5

point of maximum sensitivity to guarantee that no case with severe hydronephrosis is misclassified, up to 75% of nonsevere cases are successfully identified. The results obtained with 2DUS images demonstrate the feasibility of developing reliable and objective diagnostic tools to support the routine clinical evaluation of kidneys from non-invasive imaging modalities. The new CAD system proposed here does not intend to replace MAG-3, but to present an accurate, quantitative and reproducible diagnostic tool for hydronephrosis based on simple 2DUS images that improves the current standards of management of hydronephrotic patients and limits the need for diuretic renography. In future work we expect to improve system performance by extending the study to 3D US images.

REFERENCES [1] Riccabona, M., "Obstructive diseases of the urinary tract in children: lessons from the last 15 years," Pediatr Radiol. 40(6), 947-955 (2010). [2] McDougal, W.S., Wein, A.J., Kavoussi, L.R., Novick, A.C., Partin, A.W., Peters, C.A. and Ramchandani, P., “Campbell-Walsh Urology 10th Edition – Chapter 113: Congenital Urinary Obstruction: Pathophysiology and Clinical Evaluation”, Elsevier Health Sciences, 3028-3047(2011). [3] Ross, S. S., Kardos, S., Krill, A., Bourland J., Sparque, B., Majd, M., Pohl, H. G., Gibbons, M. D., Belman A. B. and Rushton, H. G. "Observation of infants with SFU grades 3-4 hydronephrosis: worsening drainage with serial diuresis renography indicates surgical intervention and helps prevent loss of renal function." Journal of pediatric urology 7(3), 266-271 (2011). [4] Fernbach, S. K., Maizels, M. and Conway, J.J. "Ultrasound grading of hydronephrosis: introduction to the system used by the Society for Fetal Urology," Pediatric Radiology 23(6), 478-80 (1993). [5] Keays, M. A., Guerra, L.A., Mihill, J., Raju, G., Al-Asheeri, N., Geier, P., Gaboury, I., Matzinger, M., Pike, J. and Leonard, M. P, "Reliability assessment of Society for Fetal Urology ultrasound grading system for hydronephrosis." The Journal of urology 180(4), 1680-1682 (2008). [6] Mendoza, C. S., Xin Kang, Safdar, N., Myers, E., Peters, C. A. and Linguraru, M. G., "Kidney segmentation in ultrasound via genetic initialization and Active Shape Models with rotation correction,” Int. Symp. Biomedical Imaging (ISBI), 69-72 (2013). [7] Cerrolaza, J. J., Safdar, N., Peters, C. A., Myers, E., Jago, J. and Linguraru, M. G., “Segmentation of Kidney in 3D-Ultrasound Images Using Gabor-based Appearance Models”, Int. Symp. Biomedical Imaging (ISBI), (2014) (in press). [8] Mendoza, C.S. Kang, X. Safdar, N. Myers, E. Martin, A. Grisan, E. Peters, C.A. and Linguraru, M.G., “Automatic Analysis of Pediatric Renal Ultrasound Using Shape, Anatomical and Image Acquisition Priors”, Medical Image Computing and Computer-Assisted Intervention, Lecture Notes in Computer Science, 8151, 259-266 (2013). [9] Ming-Kuei Hu, “Visual pattern recognition by moment invariants,” IRE Transactions on Information Theory. 8(2), 179-187 (1962). [10] Aichholzer, O., Aigner, W., Aurenhammer, F., Hacki, T., Juttler, B. and Rabi, M., “Medial axis computation for planar free-form shapes”, Computer-Aided Design, 41(5), 339-349 (2009). [11] Manay, S., Cremers, D., Byung-Woo Hong, Yezzi, A.J. and Soatto, S., "Integral Invariants for Shape Matching," IEEE Transactions on Pattern Analysis and Machine Intelligence 28(10), 1602-1618 (2006). [12] Shapiro, S. R., Wahl, E. F., Silberstein and M. J., Steinhardt, G., “Hydrnephrosis index: a new method to track patients with hydronephrosis quantitatively”, Urology, 72(3) 536-539 (2008). [13] Cai, D., Zhang, C., He, X., “Unsupervised feature selection for multi-cluster data,” Proc. ACM SIGKDD Int. Conf. on Konwledge Discovery and Data Mining, 333-342 (2010).

Proc. of SPIE Vol. 9035 90352T-6

Ultrasound based computer-aided-diagnosis of kidneys ...

Sciences, George Washington University, Washington, DC, USA. ... half time (T1/2) as indicative of renal obstruction, two groups are defined. ... Keywords: Hydronephrosis, kidney, ultrasound imaging, computer-aided ... This classifier is able to predict the degree of hydronephrosis of the renal unit, identifying with maximum.

542KB Sizes 0 Downloads 104 Views

Recommend Documents

object-based ultrasound video processing for wireless ... - Springer Link
The thin white lines define boundaries between objects of interest. In the upper rectangular region, patient information has been deleted. number of times, or if ...

Automatic segmentation of kidneys from non-contrast ...
We evaluated the accuracy of our algorithm on five non-contrast CTC datasets .... f q t qp p. +. = → min, min. (3) t qp m → is the belief message that point p ...

Badlands Ultrasound Data.pdf
Badlands Ultrasound Data.pdf. Badlands Ultrasound Data.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Badlands Ultrasound Data.pdf. Page 1 ...

Ultrasound data page.pdf
53 647 0.38 15.0 3.32. 60 03E 0.31 13.8 2.78. 61 02E 0.19 12.1 1.95. AVERAGES | Adj Rib Fat: 0.38 | Adj RE: 15.0 | Adj % IMF: 3.82. ULTRASOUND DATA ON ALL BULLS. Page 1 of 1. Ultrasound data page.pdf. Ultrasound data page.pdf. Open. Extract. Open wit

Frey Ultrasound Data.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Frey Ultrasound ...

Download Diagnostic Ultrasound
quality assurance and safety. • Additional chapters dedicated to 3D ultrasound, contrast agents and elastography. • New glossary containing definitions of over 500 terms. The editors and contributing authors are all authorities in their a

Ultrasound findings of hepatic lipidosis in cows with ...
Efficient application of diagnostic ... life of dairy caws. ... development of hepatic lipidosis [3] and [5] and [7]. ... needed to keep in view the tricarboxylic acid cycle,.

pulsed ultrasound Acceleration of tibial fracture-healing ...
Science. Center at San Antonio,. San Antonio. ABSTRACT: Sixty-seven closed or grade-I open .... accelerated fracture-healing ..... (degrees). Before reductiont. After reductiont. Maximum fracture gapt (mm). Length of fracturet ..... computer. All of

Ultrasound-promoted synthesis of 4(3H)-quinazolines under ... - Arkivoc
Jul 13, 2016 - the reported methods represent practical applications and modifications of the ... development of milder and novel synthetic schemes to these ...

Accuracy of three-dimensional ultrasound compared with magnetic.pdf
uterine wall thickness (Figure 1); a septum may partially. (Class U2a) or completely (Class U2b) divide the uterine. cavity. Class U3, or bicorporeal uterus, was ...