Proceedings of the 7th International Symposium on Machinery and Mechatronics for Agriculture and Biosystems Engineering (ISMAB) 21-23 May 2014, Yilan, Taiwan

DEVELOPING A COMPUTER-AIDED DIAGNOSIS SYSTEM FOR CLASSIFICATION OF MALIGNANT BREAST TUMOR GRADE IN ULTRASOUND IMAGING Cheng-Liang Chien1, Yan-Fu Kuo1*, Dar-Ren Chen2 1

Department of Bio-Industrial Mechatronics Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 106, Taiwan, R.O.C. 2 Comprehensive Breast Cancer Center, Department of Medical Research, Changhua Christian Hospital, 135 Nanhsiao St., Changhua 500, Taiwan, R.O.C. *Corresponding Author-- Voice: +866-3366-5329, Email: [email protected] Abstract: This work developed a computer-aided diagnosis (CAD) system for discriminating malignant grades of breast tumors in three-dimensional (3D) ultrasound (US) imaging. US imaging is a widely used noninvasive detection approach for breast cancer. The grades of breast cancer masses are standard prognostic indicators in patients. Knowing the tumor grades allows doctors to be more selective in their diagnosis and to appropriately determine the treatment to the patients. In this study, a total of 148 3D US images of malignant breast tumors were obtained. The tumor masses were segmented. The patient’s age, texture, morphological, ellipsoid fitting features, and were quantified to describe the characteristics of the tumor masses. A support vector machine (SVM) model was then developed using machine learning algorithms to classify breast tumor grades. Performance of the proposed approach was evaluated using ten-fold cross-validation. It is demonstrated that the developed CAD system can effectively classify between low and high grades of breast tumors. Key Words: Breast tumor, Three-dimensional ultrasound image, Machine learning, Image processing.

INTRODUCTION Breast cancer is a leading cause of death for women throughout the world (Jemal et al., 2006). Early detection is essential for improving the patient survival rate. In previous decades, ultrasound (US) imaging has been reported as an effective method for screening breast cancer because of its low cost, high efficacy, real-time results, and lack of radiation. Grade of a malignant tumor is a measure of the tumor abnormality. According to the Nottingham’s grading system, breast cancers can be categorized into 3 grades (Elston and Ellis, 1991; Genestie et al., 1998; Elston, 2005). Tumor grading has been regarded as the important The authors are solely responsible for the content of this technical presentation. The technical presentation does not necessarily reflect the official position of the Chinese Institute of Agricultural Machinery (CIAM), and its printing and distribution does not constitute an endorsement of views which may be expressed. Technical presentations are not subject to the formal peer review process by CIAM editorial committees; therefore, they are not to be presented as th refereed publications. Citation of this work should state that it is from the 7 ISMAB paper. EXAMPLE: Author's Last th Name, Initials. 2014. Title of Presentation. The 7 ISMAB May 21-23, 2014. Yilan, Taiwan. For information about securing permission to reprint or reproduce a technical presentation, please contact CIAM at [email protected] or the Chinese Institute of Agricultural Machinery, c/o Department of Bio-industrial Mechatronics, National Chung Hsing University, 250 KuoKuang Road, Taichung 40227, Taiwan.

360

prognostic indicator (Rakha et al., 2010). The tumor grade information is essential for helping the doctors to decide appropriate treatments for the patients. Analysis of sonographic characteristics can assist in differentiating breast tumor grades by using BI-RADS (American College of Radiology, 2003) criteria (Kim et al., 2008; Aho et al., 2013; Wojcinski et al., 2013). The BI-RADS® -US lexicon classification for breast tumors provides a variety of categories, such as morphological and textural features. Experienced physicians evaluate breast tumor according to BI-RADS. However, US images are usually speckled and include tissue-related textures. The image interpretation is operator-dependent (Huang et al., 2008). Hence, computer-aided diagnosis (CAD) system is developed to enhance the diagnostic accuracy. In this study, a CAD (Huang et al., 2008; Chen et al., 2009; Chen et al., 2011; Moon et al., 2011) system was proposed for discriminating tumor grade with 3D US imaging. The specific objectives were as follows: (1) to quantify the features of the breast cancer on US image; (2) to develop a model for distinguishing between high grade and low grade tumors by using the feature sets; and (3) to evaluate the performance levels of developed model. In the process, volumetric US breast images were collected, and the tumor lesions were segmented based on the images. The textural, morphological, and ellipsoid fitting features of these tumor masses were then quantified. A support vector machine (SVM) classifier was then developed to distinguish tumor grades with the different features. The performance of developed models was assessed by receiver operating characteristic (ROC) analysis. MATERIALS AND METHODS IMAGE ACQUISITION The breast US images used in this study were samples of diagnostic cases obtained during routine clinical care at Changhua Christian Hospital (Changhua, Taiwan). The images were acquired using a US scanner (Voluson 730; GE Healthcare, Zipf, Austria) equipped with a 5.6-18 MHz volume transducer (RSP6-16; GE Healthcare, Zipf, Austria). All images were collected from patients diagnosed between June 2007 and August 2009. The ages of the patients ranged from 24 to 87 years (mean 51.3 years). Regarding patients that exhibited multiple tumor masses, only images of the largest lesion were collected from the database. The grades of the tumors were identified based on a pathological diagnosis, involving biopsy methods and the Nottingham’s grading system. The numbers of grade 1, 2, and 3 tumors were 25, 94, and 29. In this study, grade 1 and 2 were defined as low grade, whereas grade 3 was considered as high grade. The US images were converted into 3D grey-level images in a Cartesian coordinate system by using 4D View software (Chang et al., 2012). The mean voxel resolution was 0.2 mm. The ethics committee of the hospital approved the study. No patient identification was disclosed in an effort to avoid diagnosis bias and ensure patient privacy. TUMOR SEGMENTATION Tumor segmentation was performed to extract the region of the tumor before quantifying the lesion features. The tumor masses were semiautomatically segmented using ITK-SNAP software (Yushkevich et al., 2006), which performed active contouring based on level set

361

algorithm (Osher and Sethian, 1988; Sethian, 1999). The operators identified the lesions in the US images, then placed seeds (i.e., starting points) at appropriate locations inside the tumor masses. The seeds expanded until they reached the tumor boundaries. Appropriate control parameters were set to ensure optimal segmentation results were attained (Yushkevich et al., 2006). Compared with manual methods, semiautomatic segmentation is more consistent and less laborious when sketching precise contours and is particularly suitable for use with 3D US images. Figure 1 shows a segmented volumetric tumor mass.

Figure 1. The sonographic A-, B-, and C-view and segmented volumetric tumor mass using ITK-SNAP. FEATURE QUANTIFICATION Features were used to describe the characteristics of tumors for classification. They could be categorized into three sets – textural, morphological and ellipsoid fitting features. The age of the patient was an important attribute and was also considered as a feature. The textural features quantify the spatial correlation of voxel gray levels of the tumor mass. The textural features were calculated based on the gray level co-occurrence matrix (GLCM; (Haralick et al., 1973) Pd of volumetric images (Chen et al., 2007). In the process, the US images were quantized to 16 gray levels. The frequencies of gray level differences between 2 adjacent voxels in the images then were cumulated to from the Pd (Chen et al., 2007). In this study, a displacement vector d was defined to represent the geometric relationship between the 2 adjacent voxels. The vector d was set to be (1,1,1), (1,0,0), (0,1,0), and (0,0,1) in this study. Afterward, 6 textural feature were calculated based on the GLCM, including the angular second moment TASM, contrast TCon, inverse different moment TI, entropy TE, dissimilarity TD, and correlation TCor (Haralick et al., 1973; Jobanputra and Clausi, 2004). For instance, the TCon(0,1,0) feature represent the contrast feature evaluated by GLCM P(0,1,0) in vector (0,1,0). A total of 24 textural features were included in this study. The morphological features (Huang et al., 2008) describe the superficial and boundary regularity of the tumor masses. There were 6 morphological features included in this study.

362

Tumor volume MV (unit: mm3) and tumor surface area MA (unit: mm2) assess basic structural characteristics of the tumor mass. Classical compactness MCc describes the degree of similarity between a tumor mass and its optimally fitted sphere, and discrete compactness MCd describes the degree of similarity between a tumor mass and its optimally fitted cube (Bribiesca, 2008; Moon et al., 2011). The mean radius MRm and standard deviation of radius MRstd were used as indices to represent the characteristics of irregular tumor surface. The ellipsoid fitting features (Moon et al., 2011) describe the degree of similarity between a tumor mass and its optimally fitted ellipsoid. The optimally fitted ellipse can be regarded as the baseline for measuring the degree of shape irregularity of the tumor mass. Nine ellipsoid fitting features were applied in this study. Three of them were related to the properties with the ellipsoid and the tumor, including axis ratio EA, surface ratio ES, and volume covering ratio EV. The outside region ERO and inside region ERI quantified the numbers of regions of a tumor outside and inside, respectively, its best-fitted ellipsoid. The sum of regions ER is the sum of outside and inside regions ERO and ERI. The angularity is used to evaluate the protruding level in the outside region and indenting level in inside region. The feature EROa is the number of regions whose angularity is larger than a threshold for outside region. Similarly, the feature ERIa is the number of regions whose angularity is smaller than a threshold for inside region. The feature ERa is the sum of EROa and ERIa. TUMOR GRADE CLASSIFICATION SVM classifiers were developed to differentiate high and low grade tumors. The patient’s age, textural, morphological, and ellipsoid fitting features were model inputs. In this study, a soft margin SVM classifier with radial basis function kernel was developed using LIBSVM (Chang and Lin, 2011). The margin and parameters were determined using grid search. The dataset used in this study was unbalanced (119 and 29 of the low-grade and high-grade tumors). In the model development, the soft margin parameter ratio was set to be the inverse of the tumor number ratio between the two grades (Ben-Hur and Weston, 2010). The details of model parameter selection can be referred to LIBSVM (Chang and Lin, 2011). PERFORMANCE EVALUATION ROC analysis was applied to measure the performance levels of the developed CAD systems with ten-fold cross-validation (CV). Six indices were calculated: the area under the curve (AZ), accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) (Hanley and McNeil, 1982). The sensitivity and specify were defined as the percentages of actual high-grade and low-grade tumors, respectively, that were correctly classified. The PPV was defined as the percentage of predicted high-grade tumors which were correctly classified, and NPV was defined as the percentage of predicted low-grade tumors correctly classified. AZ is a measure of the overall performance of a model. The calculation of the ROC indices was performed using MATLAB (MathWorks, Inc.). RESULTS & DISCUSSION A SVM model was developed to discriminate the low-grade and high-grade tumors using 24 textural, 6 morphological, 9 ellipsoid fitting features, and patient's age as the inputs. Table 1 shows the classifier performance in terms of the 6 ROC indices. Accurate diagnosis of

363

high-grade tumor is crucial for a CAD system. High-grade tumors are more threatening. Misdiagnosing a high-grade tumor may increase risk of harm to life and should be avoided. Therefore, the sensitivity and NPV are 2 critical indices for evaluating the performance of the CAD system. Although the current model achieved a reasonable accuracy (75.0%), the sensitivity (55.17%) given by the proposed approach needed to be improved. Table 1. ROC indices of the classifier Accuracy Sensitivity Specificity PPV NPV AZ

75.00% 55.17% 79.83% 40.00% 87.96% 0.6620

CONCLUSIONS This study proposed a CAD system to be used with 3D US imaging for discriminating grades of breast cancer lesions. The textural, morphological, and ellipsoid fitting features were quantified from US images and were used to present the characteristics of the tumors. Then a SVM classifier was applied to distinguish low-grade and high-grade tumors using these features as input. The future works include improvement of the model sensitivity. REFERENCES Aho, M., A. Irshad, S. J. Ackerman, M. Lewis, R. Leddy, T. L. Pope, A. S. Campbell, A. Cluver, B. J. Wolf, and J. E. Cunningham. 2013. Correlation of sonographic features of invasive ductal mammary carcinoma with age, tumor grade, and hormone-receptor status. Journal of Clinical Ultrasound 41(1):10-17. American College of Radiology. 2003. Breast imaging reporting and data system (BI-RADS) Ultrasound. 1st ed. Reston, Virginia: American College of Radiology. Ben-Hur, A., and J. Weston. 2010. A user's guide to support vector machines. Methods in molecular biology 609:223-239. Bribiesca, E. 2008. An easy measure of compactness for 2D and 3D shapes. Pattern Recognition 41(2):543-554. Chang, C. C., and C. J. Lin. 2011. LIBSVM: A Library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2:27:1-27:27. Chang, Y. C., Y. H. Huang, C. S. Huang, and R. F. Chang. 2012. Vascular Morphology and Tortuosity Analysis of Breast Tumor Inside and Outside Contour by 3-D Power Doppler Ultrasound. Ultrasound in Medicine and Biology 38(11):1859-1869. Chen, W., M. L. Giger, H. Li, U. Bick, and G. M. Newstead. 2007. Volumetric texture analysis of breast lesions on contrast-enhanced magnetic resonance images. Magnetic Resonance in Medicine 58(3):562-571. Chen, C. Y., H. J. Chiou, S. Y. Chou, S. Y. Chiou, H. K. Wang, Y. H. Chou, and H. K. Chiang. 2009. Computer-Aided Diagnosis of Soft-Tissue Tumors Using Sonographic Morphologic and Texture Features. Academic Radiology 16(12):1531-1538. Chen, D. R., Y. L. Huang, and S. H. Lin. 2011. Computer-aided diagnosis with textural

364

features for breast lesions in sonograms. Computerized Medical Imaging and Graphics 35(3):220-226. Elston, C. W., and I. O. Ellis. 1991. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: Experience from a large study with long-term follow-up. Histopathology 19(5):403-410. Elston, C. W. 2005. Classification and grading of invasive breast carcinoma. Verhandlungen der Deutschen Gesellschaft für Pathologie 89:35-44. Genestie, C., B. Zafrani, B. Asselain, A. Fourquet, S. Rozan, P. Validire, A. Vincent-Salomon, and X. Sastre-Garau. 1998. Comparison of the prognostic value of Scarff-Bloom-Richardson and Nottingham histological grades in a series of 825 cases of breast cancer: Major importance of the mitotic count as a component of both grading systems. Anticancer Research 18(1 B):571-576. Hanley, J. A., and B. J. McNeil. 1982. The Meaning and Use of the Area under a Receiver Operating Characteristic (ROC) Curve. Radiology 143(1):29-36. Haralick, R. M., K. Shanmugam, and I. Dinstein. 1973. Textural features for image classification. IEEE Transactions on Systems, Man and Cybernetics smc 3(6):610-621. Huang, Y. L., D. R. Chen, Y. R. Jiang, S. J. Kuo, H. K. Wu, and W. K. Moon. 2008. Computer-aided diagnosis using morphological features for classifying breast lesions on ultrasound. Ultrasound in Obstetrics and Gynecology 32(4):565-572. Jemal, A., R. Siegel, E. Ward, T. Murray, J. Xu, C. Smigal, and M. J. Thun. 2006. Cancer statistics, 2006. Ca-A Cancer Journal for Clinicians 56(2):106-130. Jobanputra, R., and D. A. Clausi. 2004. Texture analysis using Gaussian weighted grey level co-occurrence probabilities. 1st Canadian Conference on Computer and Robot Vision, pp. 51-57. Kim, S. H., B. K. Seo, J. Lee, S. J. Kim, K. R. Cho, K. Y. Lee, B. K. Je, H. Y. Kim, Y. S. Kim, and J. H. Lee. 2008. Correlation of ultrasound findings with histology, tumor grade, and biological markers in breast cancer. Acta Oncologica 47(8):1531-1538. Moon, W. K., Y. W. Shen, C. S. Huang, L. R. Chiang, and R. F. Chang. 2011. Computer-Aided Diagnosis for the Classification of Breast Masses in Automated Whole Breast Ultrasound Images. Ultrasound in Medicine and Biology 37(4):539-548. Osher, S., and J. A. Sethian. 1988. Fronts propagating with curvature-dependent speed: Algorithms based on Hamilton-Jacobi formulations. Journal of Computational Physics 79(1):12-49. Rakha, E. A., J. S. Reis-Filho, F. Baehner, D. J. Dabbs, T. Decker, V. Eusebi, S. B. Fox, S. Ichihara, J. Jacquemier, S. R. Lakhani, J. Palacios, A. L. Richardson, S. J. Schnitt, F. C. Schmitt, P. H. Tan, G. M. Tse, S. Badve, and I. O. Ellis. 2010. Breast cancer prognostic classification in the molecular era: The role of histological grade. Breast Cancer Research 12(4):207. Sethian, J. A. 1999. Level Set Methods and Fast Marching Methods : Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision, and Materials Science. Cambridge University Press. Wojcinski, S., N. Stefanidou, P. Hillemanns, and F. Degenhardt. 2013. The biology of malignant breast tumors has an impact on the presentation in ultrasound: An analysis of 315 cases. BMC Women's Health 13(1):47. Yushkevich, P. A., J. Piven, H. C. Hazlett, R. G. Smith, S. Ho, J. C. Gee, and G. Gerig. 2006. User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliability. NeuroImage 31(3):1116-1128.

365

developing a computer-aided diagnosis system for ...

May 23, 2014 - 135 Nanhsiao St., Changhua 500, Taiwan, R.O.C. ... routine clinical care at Changhua Christian Hospital (Changhua, Taiwan). .... Genestie, C., B. Zafrani, B. Asselain, A. Fourquet, S. Rozan, P. Validire, A. Vincent-Salomon,.

748KB Sizes 0 Downloads 206 Views

Recommend Documents

Developing A vendor Rating System & Quality manual for an ... - Groups
in the areas of Quality, Sourcing, Apparel Manufacturing, Pattern Making ,. Business Studies & Fashion Overview. Strengths. Mentors. Quality Management & ...

Expert System for the Diagnosis of Memory Loss ...
Abstract. The paper present an account of Case-based and Rule-based Expert System for Memory Loss Disease. It will initially discuss different approaches in designing of Medical Diagnosis Expert Systems with focus on all the information about the mem

a study on developing a monitoring system for the bimp ...
Product (GDP) per capita for Brunei was $50,100 (2009), which was the highest among the members of BIMP-EAGA. The country's economy was strong and stable and it had no external debt. The research found that there were no previous studies or any publi

DEVELOPING A COMMUNITY SUPPORT MODEL FOR TOURISM.pdf
DEVELOPING A COMMUNITY SUPPORT MODEL FOR TOURISM.pdf. DEVELOPING A COMMUNITY SUPPORT MODEL FOR TOURISM.pdf. Open. Extract.

Developing a Framework for Decomposing ...
Nov 2, 2012 - with higher prevalence and increases in medical care service prices being the key drivers of ... ket, which is an economically important segmento accounting for more enrollees than ..... that developed the grouper software.

Developing a Framework for Evaluating Organizational Information ...
Mar 6, 2007 - Purpose, Mechanism, and Domain of Information Security . ...... Further, they argue that the free market will not force products and ...... Page 100 ...

For Developing Countries - GitHub
Adhere to perceived affordance of a mobile phone to avoid alienating the user. Target users are unfamiliar to banking terminology. Balance Check. Shows current balance in m-banking account. Top Up. Scratch card or merchant credit transfer. Credit Tra

A Novel Design for a Biogas Generator in Developing Countries
to present a viable alternative energy source. ... ii. Acknowledgments. I am very grateful to my supervisor, Professor Shaker Meguid, for agreeing to supervise.

Relevance and Application: Developing a Curriculum for a World ...
Relevance and Application: Developing a Curriculum for a World Literature Course.pdf. Relevance and Application: Developing a Curriculum for a World ...

System and method for protecting a computer system from malicious ...
Nov 7, 2010 - so often in order to take advantage of neW virus detection techniques (e. g. .... and wireless Personal Communications Systems (PCS) devices ...

System and method for protecting a computer system from malicious ...
Nov 7, 2010 - ABSTRACT. In a computer system, a ?rst electronic data processor is .... 2005/0240810 A1 10/2005 Safford et al. 6,505,300 ... 6,633,963 B1 10/2003 Ellison et a1' ...... top computers, laptop computers, hand-held computers,.

pdf-1877\diagnosis-schizophrenia-a-comprehensive-resource-for ...
... the apps below to open or edit this item. pdf-1877\diagnosis-schizophrenia-a-comprehensive-res ... for-consumers-families-and-helping-professionals.pdf.

EEG - A test for diagnosis or misdiagnosis.pdf
EEG - A test for diagnosis or misdiagnosis.pdf. EEG - A test for diagnosis or misdiagnosis.pdf. Open. Extract. Open with. Sign In. Details. Comments. General Info.

A Speaker Count System for Telephone Conversations
The applications of this research include three-way call detection and speaker tracking, and could be extended to speaker change-point detection and indexing.

A Recommender-System for Telecommunications ...
telecoms network management system using a. Recommender System. This Recommender System approach has the advantage that the network management system will learn the human expert's approaches, preferences and knowledge over time and evolve the recomme

A System for Predictive Writing
3.1 Server-side design. We implemented a ... was to generate a database of n-grams (for n = 1, . . . , 12) with a plausible coverage on texts but at the ... Engine Project Wiki: http://www.sketchengine.co.uk/documentation/wiki/SkE/NGrams. 4.

A Speaker Count System for Telephone Conversations
3, which shows the intra- and inter-speaker Mahalanobis distance distributions (1000 comparisons for each class) for 5- segment based models, suggests some ...

pdf-1853\emancipating-the-banking-system-and-developing ...
... the apps below to open or edit this item. pdf-1853\emancipating-the-banking-system-and-developing-markets-for-government-debt-cornell-east-asia.pdf.

Developing for iOS.pdf
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Developing for iOS.pdf. Developing for iOS.pdf. Open. Extract.