Adaptive spectral window sizes for feature extraction from optical spectra

Chih-Wen Kan1, Andy Y. Lee1, Nhi Pham1, Linda T. Nieman2, Konstantin Sokolov2, and Mia K. Markey1 1 The

University of Texas at Austin 2 The University of Texas MD Anderson Cancer Center

Introduction  Cancer is a major health problem  Early diagnosis of cancer is critical for cure  Invasive biopsy is currently the way to assess epithelial pre-cancers  Optical technologies hold great promise for noninvasive and real-time detection of pre-cancers

The University of Texas Department of Biomedical Engineering

2

Introduction  Although optical spectroscopy captures useful information, the spectral response variations between patients still make it difficult for clinicians to visually assess the likelihood of disease  Software tools are needed to help clinicians make diagnostic decisions

The University of Texas Department of Biomedical Engineering

3

Introduction  Three major categories of feature extraction: – principal component analysis (PCA) – spectral feature extraction – model-based feature extraction

 In spectral feature extraction, previous studies used windowing techniques to extract features from smaller regions of the spectrum  Choice of spectral region for feature extraction would make difference in the performance of the extracted features The University of Texas Department of Biomedical Engineering

4

Introduction

The University of Texas Department of Biomedical Engineering

5

Introduction

The University of Texas Department of Biomedical Engineering

6

Introduction  We propose an approach to adaptively adjust the spectral window sizes for feature extraction from optical spectra.  We hypothesize that by adaptively adjusting the spectral window sizes, the trends in the data will be captured more accurately.

The University of Texas Department of Biomedical Engineering

7

Materials  Used a diffuse reflectance spectroscopy dataset obtained in a study of oblique polarization reflectance spectroscopy of oral mucosa lesions*  Data collected at The University of Texas M. D. Anderson Cancer Center (UT MDACC)  We measured a total of 57 sites – – – –

22 were Normal 13 were Benign 12 were mild dysplasia (MD) 10 were high grade dysplasia or carcinoma (SD)

*L. Nieman, C. W. Kan, A. Gillenwater, M. Markey, and K. Sokolov, “Probing the local tissue changes in the oral cavity for the early detection of cancer using oblique polarized reflectance spectroscopy: a pilot clinical trial,” Journal of Biomedical Optics (in press) , 2007 The University of Texas Department of Biomedical Engineering

8

Algorithm 1. Set the starting point of the 1st window to be the smallest wavelength in the spectrum 2. Initial window size = 5 nm 3. Iteratively increase window size by 5 nm, perform simple linear regression, and obtain R2 4. Repeat step 3 until R2<0.8 5. End the current window and the new window starts 6. Repeat steps 2-5 across the entire spectrum

The University of Texas Department of Biomedical Engineering

9

Piecewise linear regression models

Figure. Sample diffuse reflectance spectra for a normal patient (top) and a SD patient (bottom). x: wavelength (nm), y: intensity. The University of Texas Department of Biomedical Engineering

10

Feature extraction  Three methods are compared: 1. No windowing 2. Fixed window size of 20 nm 3. Adaptive spectral window sizes

 Nine features extracted in each spectral window:

The University of Texas Department of Biomedical Engineering

11

Wavelength selection  Select one wavelength that is the most discriminatory  A two-class Linear Discriminant Analysis (LDA) classifier, performed with leave-one-out cross validation, is used to combine the nine features  The area under the Receiver Operating Characteristic curve is used as the evaluation metric for the diagnostic power of the wavelength

The University of Texas Department of Biomedical Engineering

12

Performance Evaluation  In order to investigate how features work in combination, we perform a wrapper method of an exhaustive search for feature combinations (291=511 combinations)  Leave-one-out cross validation was employed to train and test all LDA models

The University of Texas Department of Biomedical Engineering

13

Performance Evaluation

Figure. AUC distribution for Normal vs. MD+SD

Figure. AUC distribution for Normal vs. MD The University of Texas Department of Biomedical Engineering

14

Performance Evaluation

Figure. AUC distribution for Benign vs. MD+SD

The University of Texas Department of Biomedical Engineering

15

Feature Performance  Signal energy perform significantly better in adaptive spectral windows than in other two windowing techniques

Figure. Average area under curves of (1) all feature combinations, (2) ones that include signal,(3)ones that do not include signal energy

The University of Texas Department of Biomedical Engineering

16

Conclusions  The objective is not to fit the measured spectra, but to define spectral regions to perform analyses on  In general, adaptive spectral window size out-performs the use of 20 nm spectral windows and the use of one large window  The use of 20 nm spectral windows outperforms the use of one large window Table. Maximum area under curves for Normal vs. MD+SD

AUC

No windowing used

Fixed window sizes

Adaptive window sizes

0.64

0.71

0.84

The University of Texas Department of Biomedical Engineering

17

 Acknowledgments – Bryan Jiang – Arjun Ramachandran – Shalini Gupta – – – – – –

Rana Jahanbin Hyunjin Shin Anthony Stuckey Kort Travis Jesse Aaron Wei Shi Tsai

The University of Texas Department of Biomedical Engineering

18

Adaptive spectral window sizes for feature extraction ...

the spectral window sizes, the trends in the data will be ... Set the starting point of the 1st window to be the smallest ... The area under the Receiver Operating.

393KB Sizes 2 Downloads 348 Views

Recommend Documents

No documents