Original Research

ajog.org

GYNECOLOGY

Validation of a second-generation multivariate index assay for malignancy risk of adnexal masses Robert L. Coleman, MD; Thomas J. Herzog, MD; Daniel W. Chan, PhD; Donald G. Munroe, PhD; Todd C. Pappas, PhD; Alan Smith, MS; Zhen Zhang, PhD; Judith Wolf, MD

BACKGROUND: Women with adnexal mass suspected of ovarian

malignancy are likely to benefit from consultation with a gynecologic oncologist, but imaging and biomarker tools to ensure this referral show low sensitivity and may miss cancer at critical stages. OBJECTIVE: The multivariate index assay (MIA) was designed to improve the detection of ovarian cancer among women undergoing surgery for a pelvic mass. To improve the prediction of benign masses, we undertook the redesign and validation of a second-generation MIA (MIA2G). STUDY DESIGN: MIA2G was developed using banked serum samples from a previously published prospective, multisite registry of patients who underwent surgery to remove an adnexal mass. Clinical validity was then established using banked serum samples from the OVA500 trial, a second prospective cohort of adnexal surgery patients. Based on the final pathology results of the OVA500 trial, this intended-use population for MIA2G testing was high risk, with an observed cancer prevalence of 18.7% (92/ 493). Coded samples were assayed for MIA2G biomarkers by an external clinical laboratory. Then MIA2G results were calculated and submitted to a clinical statistics contract organization for decoding and comparison to MIA results for each subject. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated, among other

Introduction The number of women diagnosed annually with adnexal mass far exceeds the number of ovarian cancer cases, posing a serious clinical challenge to accurately identify the subgroup of patients most likely to benefit from consultation with a gynecologic oncologist. Although numerous prediction models and referral guidelines have been tested in the preoperative evaluation of the adnexal mass, no single method has received widespread acceptance.1-3 In addition, numerous studies indicate that

Cite this article as: Coleman RL, Herzog TJ, Chan DW, et al. Validation of a second-generation multivariate index assay for malignancy risk of adnexal masses. Am J Obstet Gynecol 2016;215:82.e1-11. 0002-9378 ª 2016 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http:// creativecommons.org/licenses/by-nc-nd/4.0/). http://dx.doi.org/10.1016/j.ajog.2016.03.003

measures, and stratified by menopausal status, stage, and histologic subtype. RESULTS: Three MIA markers (cancer antigen 125, transferrin, and apolipoprotein A-1) and 2 new biomarkers (follicle-stimulating hormone and human epididymis protein 4) were included in MIA2G. A single cut-off separated high and low risk of malignancy regardless of patient menopausal status, eliminating potential for confusion or error. MIA2G specificity (69%, 277/401 [n/N]; 95% confidence interval [CI], 64.4e73.4%) and PPV (40%, 84/208; 95% CI, 33.9e47.2%) were significantly improved over MIA (specificity, 54%, 215/401; 95% CI, 48.7e58.4%, and PPV, 31%, 85/271; 95% CI, 26.1e37.1%, respectively) in this cohort. Sensitivity and NPV were not significantly different between the 2 tests. When combined with physician assessment, MIA2G correctly identified 75% of the malignancies missed by physician assessment alone. CONCLUSION: MIA2G specificity and PPV were significantly improved compared with MIA, while sensitivity and NPV were unchanged. The second-generation test significantly improved the predicted efficiency of triage vs MIA without sacrificing high sensitivity and NPV, which are essential for effectiveness. Key words: diagnostic, ovarian cancer, protein, referral, serum

the majority of new ovarian cancer cases fail to be appropriately referred or treated at the time of first surgery, with significant detrimental effects on 5-year survival.4-6 To improve the presurgical detection of ovarian cancer among women undergoing removal of adnexal masses, the multivariate index assay (MIA) OVA1 was developed. The test was cleared by the Food and Drug Administration (FDA) in 2009 for presurgical risk assessment limited to cases where adnexal excision is warranted, and validated in 2 previously published clinical trials.7,8 The MIA combined the results of the biomarker concentrations from the Siemens BNII assays (Siemens, Malvern, PA) for apolipoprotein A-1, transthyretin, beta-2microglobulin, and transferrin (TRF) and the Elecsys assay for cancer antigen 125 (CA125-II) (Roche, Indianapolis, IN). In an intended-use clinical cohort (adnexal surgery patients enrolled from nongynecologic oncology practices), MIA

82.e1 American Journal of Obstetrics & Gynecology JULY 2016

showed significantly higher sensitivity for predicting malignancy compared with clinical impression, CA125-II, or modified American Congress of Obstetricians and Gynecologists (ACOG) criteria of Dearking et al,9 and negative predictive value (NPV) ranging from 95-98%. Limitations of the MIA assay, however, included a less than ideal specificity of 43-50%, as a consequence of a high falsepositive rate resulting in a positive predictive value (PPV) of 30-42% in the cohorts examined.7,10 These results predict that many patients with benign masses may be classified as high risk, reducing overall triage effectiveness. The current study was undertaken to evaluate the clinical validity of a secondgeneration MIA (MIA2G), in which the MIA panel was redesigned to improve specificity and PPV of the assay while maintaining high sensitivity and NPV. Herein we report that the MIA2G assay improves both specificity and PPV relative to MIA without compromising

ajog.org sensitivity, NPV, or detection of earlystage ovarian cancer.

Materials and Methods The MIA2G algorithm was derived from samples from an intended use cohort described by Ueland et al.7 This proprietary algorithm was derived from the serum proteins apolipoprotein A-1, CA125-II, human epididymis protein 4 (HE4), follicle-stimulating hormone (FSH), and TRF using methods described by the coauthors.11-13 For the validation study, archived serum samples from an independent prospectively collected set of specimense the OVA500 studyewere used.8 This study cohort had the same enrollment criteria as the OVA1 pivotal study7 and had been previously evaluated for the firstgeneration serum biomarker MIA as part of an independent verification of MIA. Consecutive patients who met inclusion criteria were prospectively enrolled from 27 sites throughout the United States, all of which had institutional review board approval. All enrolling clinicians were from nongynecologic oncology specialty practices, although patients may have had consultation with or undergone surgery by a gynecologic oncologist. Inclusion criteria were: women age 18 years, signed informed consent, agreeable to phlebotomy, and documented pelvic mass planned for surgical intervention within 3 months of imaging. A pelvic mass was confirmed by imaging (computed tomography, ultrasonography, or magnetic resonance imaging) prior to enrollment. Exclusion criteria included a diagnosis of malignancy in the previous 5 years (except of nonmelanoma skin cancers) or enrollment by a gynecologic oncologist. Menopause was defined as the absence of menses for 12 months or age 50 years. Demographic and clinicopathologic information was collected on case report forms. A preoperative blood sample of 80 mL was processed within 1-6 hours of collection, and serum was frozen at the collection site. Serum samples were shipped on dry ice to an archive site (PrecisionMed Inc, Solana Beach, CA) where they were thawed and aliquoted, then frozen and stored at e65 to

GYNECOLOGY e85 C. All aliquots were thawed only once and consumed entirely during testing, so that no sample had undergone >2 or <2 freeze-thaw cycles. Serum biomarker concentrations were determined on the Roche cobas 6000 clinical analyzer, utilizing the c501 and e601 modules. The c501 module is a medium throughput (up to 600 samples/h), photometric detection module used for clinical chemistry applications, homogenous immunoassays, and whole blood measurement. The e601 module is a medium throughput electrochemiluminescent detection module used for heterogeneous immunoassays. Biomarker assays were run according to the manufacturer’s instructions. All measurements were performed on coded samples (blinded as to patient demographics or pathology outcome) at the Clinical Laboratory Improvement Amendmentse/College of American Pathologistsecertified laboratory of the Division of Clinical Chemistry, Department of Pathology, Johns Hopkins Medical Institution. The MIA2G combines the results of the biomarker concentrations from the cobas assays for apolipoprotein A-1, CA125-II, HE4, FSH, and TRF. Assays for apolipoprotein A-1 and TRF are immunoturbidimetric assays; CA125-II, HE4, and FSH assays use electrochemiluminescent detection. Package inserts for these assays indicate a maximum coefficient of variation of 1.1-2.8% for repeatability and 2.5%-4.5% for intermediate precision for serum samples on these individual biomarker assays. The MIA2G risk score was calculated using software (OvaCalc, Version 4.0.0, Vermillion, Inc., Austin, TX) that uses the 5 biomarker values and a proprietary algorithm to return a dimensionless numerical score from 0.0-10.0. While MIA was optimized using different cutoffs of 5.0 (premenopausal) and 4.4 (postmenopausal) to separate higherfrom lower-risk subjects, MIA2G uses a single risk cut-off of 5.0 regardless of menopausal status). In the original OVA500 trial, clinicians were required to document the results of physical examination, family history, imaging, laboratory tests, and

Original Research

formal presurgical assessment of malignancy. In cases where a formal assessment was done by a clinician, other than the enrolling physician, the referral history and the specialty of the clinician who made the prediction were recorded, as was the specialty of the surgeon who ultimately operated on each patient. To reflect routine clinical judgment and referral behavior, physicians were not asked to either follow any specific prediction algorithm or justify their prediction. The same prospectively obtained physician assessment (PA) prediction of malignancy was utilized in the present trial, to compare routine clinical assessment to MIA2G independently and in combination with PA. Postoperative pathology diagnosis was recorded at each enrolling site and independently reviewed. The coded sample MIA2G scores and biomarker values were submitted to an independent clinical statistics contract research organization, Applied Clinical Intelligence (Bala Cynwyd, PA), where they were matched by a subject identifier to the information from the case report form and used for statistical analyses. Clinical diagnostic performance criteria (sensitivity, specificity, NPV, and PPV) were calculated for MIA2G alone, PA alone, and MIA2G þ PA. Triage decision rules followed those previously published for MIA.7,8 The combined test result was declared positive when the patient had either a highrisk MIA2G score or the presurgical PA predicted a malignancy. Accordingly, MIA2G þ PA was scored negative only when both MIA2G and PA predicted a benign outcome. A subset of the OVA500 cohort meeting study inclusion/exclusion criteria had MIA scores generated as part of a previously published study,8 and these data were used for comparison (N ¼ 493). Statistical analyses were performed with software (SAS, Version 9.2 or later; SAS Institute Inc, Cary, NC). Sensitivity was defined as: (test positives/all subjects with identified malignancy)  100. Specificity was defined as: (test negatives/all subjects without identified malignancy)  100. PPV was defined as: (true test positives/all test positive

JULY 2016 American Journal of Obstetrics & Gynecology

82.e2

Original Research

ajog.org

GYNECOLOGY

subjects)  100. NPV was defined as: (true test negatives/all test negative subjects)  100. Overall accuracy was defined as: (true test positives þ true test negatives)/all subjects)  100. These

clinical performance measures are presented as a percentage score, followed by the numbers of subjects that define the measure, followed by the 95% confidence interval (CI) of the measure.

Wilson14 score-corrected 95% CI was used throughout to provide better estimates for smaller subgroups. McNemar test was used to test the marginal homogeneity of the proportions of true

TABLE 1

Demographics and pathology of OVA500 clinical cohort Evaluable subjects All enrolled subjects N ¼ 519

All evaluable subjects N ¼ 493

Premenopausal women N ¼ 276

Postmenopausal women N ¼ 217

519

493

276

217

Age, y N Mean (SD)

48.4 (14.32)

48.6 (14.16)

39.5 (8.96)

60.2 (10.74)

Median

47

48

41

60

Range (minimum, maximum)

18, 87

18, 87

18, 60

33, 87

Ethnicity/race, n (%) Asian

13 (2.5)

13 (2.6)

8 (2.9)

5 (2.3)

Black or African American

86 (16.6)

81 (16.4)

54 (19.6)

27 (12.4)

1 (0.2)

1 (0.2)

1 (0.4)

0 (0.0)

365 (70.3)

347 (70.4)

173 (62.7)

174 (80.2)

Native Hawaiian/Pacific islander White Other

5 (1.0)

5 (1.0)

4 (1.4)

1 (0.5)

49 (9.4)

46 (9.3)

36 (13.0)

10 (4.6)

Yes

511 (98.5)

493 (100.0)

276 (100.0)

217 (100.0)

No

8 (1.5)

0 (0.0)

0 (0.0)

0 (0.0)

Hispanic or Latino Surgery performed, n (%)

Time to surgery, wk N Mean (SD)

511 2.1 (2.19)

493 2.0 (1.72)

276 1.9 (1.68)

217 2.1 (1.76)

Median

1

1

1

2

Range (minimum, maximum)

0, 24

0, 11

0, 10

0, 11

Obstetrics/gynecology

212 (40.8)

204 (41.4)

144 (52.2)

60 (27.6)

Gynecological oncology

299 (57.6)

289 (58.6)

132 (47.8)

157 (72.4)

415 (80.0)

401 (81.3)

245 (88.8)

156 (71.9)

Nonovarian primary malignancy with no involvement of ovaries

4 (0.8)

4 (0.8)

1 (0.4)

3 (1.4)

Nonovarian primary malignancy with involvement of ovaries

6 (1.2)

6 (1.2)

2 (0.7)

4 (1.8)

Low malignant potential (borderline)

17 (3.3)

17 (3.4)

5 (1.8)

12 (5.5)

Primary malignant ovarian malignancy

69 (13.3)

65 (13.2)

23 (8.3)

42 (19.4)

Specialty of surgeon, n (%)

Pathology diagnosis, n (%) Benign ovarian tumor

Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

82.e3 American Journal of Obstetrics & Gynecology JULY 2016

(continued)

ajog.org

GYNECOLOGY

Original Research

TABLE 1

Demographics and pathology of OVA500 clinical cohort (continued) Evaluable subjects All enrolled subjects N ¼ 519

All evaluable subjects N ¼ 493

Premenopausal women N ¼ 276

Postmenopausal women N ¼ 217

If malignant ovarian malignancy: predominant histology, n (%) Epithelial: serous

24 (4.6)

24 (4.9)

8 (2.9)

16 (7.4)

Epithelial: mucinous

12 (2.3)

9 (1.8)

1 (0.4)

8 (3.7)

Epithelial: endometrioid

13 (2.5)

13 (2.6)

5 (1.8)

8 (3.7)

Epithelial: clear cell

5 (1.0)

5 (1.0)

1 (0.4)

4 (1.8)

Epithelial: carcinosarcoma

1 (0.2)

1 (0.2)

1 (0.4)

0 (0.0)

Epithelial: mixed

2 (0.4)

1 (0.2)

0 (0.0)

1 (0.5)

12 (2.3)

12 (2.4)

7 (2.5)

5 (2.3)

69

65

Other Tumor grade, n (%) N

23

42

1

9 (13.0)

9 (13.8)

2

12 (17.4)

10 (15.4)

4 (17.4)

6 (14.3)

3

43 (62.3)

41 (63.1)

15 (65.2)

26 (61.9)

5 (7.2)

5 (7.7)

3 (13.0)

2 (4.8)

Not graded

1 (4.3)

8 (19.0)

Tumor stage, n (%) N

69

65

I

30 (43.5)

28 (43.1)

23 9 (39.1)

19 (45.2)

42

II

7 (10.1)

7 (10.8)

2 (8.7)

5 (11.9)

III

26 (37.7)

25 (38.5)

10 (43.5)

15 (35.7)

IV

6 (8.7)

5 (7.7)

2 (8.7)

3 (7.1)

Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

positives (negatives) as identified by pairs of diagnostic tests under consideration. Additionally, statistics included CI for the differences between the risk indices taking into account the correlated nature of the indices,15 receiver operator characteristic (ROC) curve, ROC area under the curve (AUC), and their corresponding CIs. The P value from a test of the equality of the areas under the empirical ROC curves is presented (using methods indicated by DeLong et al16 as implemented in SAS, Version 9.4 software). Differences in ratios were considered significant if the lower bound of the CI for a difference comparison was >0 or the lower bound for a ratio comparison was >1. A total of approximately 500 evaluable subjects in the validation subset with an

assumed prevalence of 20% would provide 95% 2-tailed CIs for estimates of sensitivity (%) and specificity (%) within 7% and 5% (absolute), respectively, using the defined cut-off value and assuming comparable levels of sensitivity and specificity seen in previous studies. Under the same assumption, the 95% 2-tailed CI would be at 4.0% for PPV(%) and 2.0% for NPV(%).

Results This study followed a Prospective Specimen Collection Retrospective Blinded Evaluation design17 using clinically annotated sera collected for the OVA500 trial. From August 2010 through December 2011 a total of 520 subjects were consecutively enrolled, all of whom

provided a specimen. One subject was found to have been enrolled twice, leaving 519 for analysis. Subjects were excluded from the final analysis for: failed exclusion criteria (imaging outside of window prior to inclusion, surgery >12 weeks, previous cancer <5 years, n ¼ 12), primary contact was a gynecologic oncologist (n ¼ 6), and no ovarian pathology (n ¼ 8). The remaining 493 fully evaluable patients were scored after MIA2G testing. The demographic characteristics of the subject cohort are presented in Table 1. All 493 subjects had a nongynecologic oncologist as their primary contact. The specialty of physicians making the clinical assessment was a nongynecologic oncologist in 249 patients and a gynecologic oncologist in 244 patients.

JULY 2016 American Journal of Obstetrics & Gynecology

82.e4

Clinical performance of second-generation multivariate index assay, presurgical physician assessment, and combination for all evaluable subjects and by menopausal status Evaluable subjects N ¼ 493 Sensitivity %

Premenopausal women N ¼ 276

Postmenopausal women N ¼ 217

MIA2G

PA

MIA2G þ PA

MIA2G

PA

MIA2G þ PA

MIA2G

PA

MIA2G þ PA

73.9

93.5

90.3

74.2

90.3

91.8

73.8

95.1

84/92

68/92

86/92

28/31

23/31

28/31

56/61

45/61

58/61

95% CI

83.8e95.5

64.1e81.8

86.5e97.0

75.1e96.7

56.8e86.3

75.1e96.7

82.2e96.4

61.6e83.2

86.5e98.3

69.1

92.8

64.8

71.4

93.9

67.3

65.4

91.0

60.9

n/N

277/401

372/401

260/401

175/245

230/245

165/245

102/156

142/156

95/156

95% CI

64.4e73.4

89.8e94.9

60.0e69.4

65.5e76.7

90.1e96.3

61.2e72.9

57.6e72.4

85.5e94.6

53.1e68.2

40.4

70.1

37.9

28.6

60.5

25.9

50.9

76.3

48.7

n/N

84/208

68/97

86/227

28/98

23/38

28/108

56/110

45/59

58/119

95% CI

33.9e47.2

60.4e78.3

31.8e44.3

20.6e38.2

44.7e74.4

18.6e34.9

41.7e60.1

64.0e85.3

39.9e57.6

Specificity %

Positive predictive value, %

Negative predictive value, %

97.2

93.9

97.7

98.3

96.6

98.2

95.3

89.9

96.9

n/N

277/285

372/396

260/266

175/178

230/238

165/168

102/107

142/158

95/98

95% CI

94.6e98.6

91.1e95.9

95.2e99.0

95.2e99.4

93.5e98.3

94.9e99.4

89.5e98.0

84.2e93.7

91.4e99.0

2.953

10.220

2.658

3.161

12.118

2.766

2.652

8.220

2.432

Positive likelihood ratio 95% CI

2.518e3.463

7.053e14.811

2.303e3.069

2.514e3.975

7.115e20.639

2.234e3.425

2.111e3.332

4.879e13.850

1.983e2.982

0.126

0.281

0.101

0.135

0.275

0.144

0.125

0.288

0.081

0.065e0.245

0.199e0.397

0.046e0.219

0.046e0.398

0.151e0.500

0.049e0.423

0.054e0.293

0.189e0.440

0.027e0.245

Pretest odds of ovarian malignancy

0.23:1

0.23:1

0.23:1

0.13:1

0.13:1

0.13:1

0.39:1

0.39:1

0.39:1

Posttest odds of ovarian malignancy with high-risk score

0.68:1

2.34:1

0.61:1

0.40:1

1.53:1

0.35:1

1.04:1

3.21:1

0.95:1

Posttest odds of no ovarian malignancy with low-risk score

34.63:1

15.50:1

43.33:1

58.33:1

28.75:1

55.00:1

20.40:1

8.88:1

31.67:1

Negative likelihood ratio 95% CI

GYNECOLOGY

91.3

n/N

Original Research

82.e5 American Journal of Obstetrics & Gynecology JULY 2016

TABLE 2

Positive combined test result is when woman has high-risk index score or presurgical PA was malignant. When woman has both low-risk index score and benign presurgical PA, combined test result is negative. CI, confidence interval; MIA2G, second-generation multivariate index assay; PA, physician assessment. Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

ajog.org

ajog.org

GYNECOLOGY

Original Research

FIGURE 1

Performance of MIA2G alone and with PA

Receiver operator characteristic (ROC) curves of MIA2G: A, for all evaluable women; B, for premenopausal women, and C, for postmenopausal women comparing MIA2G alone (blue lines) and MIA2G þ PA (red lines). ROC curves were obtained from logistic regression in prediction of malignancy. Mosaic plot of clinical performance of: D to F, MIA2G alone; and G to I, in combination with PA. Mosaic plots are presented for: D and G, all evaluable women; E and H, premenopausal women; and F and I, postmenopausal women. Mosaic plots show comparisons of relative numbers of patients in each category of high or low MIA2G risk and whether patients with risk score were malignant or benign. Sn, sensitivity; Sp, specificity. Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

The diagnosis from pathology for all subjects and for subjects stratified by menopausal status is also presented in Table 1. Overall, the prevalence of pelvic malignancy among evaluable patients was 19% (92/493). Of these, 65 (13%) were invasive primary ovarian malignancies and 17 (3%) were low malignant potential (borderline) tumors. Documented pelvic malignancies also

included 6 malignancies metastatic to the ovaries (1%) and 4 nonovarian malignancies (1%). The overall prevalence in premenopausal subjects was 11% while the prevalence in postmenopausal subjects was 28%. The primary ovarian malignancies were assigned subtype, stage, and grade information, where possible. Eleven (48%) of the invasive ovarian cancers in

premenopausal women were found to be early stage (I/II) and 24 (57%) of the invasive ovarian cancers in postmenopausal women were early stage. In all, 401 patients (81%) had benign conditions. The performance of MIA2G in predicting malignancy both alone and in combination with PA is shown in Table 2. In summary, MIA2G alone

JULY 2016 American Journal of Obstetrics & Gynecology

82.e6

Original Research

ajog.org

GYNECOLOGY

TABLE 3

Performance of second-generation multivariate index assay by histological subtype of malignancy and by stage of primary ovarian malignancy for all evaluable subjects and stratified by menopausal status Evaluable subjects N ¼ 493 %

Premenopausal women N ¼ 276

Postmenopausal women N ¼ 217 %

n/N

95% CI

%

n/N

95% CI

n/N

95% CI 81.0e97.5

Histological subtype EOC

95.0

57/60

86.3e98.3

100.0

18/18

82.4e100.0

92.9

39/42

Non-EOC malignancies

80.0

4/5

37.6e96.4

80.0

4/5

37.6e96.4

e

0/0

Low malignant potential

82.4

14/17

59.0e93.8

80.0

4/5

37.6e96.4

83.3

10/12

55.2e95.3

100.0

6/6

61.0e100.0

100.0

2/2

34.2e100.0

100.0

4/4

51.0e100.0

75.0

3/4

30.1e95.4

0

0/1

0.0e79.3

100.0

3/3

43.9e100.0

I

85.7

24/28

68.5e94.3

88.9

8/9

56.5e98.0

84.2

16/19

62.4e94.5

II

100.0

7/7

64.4e100.0

100.0

2/2

34.2e100.0

100.0

5/5

56.6e100.0

88.6

31/35

74.0e95.5

90.9

10/11

62.3e98.4

87.5

21/24

69.0e95.7

III

100.0

25/25

86.7e100.0

100.0

10/10

72.2e100.0

100.0

15/15

79.6e100.0

IV

100.0

5/5

56.6e100.0

100.0

2/2

34.2e100.0

100.0

3/3

All late stage (III or IV)

100.0

30/30

88.6e100.0

100.0

12/12

75.8e100.0

100.0

18/18

Metastatic to ovaries Other nonovarian malignancies

e

Stage of primary ovarian malignancy

All early stage (I or II)

43.9e100 82.4e100.0

CI, confidence interval; EOC, epithelial ovarian cancer. Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

showed a sensitivity of 91%; the combination of MIA2G þ PA showed a sensitivity of 94%. Adding MIA2G to PA correctly identified 75% of the malignancies missed by PA alone (P < .001 from McNemar test). The NPV of MIA2G þ PA was 98%. PA alone showed the best performance in correctly identifying benign disease. Figure 1 presents ROC curves for all evaluable patients and patients stratified by menopausal status for MIA2G alone or combined with PA. The ROC curves were derived from logistic regression of clinical data. The mosaic plots (Figure 1, D-I) depict the confusion matrix of highand low-risk test results vs malignant and benign patients. These plots show the performance of the test at the fixed cutoff of 5.0 for all evaluable women and both premenopausal and postmenopausal women. PA and MIA2G agreed on 72% of all cases. When PA was combined with MIA2G, sensitivity was increased by 2% overall, with a 4% reduction in specificity compared to MIA2G alone. Addition of PA had no effect on premenopausal sensitivity of

MIA2G (90%), but increased sensitivity from 92-95% in postmenopausal subjects. The specificity of MIA2G was slightly higher in premenopausal vs postmenopausal patients (71% vs 65%, respectively), and this difference was maintained with the addition of PA (67% vs 61%, respectively) (Table 2). The performance of MIA2G for histological subtype of ovarian cancer and for International Federation of Gynecology and Obstetrics stage (pre-January 1, 2014) of ovarian cancer, for all patients and stratified by menopausal subgroup, is summarized in Table 3. In summary, MIA2G showed 95% sensitivity for epithelial ovarian cancer; sensitivity was 100% for premenopausal women and 93% for postmenopausal women. MIA2G also demonstrated a sensitivity of 80% for nonepithelial ovarian cancer primary malignancies and 82% for low malignant potential tumors. MIA2G demonstrated 86% sensitivity for stage I ovarian cancers and 100% sensitivity for stage II ovarian cancers. Table 4 shows results comparing clinical performance of MIA2G and MIA

82.e7 American Journal of Obstetrics & Gynecology JULY 2016

for all evaluable subjects in this set of serum samples, as well as CA125-II levels that are consistent with clinical cut-off values in accordance with published ACOG referral criteria (200 U/mL for premenopausal women or >35 U/mL for postmenopausal women).3 The effect of substituting the modified ACOG criteria for premenopausal women 67 U/mL was also evaluated.9 In summary, MIA2G demonstrated nearly identical sensitivity to MIA and detected only 1 less malignancy overall. In contrast, MIA2G demonstrated significantly higher specificity compared to MIA and, because of improved test performance among women with benign conditions, the PPV of MIA2G was significantly higher than that of MIA, while the NPV remained the same. Comparison of the 2 assays is also presented in Figure 2, showing ROC curves for all evaluable women and for premenopausal and postmenopausal subgroups. Notably, MIA2G showed statistically better ROC AUC for all evaluable women (AUC of MIA2G, 0.924; 95% CI, 0.892e0.956; AUC of MIA, 0.862; 95% CI,

88.6e100.0 ACOG, American Congress of Obstetricians and Gynecologists; CA125-II, cancer antigen 125; CI, confidence interval; MIA, multivariate index assay; MIA2G, second-generation multivariate index assay. Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

Modified ACOG according to Dearking et al9 (2007).

64.1e90.0 28/35

30/30 100.0

80.0 52.0e81.4

83.3e99.4 29/30

24/35 68.6

96.7 83.3e99.4

77.6e97.0 32/35

29/30 96.7

91.4 74.0e95.5

88.6e100.0

31/35

30/30

88.6

100.0

Early stage (I or II)

Sensitivity for stage of primary ovarian malignancy

Late stage (III or IV)

15.0e85.0 2/4 50.0 15.0e85.0 2/4 50.0 51.0e100.0 4/4 Nonovarian malignancies

30.1e95.4 3/4 75

100.0

36.0e78.4

61.0e100.0 6/6

10/17 58.8

100.0 61.0e100.0

31.0e73.8 9/17

6/6 100.0

52.9 59.0e93.8

61.0e100.0 6/6

14/17 82.4 59.0e93.8

6/6

86.3e98.3

61.0e100.0

14/17 82.4

100.0 Metastatic to ovaries

Low malignant potential

37.6e96.4 4/5 Nonepithelial ovarian cancer

80.0

100.0

23.1e88.2 3/5 60.0 11.8e76.9 2/5 40.0 37.6e96.4 4/5

55/60 91.7 73.9e91.9 51/60 85.0 86.3e98.3 57/60 95.0 57/60 95.0 Epithelial ovarian cancer

Sensitivity for histological subtypes

Negative predictive value

94.6e98.6 277/285 97.2

80.0

81.9e96.4

92.5e97.1 326/342 95.3 91.8e96.3 378/400 94.5 93.6e98.5 215/222 96.8

42.4e58.2 76/151 50.3 65.6e82.9 70/93 75.3 26.1e37.1 85/271 33.9e47.7 84/208 40.4 Positive predictive value

Specificity

64.4e73.4 277/401

31.4

77.2e84.8 326/401 81.3 91.9e96.4 378/401 94.3 48.7e58.4 215/401

95% CI

69.1

53.5

76/92

n/N %

82.6 66.4e83.6

95% CI n/N

70/92 76.1

% 95% CI

85.1e96.3 85/92

n/N %

84/92 91.3

95% CI n/N %

Sensitivity

83.3e95.5

MIA

94.2

Modified ACOG CA125-II

73.6e89.0

GYNECOLOGY

MIA2G

Comparison of clinical performance of second-generation multivariate index assay (MIA), MIA (OVA1), cancer antigen 125, and modified American Congress of Obstetricians and Gynecologist criteria on OVA500 clinical cohort

TABLE 4

ajog.org

Original Research

0.814e0.911; P < .001) and for postmenopausal women (AUC of MIA2G, 0.919; 95% CI, 0.878e0.960; AUC of MIA, 0.855; 95% CI, 0.789e0.921; P < .01). There was no statistical difference in ROC AUC in premenopausal women (AUC of MIA2G, 0.921; 95% CI, 0.860e0.981; AUC of MIA, 0.899; 95% CI, 0.830e0.967; P > .1). Overall accuracy [(true positives þ true negatives)/all subjects]  100 was increased by 12% for MIA2G (95% CI, 7.6e17.1) and 11% for MIA2G þ PA (95% CI, 6.4e15.5%), when compared to MIA and MIA þ PA. This represented 61 subjects whose classification changed from inaccurate to accurate when MIA2G was substituted for MIA without PA and 54 whose classification changed from inaccurate to accurate when MIA2G was substituted for MIA including PA. The difference in accuracy between premenopausal and postmenopausal subjects decreased from 10% for MIA to <1% for MIA2G. Accuracy of MIA2G was 73% (95% CI, 66.5e78.3%) for postmenopausal subjects and 74% (95% CI, 68.0e78.4%) for premenopausal subjects. CA125-II alone showed a sensitivity of 76% (70/92; 95% CI, 66.4e83.6%) for all evaluable women, which stratified to 68% for premenopausal women (200 U/ mL cut-off) and 80% in postmenopausal women (35 U/mL cut-off). The specificity was 96% for premenopausal women and 92% for postmenopausal women. MIA2G detected 7 more earlystage primary ovarian cancers than CA125-II.

Comment To improve triage of adnexal mass scheduled for surgery, MIA2G design focused on these critical attributes: (1) high-sensitivity detection so as not to miss any cancers that should be referred to a specialist with a resulting high NPV for ruling out cancer given a low-risk result; (2) significantly increased specificity to provide fewer misclassifications of benign ovarian conditions, resulting in and increased PPV; and (3) harmonized performance at a single cut-off to eliminate the uncertainty in assessing risk across menopausal strata, and to

JULY 2016 American Journal of Obstetrics & Gynecology

82.e8

Original Research

GYNECOLOGY

ajog.org

FIGURE 2

ROC curves comparing MIA2G to MIA

Receiver operator characteristic curves comparing second-generation multivariate index assay (MIA) (blue lines) to MIA (red lines) for A, all evaluable patients and stratified by B, premenopausal and C, postmenopausal women. AUC, area under curve. Coleman et al. Validation of MIA2G for ovarian cancer. Am J Obstet Gynecol 2016.

eliminate the need to report menopausal status in patient testing. The samples used to validate MIA2G were prospectively collected from female patients scheduled to undergo surgery for an adnexal mass. Based on the final pathology results of the OVA500 trial, this study population was high risk and presented with an increased cancer prevalence. The results reported here show that MIA2G achieved these design goals by demonstrating equal or improved clinical performance when compared to MIA across a number of attributes (with or without PA, stratified by ovarian cancer stage, histological subtype, and premenopausal vs postmenopausal status. A high-risk MIA2G result, with a 40% PPV, correlated with a 2 in 5 chance of malignancy; while a low-risk result indicated a 98% chance of the mass being benign. Driving more malignant masses to the gynecologic oncologist and keeping more benign masses with the gynecologist should help improve the outcomes of women with ovarian cancer while saving health care dollars, as well as patient inconvenience and anxiety for those with benign masses. MIA2G showed >90% sensitivity both independently and in combination with PA. Of note is the finding that

MIA2G demonstrated sensitivity that was equivalent to or higher than other diagnostic tools used in triage of adnexal mass, including PA alone and CA125-II with various cut-offs. No significant change in NPV was found with MIA2G vs MIA, CA125-II, or PA alone. Overall, 1 fewer malignancy was classified as high risk by MIA2G. The 8 malignant masses classified as low risk by MIA2G (falsenegative test) included 1 metastatic nonovarian, 3 low malignant potential tumors, 3 stage I epithelial malignancy, and 1 stage I nonepithelial malignancy. Of these 8, 2 were classified as high risk by PA (both stage 1 epithelial), while the remaining 6 were classified as low risk by PA. PA failed to predict 24 of 92 malignancies, a false-negative rate of 26%. The false-negative rate of MIA2G alone was 9%, while MIA2G þ PA showed a falsenegative rate of 7%ea 75% reduction compared with PA. Early-stage cancer is particularly difficult to predict using current diagnostic tools and guidelines.18 The overall sensitivity of MIA2G for stage I primary ovarian cancers was not statistically different than for all ovarian cancer stages (Table 3). For both premenopausal and postmenopausal women the sensitivity for stage I and all early-stage cancer was similarly high, indicating

82.e9 American Journal of Obstetrics & Gynecology JULY 2016

sensitivity was retained across menopausal strata. The sensitivity for earlystage cancer in all subjects was further improved with the addition of PA (MIA2G þ PA to 93% for stage I [26/28; 95% CI, 77.4e98.0%] and 94% for all early-stage cancers [33/35; 95% CI, 81.4e98.4%], data not shown). These data indicated that MIA2G may be an effective tool in managing early-stage ovarian cancer, where intervention has been shown to be most effective. Fiveyear survival for stage I ovarian cancer is 90% and for stage II is 70%, while for stage III and IV is it 39% and 17%, respectively.19 All of the serous cancers in this study were high-grade malignancies and none were missed by MIA2G þ PA, whereas 1 was missed by MIA þ PA, consistent with retained sensitivity for this medically critical subtype. Overall, the results suggest that MIA2G, like MIA, offers high sensitivity with significant PA complementarity in the setting of adnexal malignancy. While high sensitivity and NPV is a diagnostic priority for presurgical management of ovarian cancer, the modest specificity and PPV of MIA potentially pose barriers to adoption and effective triage. Indeed, despite a favorable position statement on MIA issued by the

ajog.org Society of Gynecologic Oncology,20 other guidelines such as the National Comprehensive Cancer Network and ACOG have not followed. In the present study, specificity of MIA2G was increased over MIA by 15% without PA and 14% when combined with PA. As a result, the total number of patients identified as low risk with MIA2G alone was increased from 45-58%, a 13% increase. But in this cohort, the NPV was 97% for MIA2G and 97% for MIA. The results suggest that MIA2G may substantially increase the percentage of patients who may be treated locally by their generalist providers, while maintaining an equivalently low likelihood of malignancy among patients classified as low risk. Strengths of the present study include the comparison of MIA2G against MIA results for each subject from a large, prospectively collected and published, multiinstitutional trial of MIA; as well as the fact that MIA2G utilized industryleading platform and FDA-cleared biomarker assays in the algorithm design. These factors lend themselves to robust analytical and clinical performance compared with many biomarker clinical trials. In addition, biomarker testing and clinical statistics analysis were independently performed, and all ovarian tumor types were included in the statistical analysis of test performance. A potential limitation of this study is the lack of uniform preoperative evaluation for comparison. This study was designed to include only those patients who received a PA independent of the MIA score. This represented the most realistic clinical surrogate given that other referral algorithms are not uniformly followed in the United States. Additionally, all women enrolled in this trial were scheduled for surgery for an adnexal mass, a population with increased cancer prevalence compared with those in whom surgery is not planned. A population with lower cancer prevalence would decrease the test’s PPV and increase its NPV. A related limitation is the lack of standardized imaging criteria such as the International Ovarian Tumor Analysis protocol, used

GYNECOLOGY in some European nations.21 While standardized, expert imaging may be practical in some single-payer health care systems abroad, the present study evaluated incremental diagnostic performance against actual PA, inclusive of all assessment components. In addition a separate manuscript has been submitted for publication, using a simplified, retrospective imaging risk score, which was previously published for the MIA test.22 The present findings have important implications for ovarian cancer management at the very first medical intervention. Previous studies have shown that most new ovarian cancer cases are operated by low-volume physicians and clinics that lack the preparation, training, and knowledge to perform surgical staging, radical cytoreduction, and intraoperative chemotherapy.23-26 As a result, survival is statistically significantly compromised.25,27 Presurgical triage must carefully balance high sensitivity, early-stage detection, and high NPV against possible overreferral. The present MIA2G validation study demonstrates a significant increase in specificity and PPV over MIA, without sacrificing the sensitivity, NPV, PA complementarity, or detection of earlystage ovarian cancer. The clinical outcome of these results can improve the triage to the correct surgeon of adnexal masses deemed necessary for surgical removal by allowing gynecologists to keep benign masses within their practice and directing malignant masses to the gynecologic oncologist. n Acknowledgment Significant contributions to the collection, analyses and discussion of data in this manuscript were made by Lori Sokoll, PhD, Debra Elliott, Renu Dua, Phaedra Mohr, Tina Hudson, Brian Carpenter, and Marc Dantone.

References 1. Dodge JE, Covens AL, Lacchetti C, et al. Management of a suspicious adnexal mass: a clinical practice guideline. Curr Oncol 2012;19: e244-57. 2. American College of Obstetricians and Gynecologists. The role of the generalist obstetrician gynecologist in the early detection of epithelial ovarian cancer. ACOG Committee

Original Research

opinion no. 280. Obstet Gynecol 2011;117: 742-6. 3. American College of Obstetricians and Gynecologists. The role of the generalist obstetrician gynecologist in the early detection of ovarian cancer. ACOG Committee opinion no. 280. Obstet Gynecol 2002;100:1413-6. 4. McGowan L, Lesher LP, Norris HJ, Barnett M. Misstaging of ovarian cancer. Obstet Gynecol 1985;65:568-72. 5. Giede KC, Kieser K, Dodge J, Rosen B. Who should operate on patients with ovarian cancer? An evidence-based review. Gynecol Oncol 2005;99:447-61. 6. Carney ME, Lancaster JM, Ford C, Tsodikov A, Wiggins CL. A population-based study of patterns of care for ovarian cancer: who is seen by a gynecologic oncologist and who is not? Gynecol Oncol 2002;84:36-42. 7. Ueland FR, Desimone CP, Seamon LG, et al. Effectiveness of a multivariate index assay in the preoperative assessment of ovarian tumors. Obstet Gynecol 2011;117:1289-97. 8. Bristow RE, Smith A, Zhang Z, et al. Ovarian malignancy risk stratification of the adnexal mass using a multivariate index assay. Gynecol Oncol 2013;128:252-9. 9. Dearking AC, Aletti GD, McGree ME, Weaver AL, Sommerfield MK, Cliby WA. How relevant are ACOG and SGO guidelines for referral of adnexal mass? Obstet Gynecol 2007;110:841-8. 10. Bristow RE, Hodeib M, Smith A, et al. Impact of a multivariate index assay on referral patterns for surgical management of an adnexal mass. Am J Obstet Gynecol 2013;209: 581.e1-8. 11. Zhang Z, Bast RC Jr, Yu Y, et al. Three biomarkers identified from serum proteomic analysis for the detection of early stage ovarian cancer. Cancer Res 2004;64:5882-90. 12. Zhang Z, Chan DW. The road from discovery to clinical diagnostics: lessons learned from the first FDA-cleared in vitro diagnostic multivariate index assay of proteomic biomarkers. Cancer Epidemiol Biomarkers Prev 2010;19: 2995-9. 13. Zhang ZP, Page G, Zhang H. Applying classification separability analysis to microarray data. In: Lin SM, Johnson KJ, eds. Methods of microarray data analysis. Boston (MA): Kluwer Academic Publishers; 2001. 14. Wilson EB. Probable inference, the law of succession, and statistical inference. J Am Stat Assoc 1927;22:209-12. 15. Moskowitz CS, Pepe MS. Comparing the predictive values of diagnostic tests: sample size and analysis for paired study designs. Clin Trials 2006;3:272-9. 16. DeLong ER, DeLong DM, ClarkePearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837-45. 17. Feng Z, Kagan J, Pepe M, et al. The early detection research network’s specimen reference sets: paving the way for rapid evaluation of

JULY 2016 American Journal of Obstetrics & Gynecology

82.e10

Original Research

GYNECOLOGY

potential biomarkers. Clin Chem 2013;59: 68-74. 18. Longoria TC, Ueland FR, Zhang Z, et al. Clinical performance of a multivariate index assay for detecting early-stage ovarian cancer. Am J Obstet Gynecol 2014;210:78.e1-9. 19. American Cancer Society. Cancer facts and figures 2015. Atlanta (GA): American Cancer Society; 2015. 20. Society of Gynecologic Oncology. SGO position statement. Multiplex serum testing for women with pelvic mass. Chicago (IL): Society of Gynecologic Oncology; 2013. 21. Timmerman D, Van Calster B, Testa AC, et al. Ovarian cancer prediction in adnexal masses using ultrasound-based logistic regression models: a temporal and external validation study by the IOTA group. Ultrasound Obstet Gynecol 2010;36:226-34. 22. Goodrich ST, Bristow RE, Santoso JT, et al. The effect of ovarian imaging on the clinical interpretation of a multivariate index assay. Am J Obstet Gynecol 2014;211:65.e1-11. 23. Bristow RE, Zahurak ML, Alexander CJ, Zellars RC, Montz FJ. FIGO stage IIIC endometrial carcinoma: resection of macroscopic nodal disease and other determinants of survival. Int J Gynecol Cancer 2003;13:664-72.

24. Goff BA, Matthews BJ, Larson EH, et al. Predictors of comprehensive surgical treatment in patients with ovarian cancer. Cancer 2007;109:2031-42. 25. Bristow RE, Palis BE, Chi DS, Cliby WA. The National Cancer Database report on advancedstage epithelial ovarian cancer: impact of hospital surgical case volume on overall survival and surgical treatment paradigm. Gynecol Oncol 2010;118:262-7. 26. Mercado C, Zingmond D, Karlan BY, et al. Quality of care in advanced ovarian cancer: the importance of provider specialty. Gynecol Oncol 2010;117:18-22. 27. Bristow RE, Chang J, Ziogas A, AntonCulver H. Adherence to treatment guidelines for ovarian cancer as a measure of quality care. Obstet Gynecol 2013;121:1226-34.

Author and article information From the Department of Gynecologic Oncology and Reproductive Medicine, University of Texas MD Anderson Cancer Center, Houston, TX (Dr Coleman); Obstetrics and Gynecology, University of Cincinnati Cancer Institute, Cincinnati, OH (Dr Herzog); Departments of Pathology (Drs Chan and Zhang), Oncology (Drs Chan and Zhang), Urology (Dr Chan), and Radiology (Dr Chan), Johns

82.e11 American Journal of Obstetrics & Gynecology JULY 2016

ajog.org Hopkins University School of Medicine, Baltimore, MD; Vermillion Inc, Austin, TX (Drs Munroe, Pappas, and Wolf); and Applied Clinical Intelligence LLC, Bala Cynwyd, PA (Mr Smith). Received Nov. 10, 2015; revised Jan. 22, 2016; accepted March 2, 2016. This study was funded by Vermillion Inc. Disclosure: D.W.C. serves on the advisory board at Vermillion Inc and is sponsored through a research agreement between Vermillion Inc and Johns Hopkins University. Z.Z. is coinventor of patents associated with the OVA1 product and is entitled to royalty payments from the sale of the OVA1 test through a license agreement between Vermillion Inc and Johns Hopkins University; his development work is also sponsored through a research agreement between Vermillion Inc and Johns Hopkins University. A.S. is a paid statistical consultant for Vermillion Inc. D.G.M. is Chief Scientific Officer and Senior Vice President of Business Development at Vermillion Inc. T.C.P. is Director of Research and Development at Vermillion Inc. J.W. is Chief Medical Officer of Vermillion Inc. The remaining authors report no conflict of interest. Portions of this research were presented as a poster at the annual meeting of the American Society of Clinical Oncology, Chicago, IL, May 29-June 2, 2015. Corresponding author: Judith Wolf, MD. jwolf@ Vermillion.com

A9 Julio.pdf

open access article under the CC BY-NC-ND license (http:// creativecommons.org/licenses/by-nc-nd/4.0/). http://dx.doi.org/10.1016/j.ajog.2016.03.003.

674KB Sizes 6 Downloads 166 Views

Recommend Documents

A9 Septiembre.pdf
Page 1. Whoops! There was a problem loading more pages. Retrying... A9 Septiembre.pdf. A9 Septiembre.pdf. Open. Extract. Open with. Sign In. Main menu.

A9 Julio.pdf
Page 1 of 11. GYNECOLOGY. Validation of a second-generation multivariate index assay. for malignancy risk of adnexal masses. Robert L. Coleman, MD; Thomas J. Herzog, MD; Daniel W. Chan, PhD; Donald G. Munroe, PhD;. Todd C. Pappas, PhD; Alan Smith, MS

A9-B1ProportionalReasoning.pdf
Page 1 of 49. 8/9/16. 1. DEPARTMENT OF EDUCATIONAL PSYCHOLOGY. Teaching Proportional. Reasoning: The Role of. Schema-Based Instruction. Presented at Kansas Multi-Tier System of. Supports Symposium. Wichita, Kansas. September 8, 2016. DEPARTMENT OF ED

A9 Abril 2016.pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying.

A9 20170902 Joey Cub Day.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. A9 20170902 ...

The A9 Trunk Road (Scrabster) - Legislation.gov.uk
Oct 11, 2017 - routes suitable for the traffic which will be affected by this Order. ... No person may use, or permit the use of, any vehicle on the length of road, ...

The A9 Trunk Road (Scrabster) - Legislation.gov.uk
Oct 11, 2017 - of the type described in item 8 (diagram 7105 – position of barrier to mark length of road closed or to guide traffic past an obstruction) of the sign ...

A9 Enero 2016.pdf
The authors thank Kevin Mau, MS, for his assistance with statistical analysis. and Carol Walden, RN, for her data collection and case management.

NIELIT A9-R4 Data Communication & Network Technologies SP-3.pdf
2.10 Virtual private networks need not use real private WANs. Page 2 of 4 ... 3.8 Avoids collision on wireless networks H. Advanced mobile phone system.

Cheap Mesuvida A9 Wireless Bluetooth Speaker Mini Column ...
Cheap Mesuvida A9 Wireless Bluetooth Speaker Mini C ... Mobile Phone Pc Free Shipping & Wholesale Price.pdf. Cheap Mesuvida A9 Wireless Bluetooth ...