EHR-Based Study of Long-Term Infectious Diseases Connor Olson1, Yanqing Gui2, Siddharth Madapoosi2 University of Wisconsin – Madison1, University of Michigan – Ann Arbor2

Background & Objectives

Cohort Definition

• Infectious diseases are extremely common, leading to 19.3 million annual visits to physician offices in the United States alone.6 • Infectious disease studies using widely available Electronic Health Record (EHR) databases has largely been nonexistent, especially using the Michigan Genomics Initiative (MGI). • Utilized the Ninth Revision of the International Classification of Diseases (ICD-9) to identify the three most common chronic infectious diseases: Viral Hepatitis C (n=203), Sarcoidosis (n=96), and HIV (n=48). • Hypothesized that the presence of Viral Hepatitis C or HIV in patients would increase their likelihood of developing postoperative complications, as supported by other medical studies1,7,8 • Further targeted our analysis of Sarcoidosis toward utilizing lab measurements as to identify novel lab biomarkers for Sarcoidosis.

Figure 2: Algorithm used to generate the cohorts for each of the three diseases. The six covariates are • Age • Sex, • Ischemic Heart Disease • Congestive Heart Failure • Cerebrovascular disease • Type 2 Diabetes We identified these as major risk factors for post-operative complications.

Exploratory Data Analysis • First examined the association of comorbidities with each of our three infectious diseases

Packages Used: MatchIt works to preprocess data with nonparametric propensity score matching methods in order to improve parametric statistical models.4 The package works efficiently with the Zelig package and is therefore conducive to our goal of matching our disease of interest on a number of likely covariates directly followed by generation of a CLR. The Zelig package streamlines utilization of a number of statistical models with matched data.2

• Sarcoidosis is a relatively common diagnosis, and can greatly benefit from an understanding of biomarkers collected through labs. • PCA reduces the dimensionality of high dimensional datasets. • Applied to lab data relating to Sarcoidosis after matching. • Using the first six PCs covered 74% of our data. • Reduced 20 dimensions of data down to 6 with minimal loss of info. • PCs do not describe a major difference between cases and controls. • Backwards Selection on the CLR model returned 4 significant predictors, chloride, creatinine, red blood cell count and distribution.

Infectious Diseases and Postoperative Complications • • • • •

Over 18,000 patient records in the original data set, only 48 patients with HIV. 3 HIV patients had developed postoperative complications. CLR revealed that the parameter for HIV in the model was not significant. The lack of cases could be due to HIV patients only receiving surgery when it is absolutely necessary due to all the risks that come with an HIV infection. Viral Hepatitis C had a sample size of 203 with 79 patients having complications.

(A)

Figure 1: In the network model of common HIV (n=48) comorbidities, nodes and edges, which show both non-significant differences from population prevalence (dashed) and significant differences with p < 0.05 (solid) are sized relative to number of occurrences in patients. All nodes and edges are greater than 9 co-occurrences. This shows the connections between common HIV comorbidities, grouped by disease type, but lacks true prevalence counts. This method of data visualization is great at seeing underlying structure. This network was generated using iGraph3.

Lab Biomarkers for Sarcoidosis

Covariate

Pre-Matching PValue

Post-Matching P-Value

Ischemic Heart Disease

3.37e-05

0.88

Congestive Heart Failure (Nonhypertensive)

2.30e-05

0.36

Congestive Heart Failure

7.89e-06

0.38

Cerebrovascular Disease

2.30e-05

0.95

Type 2 Diabetes

3e-10

0.77

Age

0.15

0.045

Sex

8.40e-06

0.51

(B) Figure 3: Propensity score nearest-neighbor matching on all 6 covariates for Viral Hepatitis C is effective in reducing confounding by factors linked to postoperative complications. (A) Welch’s two-sample t-test of means showed a marked increase in p-value of all covariates but Age following matching. (B) In addition, matching resulted in a similar distribution of propensity scores among cases and controls.

Sampling Distribution of CLR P-Values by Matching Algorithm over 50 Iterations

Figure 6: PCA on Sarcoidosis using 20 lab measurements showed no major difference between cases and controls in and of the PCs. These scatterplots between 6 PCs generated through PCA show no major difference between cases (blue) and controls (red).

Conclusion •Infectious disease research can benefit from the ease of access of EHR datasets to study long term effects of infectious diseases. •EHRs have many pitfalls with missing data, loss of patient data during subsetting, and lack of cases. •HIV did not have enough cases with post-operative complications to suggest a meaningful conclusion. •Sarcoidosis PCA did not show relationships with lab data values. •Backwards selection onto the CLR for Sarcoidosis showed four labs as significant biomarkers, but calcium was not identified despite being the diagnostic biomarker for the disease. •Viral Hepatitis C propensity score matching reduced confounding in CLR on the association with postoperative complications. •Given the risk factors matched on, it appears that Viral Hepatitis C is associated with an increased likelihood of developing postoperative complications. •Subsequent corroboration with Nail Dermatophytosis shows necessity of matching on risk factors for proper conclusions.

References Figure 2: Common comorbidity frequencies in the sample of Viral Hepatitis C patients (n=203) are shown as well as whether their prevalence among Viral Hepatitis C patients is (blue) or is not (red) significantly different from that of the entire population. This approach retains the true prevalence of each disease but loses any connections between diseases that could be easily identified in the network model.

Figure 4: In algorithms that do not match on all 6 risk factors for postoperative complications, Nail Dermtophytosis appears to be associated with postoperative complications, but matching on all 6 risk factors removes this effect. This demonstrates the necessity of matching to remove false signals caused by other covariates.

Figure 5: In all four matching algorithms, Viral Hepatitis C remains a significant predictor of postoperative complications, though the average p-value using CLR increases following matching on all 6 risk factors. Propensity score matching on known risk factors thus appears to reduce the confounding present in data for association studies.

1. Asthana, S., & Kneteman, N. (2009). Operating on a patient with hepatitis C. Canadian Journal of Surgery,52(4), 337-342. Retrieved July 11, 2017, from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2724804/. 2. Choirat C, Honaker J, Imai K, King G and Lau O (2017). Zelig: Everyone's Statistical Software. Version 5.1- 2, http://zeligproject.org/ 3. Csardi G, Nepusz T: The igraph software package for complex network research, InterJournal, Complex Systems 1695. 2006. http://igraph.org 4. Daniel E. Ho, Kosuke Imai, Gary King, Elizabeth A. Stuart (2011). MatchIt: Nonparametric Preprocessing for Parametric Causal Inference. Journal of Statistical Software, Vol. 42, No. 8, pp. 1-28. URL http://www.jstatsoft.org/v42/i08/ 5. Denny JC, Bastarache L, Ritchie MD et al. (2013). Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat Biotechnol. 31(12): 1102-10 6. Infectious Disease. (2017, May 03). Retrieved July 11, 2017, from https://www.cdc.gov/nchs/fastats/infectious-disease.htm 7. Koshy, R. C., Rajasree, & Thomas, M. (2010). Anaesthetic Management of a Patient with Sarcoidosis Presenting for Mastectomy. Journal of Anaesthesiology - Clinical Pharmacology, 26(4), 555-556. Retrieved July 11, 2017, from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3087275/. 8. Smit, S. (2010). Guidelines for surgery in the HIV patient . Continuing Medical Education, 28(8), 356-358. Retrieved July 11, 2017, from http://www.cmej.org.za/index.php/cmej/article/viewFile/1853/1564

EHR - Olson, Gui, Madapoosi.pdf

Operating on a patient with hepatitis C. Canadian Journal of Surgery,52(4), ... data and genome-wide association study data. ... Type 2 Diabetes 3e-10 0.77.

908KB Sizes 4 Downloads 257 Views

Recommend Documents

EHR - Olson, Gui, Madapoosi.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. EHR - Olson ...

Matthew E. Olson - GitHub
Mar 13, 2016 - All-Conference. 33 career varsity goals. Experience. Chesapeake Technology International. Denver, CO. SOFTWARE ENGINEERING INTERN.

Olson 323.pdf
lower on entrepreneurial self-efficacy, a person's belief in his/her ability to be a successful. entrepreneur. With regard to motivations, women were less driven to seek entrepreneurship for. career and independence reasons. Finally, when examining s

Nurse EHR Rankings 2018.pdf
6 EPIC SYSTEMS 80% 95% 93% 83% 88%. 7 HEALTHLAND ... 4 OPERATING ROOM 84% 81% 86% 91% 86% ... Displaying Nurse EHR Rankings 2018.pdf.

nifty gui 1.3.20 - GitHub
Dec 28, 2011 - Change Panel, Image and Text Properties. 100. 10.Nifty Styles. 101 ..... a DSL (Domain Specific Language) for Nifty instead of a regular class. ...... the control definition tag you are free to apply any style to the elements that ...

EHR - Emerson, Goldman, Kolli.pdf
Connect more apps... Try one of the apps below to open or edit this item. EHR - Emerson, Goldman, Kolli.pdf. EHR - Emerson, Goldman, Kolli.pdf. Open. Extract.

Olson et al 2009_Family Matters.pdf
published collections of parental experi- ences and clinical wisdom [e.g., Klein- feld and Wescott, 1993; Kleinfeld et al.,. 2000; Salmon, 2007]. This extensive.

Apple EHR- Your Life Is A Story….pdf
Loading… Page 1. Whoops! There was a problem loading more pages. Apple EHR- Your Life Is A Story….pdf. Apple EHR- Your Life Is A Story….pdf. Open.

Tailor Your EHR System to Future Guidelines for ... - SLIDEBLAST.COM
Register for the webinar at the IHE webinar website ... profile during the IHE Quality, Research, and Public Health (QRPH) Domain. Update 8 August 2014. Check out all the exciting new technical supplements and white papers to help you.

Apple EHR- Your Life Is A Story….pdf
Page 1 of 2. Stand 02/ 2000 MULTITESTER I Seite 1. RANGE MAX/MIN VoltSensor HOLD. MM 1-3. V. V. OFF. Hz A. A. °C. °F. Hz. A. MAX. 10A. FUSED.

W-EHR: A Wireless Distributed Framework for secure ...
Technological Education Institute of Athens, Greece [email protected] ... advanced operations (such as to provide access to the data stored in their repository ...

WILLIAM GUI WOOLSTON STANFORD UNIVERSITY ...
Quantitative Intern, Weiss Asset Management for Professor Andrew Weiss (BU). 2003 ... Economic theory predicts that access to credit helps people smooth their ...

Download Tkinter GUI Application Development ...
architecture * Build multithreaded and database-driven apps * Create apps that ... data with object serialization and tools such as configparser In Detail Tkinter is ...

Cyclone PCB Factory Gui Tutorial -
C:\Anaconda\python.exe setup.py install. Install PyPy. Download PyPy from http://pypy.org/download.html and install it. Install Java. Download Java from ...