Studying and improving current approaches for using a Bayesian framework for modeling gene regulatory networks Milad Behbahaninia1 and Michael Verdicchio2 Harrington Department of Bioengineering1, School of Computing and Informatics2

1) Study current approaches for using the Bayesian framework in biology Research Goals: 2) Improve usage of Bayesian networks in modeling gene regulation Abstract Gene regulatory networks are networks of genomic interaction, lending insight into various functions within a cell. Learning new ways in which genes interact can facilitate the discovery of triggering mechanisms and treatments for high-profile diseases such as cancer. There are many ways by which gene regulatory networks can be modeled, one of which is a Bayesian network (BN). Even though Bayesian networks have been widely used there is definite room for improvement. We will incorporate prior knowledge learned from biomedical knowledge bases into the Bayesian network learning process in order to improve the chances discovering of novel gene interactions using gene expression microarray data.

Why a Bayesian Network?

Challenges of BN Learning in the Biomedical Domain:

• A Bayesian network is a probabilistic graphical structure representing a set of variables and the probabilities of their interactions. • BNs minimize the number of parameters needed to specify the probability distribution by taking advantage of the conditional independencies between genes. These conditional independencies are encoded in the graph to help visualization and reasoning [6]. • BNs have been proven to be useful and important in biomedical applications including clinical decision support systems and information retrieval. BNs have also been researched for more than a decade, resulting in practical algorithms and tools. Learning the network back from data sets involves a search-and-score strategy, which attempts to identify the most probable network given the data. • Algorithms search the space of all possible networks for the one that maximizes the score based on greedy, local, or other search strategies [1].

1. Data sets are extremely large 2. Data sets have few samples but many variables 3. There are redundant and concurrent processes in the cellular system, but a BN model assumes all data comes from some homogenous distribution

Example Bayesian Network

Knowledge Resources Available for Integration with BNs:

P(s1) = 20%

S Feature S B L F C

Value

Expression

s1

Smoker

S2

Non- smoker

b1

Bronchitis

B2

No bronchitis

l1

Lung cancer

L2

No lung cancer

f1

Fatigue

P(f1|b1,l1) = 75%

F2

No fatigue

P(f1|b1,l2) = 10%

C1

Positive chest X-ray

P(f1|b2,l1) = 50%

C2

Negative chest X-ray

P(f1|b2,l2) = 5%

P(b1|s1) = 25%

P(l1| s1) = 0.3%

P(b1|S2) = 5%

L

P(l1 | S2) = 0.005%

B C

F

P(c1 | l1) = 60% P(c1 | l2) = 2%

Bayesian Network Learning with Knowledge Integration Tissues/Samples B

Bayesian Learning Algorithm

Genes

B

A

A D

D

C

C

Gene Ontology (GO) Interactions Annotates genes with BN edges can be validated their processes and by comparing the functions. Edges learned connected genes to known by the BN can be interactions from validated using GO terms biomedical literature

Pathways Edges learned by the BN can be validated by finding biological signaling pathways shared by the genes

What’s Next? The immediate goal is to incorporate biological knowledge obtained through knowledge repositories to improve the Bayesian network learning technique, which will be validated by comparing the improved method to the traditional method. The improved technique will be used in a human cancer study. The future goal is to apply similar techniques learned Bayesian networks to Causal Networks. Causal networks are a special case of Bayesian networks in which each connection carries with it the implication of a causal relationship. In the end, modified techniques for Bayesian and Causal networks will be evaluated and assessed. Works Cited 1. E. Almasri, P. Larsen, G. Chen, and Y. Dai, "Incorporating literature knowledge in bayesian network for inferring gene networks with gene expression data," 2008, pp. 184-195. [Online]. Available: http://dx.doi.org/10.1007/978-3-54079450-9_18 2. Ashburner et al., “Gene Ontology: tool for the unification of biology,” 2000. [Online]. Available: http://www.nature.com.ezproxy1.lib.asu.edu/ng/journal/v25/n1/abs/ng0500_25.html 3. Harris et al., “The Gene Ontology (GO) database and informatics resource,” 2003. [Online]. Available: http://www.nature.com.ezproxy1.lib.asu.edu/ng/journal/v25/n1/abs/ng0500_25.html 4. M. Kanehisa, S. Goto, S. Kawashima, and A. Nakaya, “The KEGG databases at GennomeNet,” 2001. [Online]. Avaiable: http://nar.oxfordjournals.org.ezproxy1.lib.asu.edu/cgi/content/full/30/1/42 5. P. Larsen, E. Almasri, G. Chen, and Y. Dai, "A statistical method to incorporate biological knowledge for generating testable novel gene regulatory interactions from microarray experiments," BMC Bioinformatics, vol. 8, pp. 317+, August 2007. [Online]. Available: http://dx.doi.org/10.1186/1471-2105-8-317 6. J. Pena, J. Bjorkegren, J. Tegner, “Growing Bayesian Network Models of Gene Networks from Seed Genes,” 2005. [Online]. Available: http://bioinformatics.oxfordjournals.org.ezproxy1.lib.asu.edu/cgi/reprint/21/suppl_2/ii224

Acknowledgements Unknown Gene Network

Micro-Array Data

Biological Knowledge

FURI: Research Program

Learned Network

Seungchan Kim, Ph.D., Assistant Professor, School of Computing and informatics Ina Sen, Graduate Research Associate, School of Computing and Informatics Archana Ramesh, Graduate Research Associate, School of Computing and Informatics

1) Study current approaches for using the Bayesian ...

Causal Networks. Causal networks are a special case of Bayesian networks in which each connection carries with it the implication of a causal relationship. In the end, modified techniques for Bayesian and Causal networks will be evaluated and assessed. D. C. B. A. Tissues/Samples. Genes. Unknown Gene Network.

212KB Sizes 0 Downloads 136 Views

Recommend Documents

Bayesian Approaches to Distribution Regression
[14] Michelle Ntampaka, Hy Trac, Dougal J. Sutherland, S. Fromenteau, B. Poczos, and Jeff. Schneider. Dynamical mass measurements of contaminated galaxy ...

Approaches of using UML for Embedded System ... - Semantic Scholar
and software. Choice of system components depends upon optimality of. - Implementation of function. - Analysis of alternatives in both hardware and software domains. Specification of flexible hardware- ... These systems are integration of subsystems

Approaches of using UML for Embedded System ... - Semantic Scholar
of constraints like performance, cost, power consumption, size and weight etc. The platform of implementation may be ... signal processing of audio and video, wired and wireless communication have led to complex embedded ..... for Proposal: UML. Prof

Comparative Study of Surgical Approaches for Renal Pelvic Stones ...
Page 1 of 2. Stand 02/ 2000 MULTITESTER I Seite 1. RANGE MAX/MIN VoltSensor HOLD. MM 1-3. V. V. OFF. Hz A. A. °C. °F. Hz. A. MAX. 10A. FUSED.

Bayesian Hypothesis Test for sparse support recovery using Belief ...
+82-62-715-2264, Fax.:+82-62-715-2274, Email:{jwkkang,heungno,kskim}@gist.ac.kr). Abstract—In this ... the test are obtained by aid of belief propagation (BP).

Data Cleaning: Problems and Current Approaches
So far, these systems provide only limited support for data cleaning, ... extraction and integration will further increase response times but is mandatory to achieve useful query ... major errors and inconsistencies both in individual data sources an

Alternative Approaches to the Study of Change
teams and my clinical team, and all the wonderful people of Wediko who have helped make this project .... continued to receive support and empirical testing, researchers have also developed other theories of ..... and Social Problems; the broadband m

Automatic speaker recognition using dynamic Bayesian network ...
This paper presents a novel approach to automatic speaker recognition using dynamic Bayesian network (DBN). DBNs have a precise and well-understand ...

BAYESIAN COMPRESSED SENSING USING ...
weight to small components encourages sparse solutions. The CS reconstruction ... knowledge about the signal. ... MERIDIAN PRIORS. Of interest here is the development of a sparse reconstruction strategy using a Bayesian framework. To encourage sparsi

Proportional Bandwidth Sharing Using Bayesian ...
(SDN) paradigm, traffic management in data center networks ... Index Terms—Software Defined Networks, OpenFlow, Data center ..... that is building up. Fig.

1 Bayesian Inference with Tears a tutorial workbook for natural ...
be less competitive when large amounts of data are available, anyway – prior knowledge is more powerful when ... I don't know that Bayesian techniques will actually deliver on these promises, on the big problems that we care about. ... Input: a cip

1 Bayesian Inference with Tears a tutorial workbook for natural ...
I had Mr. Robinson for calculus in high school, but I mean, man, that was a long time ago. ... Does it work? Well, do Bayesian techniques really work, on real problems? If it works, then I should bother. Bayesian techniques seem to beat plain EM1 on

Study Guide for Unit 1
Accessibility, Connectivity, Topological Space, and Time Space Convergence k. Toponyms and place ... Geography today (new technologies) i. Environmental ...

Study of Data Warehouse Modeling and its different design approaches
Though multidimensional data models and star schema are relevant for warehouse .... Several efficient multidimensional data structures such as dwarfs [47, 48], ...

A Study on Convex Optimization Approaches to Image ...
ful applications include image denoising [22,8,30], image decomposition [2,17] ..... + ui/c)}) . (30). Interestingly, our experiments show that just one single step of ... experiments are computed by a Ubuntu desktop with AMD Athalon 64 X2 5600.

Bayesian Active Learning Using Arbitrary Binary Valued Queries
Liu Yang would like to extend her sincere gratitude to Avrim Blum and Venkatesan. Guruswami for several enlightening and highly stimulating discussions. Bibliography. Bshouty, N. H., Li, Y., & Long, P. M. (2009). Using the doubling dimension to analy