BAYESIAN NETWORKS

Frans Coenen

Liverpool University

Department of Computer Science

January 2002

Version I


Contents:

  1. Introduction.
    1. Joint probability.
    2. Conditional probability.
  2. Example 1: Diverging connection.
    1. Instantiating B.
    2. Instantiating A.



1. INTRODUCTION

A Bayesian network is a causal network comprising:

  1. A set of variables (nodes), each with a finite set of mutually exclusive values.
  2. A set of directed edges between variables, representing causal influences.
  3. A probability table attached to each variable, conditioned on the variable's parents.

The variables and edges form a Directed Acyclic Graph (DAG), i.e. there are no "feed-back cycles" in the graph. The root nodes of this graph have unconditional probabilities associated with them.
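As a concrete illustration of this structure (a minimal sketch in Python, not a prescribed notation; the dictionary layout and the function name is_acyclic are arbitrary choices), the diverging network used in Example 1 below can be written as a mapping from each node to its parents, and the absence of feed-back cycles checked mechanically:

# Minimal sketch: a DAG as a mapping from each node to its parents.
# Node names are illustrative; they match Example 1 below.
parents = {
    "A": [],        # root node: carries an unconditional probability table
    "B": ["A"],
    "C": ["A"],
}

# Check that the graph is acyclic (no feed-back cycles) using a
# depth-first search over the parent links.
def is_acyclic(parents):
    state = {}                       # node -> "visiting" or "done"
    def visit(node):
        if state.get(node) == "visiting":
            return False             # found a cycle
        if state.get(node) == "done":
            return True
        state[node] = "visiting"
        ok = all(visit(p) for p in parents[node])
        state[node] = "done"
        return ok
    return all(visit(n) for n in parents)

print(is_acyclic(parents))           # True: the diverging connection is a DAG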


1.1 Joint Probability

Equation 1

p(ai and bi) = p(ai) x p(bi)

where ai and bi are possible values for two variables A and B, and p(n) is the probability of value n being true (probability is expressed as a real number between 0.0 and 1.0). Note that Equation 1 holds only when A and B are independent.

Alternatively we can calculate the joint probability using:

Equation 2

p(ai and bi) = p(ai | bi) x p(bi)

where p(ai | bi) is the conditional probability of the value ai being true given that the value bi is true.
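Both forms of the joint probability can be checked with a couple of lines of Python; this is only a sketch, and the numerical values used here are arbitrary illustrations rather than values taken from the example below:

# Equation 1: joint probability when A and B are independent.
p_ai = 0.7
p_bi = 0.5
print(p_ai * p_bi)            # p(ai and bi) = p(ai) x p(bi) = 0.35

# Equation 2: the general form, via a conditional probability.
p_ai_given_bi = 0.6           # arbitrary illustrative value
print(p_ai_given_bi * p_bi)   # p(ai and bi) = p(ai | bi) x p(bi) = 0.3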


1.2 Conditional Probability

The conditional probability p(ai | bi) can be obtained from p(bi | ai) using Bayes' rule:

Equation 3

              p(bi | ai) x p(ai)
p(ai | bi) = --------------------
                    p(bi)
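Equation 3 (Bayes' rule) is easy to express as a small helper function; the sketch below uses the values that appear in Example 1 later in these notes, so the printed result anticipates the 0.95 obtained there:

# Equation 3: p(ai | bi) = p(bi | ai) x p(ai) / p(bi)
def conditional(p_b_given_a, p_a, p_b):
    return p_b_given_a * p_a / p_b

print(round(conditional(0.8, 0.7, 0.59), 2))   # 0.95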




2. EXAMPLE 1: DIVERGING CONNECTION

Figure 1 presents a network with a diverging connection, in which A causes both B and C.

Figure 1: Diverging connection

We will assume that each variable has two values, i and j, each of which has a probability associated with it. These values might then be interpreted as follows:

Value   Interpretation
ai      Rail travel disruption
aj      No rail travel disruption
bi      Trevor late for work
bj      Trevor not late for work
ci      Paul late for work
cj      Paul not late for work

We then attach probability tables to each node. In the case of A, the root node, this will be an unconditional probability table, p(A), whose entries sum to 1:

Variable   P
ai         0.7
aj         0.3
Unconditional Probability table for Node A

Thus the probability of there being disruption on the railways is 0.7. We attach conditional probability tables to nodes B and C (P(B|A) and P(C|A)). That for B is given below (C will have identical values).

        ai      aj
bi      0.8     0.1
bj      0.2     0.9
Conditional Probability table for Node B

Note that the figures entered into these tables are subjective; for example, the probability of Trevor being late for work if there is disruption on the railways is taken to be 0.8, and the probability that Trevor will be late for work if there is no disruption on the railways is taken to be 0.1.

The joint probability table for p(A and B) is given below (again, that for p(A and C) will have identical values). The values are calculated using Equation 2, for example:

p(bn and an) = p(bn | an) x p(an)

Thus:

p(bi and ai) = p(bi | ai) x p(ai) = 0.8 x 0.7 = 0.56
        ai                    aj
bi      0.8 x 0.7 = 0.56      0.1 x 0.3 = 0.03
bj      0.2 x 0.7 = 0.14      0.9 x 0.3 = 0.27
p(B and A)

By marginalising A from p(B and A) we get p(B):

p(B) = p(C) = ((0.56+0.03),(0.14+0.27)) = (0.59,0.41)

i.e.

p(B=bi) = p(C=ci) = 0.59
p(B=bj) = p(C=cj) = 0.41

Thus:

Variable   P
ai         0.70
aj         0.30
bi         0.59
bj         0.41
ci         0.59
cj         0.41

i.e. the probability of Trevor being late for work is 0.59, and likewise the probability of Paul being late for work is 0.59.
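The whole calculation above (building the joint table p(B and A) with Equation 2 and then marginalising A out) can be reproduced in a few lines of Python; the dictionary names are illustrative only:

# Prior for A and conditional table p(B | A), as given above.
p_a = {"ai": 0.7, "aj": 0.3}
p_b_given_a = {"ai": {"bi": 0.8, "bj": 0.2},
               "aj": {"bi": 0.1, "bj": 0.9}}

# Joint table p(B and A) via Equation 2: p(b and a) = p(b | a) x p(a).
joint = {(b, a): p_b_given_a[a][b] * p_a[a]
         for a in p_a for b in ("bi", "bj")}

# Marginalise A out of the joint table to obtain p(B).
p_b = {b: sum(joint[(b, a)] for a in p_a) for b in ("bi", "bj")}
print({b: round(v, 2) for b, v in p_b.items()})    # {'bi': 0.59, 'bj': 0.41}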


2.1 Instantiating B

Now suppose that we obtain evidence that bi is true, i.e. B = bi (Trevor is late for work); we say that node B has been instantiated. Knowledge of this allows us to revise the probability table for node A. The revision is done by calculating the conditional probability of A given B = bi, p(A|B=bi), using Equation 3:

                p(B=bi|A) p(A)
p(A|B=bi) = ---------------------
                    p(B=bi)

We know that:

p(B=bi) = 0.59 (the probability of Trevor being late for work, for whatever reason)
p(bi|ai) = 0.8 (the probability of Trevor being late for work given disruption on the railways)
p(bi|aj) = 0.1 (the probability of Trevor being late for work given no disruption on the railways)
p(ai) = 0.7 (the probability of disruption on the railways)
p(aj) = 0.3 (the probability of no disruption on the railways)

Thus:

             (0.8,0.1) x (0.7,0.3)     (0.8 x 0.7, 0.1 x 0.3)
p(A|B=bi) = ----------------------- = ------------------------
                      0.59                      0.59

              (0.56,0.03)
            = ------------- = (0.95,0.05)
                  0.59

The new probability table for Node A is now:

Variable   P
ai         0.95
aj         0.05
Probability table for Node A

Note that the probability of there being disruption on the railways has increased as a result of node B being instantiated (i.e. knowledge that Trevor is late at work).
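The same revision can be checked numerically; a short sketch, reusing the tables given earlier:

p_a = {"ai": 0.7, "aj": 0.3}
p_bi_given_a = {"ai": 0.8, "aj": 0.1}     # the bi row of p(B | A)
p_bi = 0.59                               # obtained earlier by marginalisation

# Equation 3 applied to each value of A: p(a | B=bi) = p(bi | a) x p(a) / p(bi).
p_a_given_bi = {a: p_bi_given_a[a] * p_a[a] / p_bi for a in p_a}
print({a: round(v, 2) for a, v in p_a_given_bi.items()})   # {'ai': 0.95, 'aj': 0.05}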

The new probability table for Node C must now be calculated using the revised table for A:

p(ci and ai) = p(ci | ai) x p(ai)

        ai                     aj
ci      0.8 x 0.95 = 0.760     0.1 x 0.05 = 0.005
cj      0.2 x 0.95 = 0.190     0.9 x 0.05 = 0.045
p(C and A)

Marginalising A out of this table then gives:

p(C) = (0.765,0.235)

Thus:

Variable   P
ai         0.950
aj         0.050
bi         1.000
bj         0.000
ci         0.765
cj         0.235

The probability of Paul being late for work has thus increased from 0.59 to 0.765.
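Again this can be checked with a few lines of Python, repeating the joint-table and marginalisation step but with the revised table for A; a sketch:

p_a_revised = {"ai": 0.95, "aj": 0.05}             # p(A | B=bi) from above
p_c_given_a = {"ai": {"ci": 0.8, "cj": 0.2},
               "aj": {"ci": 0.1, "cj": 0.9}}

# p(c and a) = p(c | a) x p(a), using the revised p(A); then marginalise A out.
p_c = {c: sum(p_c_given_a[a][c] * p_a_revised[a] for a in p_a_revised)
       for c in ("ci", "cj")}
print({c: round(v, 3) for c, v in p_c.items()})    # {'ci': 0.765, 'cj': 0.235}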


2.2 Instantiating A

If we now receive knowledge that there is no disruption on the railways (A = aj, so p(aj) = 1), this will have no effect on node B (we already know that Trevor is late); however, we must revise the probability table for C:

        ai                  aj
ci      0.8 x 0.0 = 0.0     0.1 x 1.0 = 0.1
cj      0.2 x 0.0 = 0.0     0.9 x 1.0 = 0.9
p(C and A)

Thus:

Variable   P
ai         0.000
aj         1.000
bi         1.000
bj         0.000
ci         0.100
cj         0.900
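Instantiating A amounts to replacing the table for A with certainties (0 and 1) and repeating the same step for C; a final sketch:

p_a_instantiated = {"ai": 0.0, "aj": 1.0}          # evidence: no rail disruption
p_c_given_a = {"ai": {"ci": 0.8, "cj": 0.2},
               "aj": {"ci": 0.1, "cj": 0.9}}

p_c = {c: sum(p_c_given_a[a][c] * p_a_instantiated[a] for a in p_a_instantiated)
       for c in ("ci", "cj")}
print(p_c)                                         # {'ci': 0.1, 'cj': 0.9}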




Created and maintained by Frans Coenen. Last updated 01 February 2002