CAUSAL NETWORKS

Frans Coenen

Liverpool University

Department of Computer Science

January 2002

Version I

**** IN PREPARATION, APPLETS DO NOT WORK! ****

Contents:

  1. Introduction.
  2. Serial connections.
  3. Diverging connections.
  4. Converging connections.
    1. Explaining away.
  5. Prior certainty.
  6. d-separation and d-connection (summary).



1. INTRODUCTION

A causal network is a directed graph, i.e. a set of nodes connected by directed arcs. The nodes represent events (or propositions), which at their simplest can be either true or false. In addition each node has a certainty associated with it. If we increase the certainty of an event (represented by a node) this increase is transmitted across the net work, according to the nature of the connections and what else has been evidenced, so as to increase/decrease the certainty associated with other nodes.

We can identify three basic types of connection:

  1. Serial (Figure 1)
  2. Diverging (Figure 2)
  3. Converging (Figure 3)



1. SERIAL CONNECTIONS

Figure 1: Serial connection

An example of a serial connection in a causal network is presented in Figure 1. The network states that A causes B with certainty xB, and B causes C with certainty xC. Thus:

We can also reason in the opposite direction. If we have evidence for C then this increases the certainty of B and consequently the certainty of A.

Thus evidence may be transmitted up and down a serial connection. However, if (in Figure 1) the state of B is known (we say that B has been instantiated) then the channel is blocked and A and C become independent, we say that A and C are d-separated.

Rule 1: Evidence may be transmitted through a serial connection unless the node has been instantiated.


3. DIVERGING CONNECTIONS

In Figure 2 a diverging connection is presented where A causes both B and C. If the certainty associated with A is increased then this is transmitted to both B and C. Conversely if the certainty of B is increased, this will be transmitted to A, and then from A to C. (And, conversely if the certainty of C is increased, this will be transmitted to A, and then B.)

Figure 2: Diverging connection

Thus, if we know nothing about A, then B and C are dependent; any change in the evidence for either B or Cwill affect the other (and also the evidence for A). However, if A is instantiated then B and C will become independent; any change in the evidence for B will not affect C (and vice versa). This is called conditional independence.

Rule 1: Evidence may be transmitted through a diverging connection unless the node has been instantiated.


4. CONVERGING CONNECTION

A converging connection is presented in Figure 3. Here events A, B and C may D, i.e a change in the certainty for either A, B and C will be transmitted to D. However, A, B and C are independent; for example any change in A will not be transmitted on to B or C through D. However, if we have evidence for D, either directly or through a serial connection to a child node this is transmitted back up to the parents of D and consequently A, B and C become dependent. This is called conditional independence

Figure 3: Converging connection




4.1 Explaining Away

In Figure 4 we have a diverging connection combined with a converging connection. Event C may be caused by either A or D, while B can only be caused by A.

Figure 4: Explaining away

Rule 3: Evidence may be transmitted through a converging connection only if either the variable in the connection has received evidence (directly or through a serial or diverging connection).


5. PRIOR CERTAINTY

From the above we have seen that if the certainty associated with an event is changed this has an effect on other events linked through the network (either positive or negative). Thus, prior to any reasoning that my take place, each event must have some "start" certainty associated with it. We can calculate these by assigning prior certainties to the root nodes and then calculating the affect on connected nodes.




6. d-SEPARATION AND d-CONNECTION (SUMMARY)

From the above evidence is transmitted from one node to another if they are dependent, i.e. they are d-connected. A node is d-connected if it is not d-separated. A node A is d-separated from a node B if for all paths between A and B there is a intermediate node C such that the connection is either:

  1. Serial and C is instantiated, or
  2. diverging and C is instantiated, or
  3. converging and C has not received any evidence.

7. REFERENCES

  1. Jensen, F.V.(1996). An Introduction to Bayesian Networks, UCL Press, London, (Chapter 2).



Created and maintained by Frans Coenen. Last updated 01 February 2002