Probability

Objective versus Subjective view
Objective view generally deals with frequency oriented interpretation
- You assume or visualize an experiment
- Coin that is tossed. The actual experiment may or may not be conducted
- If you assume randomness, the probability of head is 0.5.
- P(head)=0.5
- P(head,head)=0.5*0.5=0.25
- P(head,tail)=0.25
- P(tail,head)=0.25
- P(tail,tail)=0.25
- P(a head and a tail)=P(head,tail)+P(tail,head)=0.5
P(x or y) = P(x) + P(y) provided x and y are mutually exclusive.
P(x or y) = P(x) + P(y) - P(x and y)
The previous rule can be derived from the simple addition axiom.
Let us use the concept of random variable for simplicity assume that these variables take true or false values.
the variable X1,X2,X3,....,Xn can take values x1,x2,x3,x4,....xn or -x1,-x2,-x3,....,-xn
P(X1,X2)=P(X2|X1)*P(X1)
P(X1,...Xn)=P(X2,.....,Xn|X1)*P(X1)
=P(Xn|X1,....Xn-1)*P(X1,...,Xn-1)
=P(Xn|X1...Xn-1)*P(Xn-1|X1...Xn-2)*....*P(X2|X1)*P(X1)
See the example when n = 5. The graph with no independence assumption is quite complicated
Total number of probabilities required is O(2^n)
Conditional independence assumptions are made to reduce the number of probability values that need to be obtained
You should be able to look at a belief network and construct a formula for joint probability and vice versa
You should be able to explain the advantage of belief networks for a short descriptive question
In the belief networks, the probabilities of evidence are propagated through the network using conditional probabilities.
Local computations in a parallel computer are possible using belief networks.
Most of the probabilities used in belief networks are essentially subjective in nature. We have to stretch our imagination to visualize the experiments
- Coin toss: easy to visualize the experiment
- How do you visualize the experiment for Mary called given that there was an alarm. How many times has this type of situation has occurred or likely to occur to obtain reliable frequency.
- We are using our subjective judgment. Do laws of probability really hold for these subjective probabilities? They don't.
- 1920-1950 people came up with a betting scenario, concept of rationality, utility.
- A rational person will not make a book to lose money. Make assumptions about rational behaviour and show that with those assumptions the laws of probability hold.
- Is betting the right scenario for reasoning uncertainty? It is not.
- There are several objections to the theory of probability.
Fuzzy sets use this criticism of probability, the debate continues
You can come up with many different reasoning schemes but they have to be mathematically sound.
One mathematically sound theory is theory of belief functions.
Bel(A)+Bel(-A)<=1.
More generally, Bel(A or B) >= Bel(A) + Bel(B) assuming A and B are mutually exclusive.
Bel(rain) = 0.56 this was based on evidence in support of rain.
Bel(-rain) if you never did any calculations or there was no evidence to make the calculations to calculate this then you should be allowed say that this value is zero.
Bel(rain or -rain) = 1.
It is snowing, thermometer is saying 0. The probability that therm. is working is 0.95. If therm is working the streets can't be slippery there is a lot of traffic out there
Bel(-slip)=0.95
the prob. that therm is broken is 0.05. but if the thermometer is broken the temp. could really be anything there is no need to assume that the temp is well below 0. So we can't say Bel(slip) =0.05 we have to say Bel(slip)=0 because we have no evidence to conclude slip.
But Bel(slip or -slip)=1.
Bel(slip or -slip)>Bel(slip)+Bel(-slip)
The super additivity axiom is in fact generalized for non-mutually exclusive propositions similar to the probability. That's why belief functions are called generalizations of the probability functions.
The evidential rule is not as sophisticated as the Bayes rule.
There are Markov trees for belief functions similar to the belief nets for probability.