CMPSCI 291b: Reasoning About Uncertainty
David Mix Barrington
Spring, 2008
Homework Assignment #4
Posted Wednesday 30 April 2008
Due on paper to me or to CMPSCI main office by 4:00 p.m.,
Tuesday 13 May 2008
There were originally five questions for 70 total
points -- the last three-part question for 30 more points was added on
5 May 2008.
Students are responsible for understanding and following
the academic honesty
policies indicated on this page.
Question X.4 corrected 5 May.
- Problem X.4 (10) Suppose my prior hypothesis is that black, yellow,
and chocolate Labrador retrievers are equally numerous.
This is not enough information to solve the problem -- I need to have a prior
that indicates the range of hypotheses I am considering. Let's say that H is
the hypothesis that the three types are equally likely. The hypotheses
HB, HY, and HC say that the distribution of black, yellow, and chocolate is
60-20-20, 20-60-20, and 20-20-60 respectively. My prior will then be that
H is true with probability 70%, and that HB, HY, and HC have probability
10% each. I then observe three
black labs and a chocolate lab. What should be my posterior estimate of the
probability of each of the four hypotheses?
- Problem X.5 (10) In State M, 63% of voters are Democrats and 37%
Republicans, while in State U 73% are Republicans and 27% Democrats. In State
M, 90% prefer the Red Sox to the Yankees and in State U 60% prefer the Red
Sox to the Yankees. Suppose I now meet a person who is either a random State M
resident or a random State U resident with equal probability. What should
be my estimated probability that she is from State M given that she is:
- a Democratic Red Sox fan
- a Democratic Yankee fan
- a Republican Red Sox fan
- a Republican Yankee fan
- Problem X.6 (15)
We place a marker on one of the cells of a three-by-three array (a tic-tac-toe
board). On each turn the marker moves one cell north, east, south, or west,
with each possible move equally likely -- for example, from the northwest corner
it moves south or east with probability 1/2 each. For each of three possible
initial conditions (corner, side, center) determine the probability of each
of the nine positions after one, two, and three turns. Determine the long-term
behavior of the system and the steady-state probability if any.
- Problem X.7 (15)
Repeat Problem X.6 where the marker has an equal probability of remaining where
it is or moving one square -- so from the northwest corner it stays, moves
south, or moves east with probability 1/3 each, and from the center it stays
or moves in one of four directions with probability 1/5 each.
- Problem X.8 (20)
Now consider a Markov decision process where we can decide at each turn
whether to use Strategy A, which keeps the marker stationary with 1/2
probability and moves it in one of the available directions otherwise, and
Plan B, which moves the marker to any of the nine squares with probability
1/9 each. The reward on any turn is 3 for the center square, 2 for a side
square, and 1 for a corner square. What is the policy that maximizes
the reward on the next turn? What is the policy that maximizes discounted
future return with a discount factor of γ = 1/2?
- Problem X.9 (30) This problem views a single
pitch of a baseball game as a two-person zero-sum game with simultaneous
moves. On every pitch the state consists of a number of balls from 0 through
3 and a number of strikes from 0 through 2. On the pitch, the pitcher chooses
whether to throw a good or a bad pitch, and the batter decides to swing or not
swing (take). If the batter swings at a good pitch, he hits the ball in play
which has a 35% chance of being a hit, so the batter gets 35 points. If the
batter takes a good pitch or swings at a bad pitch the number of strikes goes
up by one. If he takes a bad pitch the number of balls goes up by one. If the
number of strikes reaches three the batter is out and gets no points. If the
number of balls reaches four there is a walk and the batter gets a number of
points to be determined below.
>
- (a,5) Determine the optimal mixed strategies for each player if the
count is 3-2 (three balls and two points) and a walk is worth 100 points.
What is the expected return to the batter from this game?
- (b,10) Compute the expected return to the batter for the other eleven
count situations. You must do this in a sensible order. For example, if the
count is 2-1 a ball makes it 3-1 and a strike makes it 2-2, so to compute the
value of 2-1 you must already know the values of 3-1 and 2-2.
- (c,15) Repeat parts (a) and (b) with the walk valued at 60 points
rather than 100.
Last modified 5 May 2008