Question 6 (45):
Professor Kyle is conducting an experiment on animal behavior. She observes
a cat for n successive five-minute periods, and characterizes the cat's
behavior in each period as "Active", "Quiet", or "Sleeping". For each time
period t in the set {1, 2,..., n}, the behavior b(t) is thus either A, Q, or S.
She wants to know whether this sequence of behaviors can be well modeled by
a Markov chain. Examining her data, she finds that when b(t) = A, b(t+1)
= A 20% of the time and b(t+1) = Q 80% of the time. When b(t) = Q, b(t+1) = A
20% of the time, b(t+1) = Q 20% of the time, and b(t+1) = S 60% of the time.
Finally, when b(t) = S, she finds that b(t+1) = Q 20% of the time and b(t+1)
= S 80% of the time.
- (a,5) Draw a diagram and write a transition matrix for a Markov chain that
has three states and the given transition probabilities. (For your matrix,
order the rows A, Q, S.)
- (b,10) Determine the steady-state probability of this Markov chain.
- Looking more closely at her data, Professor Kyle discovers that for the
100 values of t where b(t) = A, b(t+2) = A only four times.
- (c,5) Determine the probabilities of each of the three states of the
Markov chain at time t+2, given that the state at time t is A.
- (d,10) Based on your answer to part (c) and the Normal Approximation to
the Binomial, determine how unusua it would be to have b(t+2) only four times
in 100 situations with b(t) = A. Should she reject the hypothesis that the cat
is behaving according to this Markov chain, using a 95% confidence level?
- (e,5) How might the Markov Hypothesis be failing in this situation?
Suggest a way in which she might refine her model to be more accurate.
- (f,5) Suppose now that the cat's behavior, as characterized by these
three states, is characterized by a Markov Decision Process, where the
possible actions are to give the cat some catnip (C) or not (N). What
observations would she need to make to completely describe the MDP?
- (g,5) Once she has all the numbers necessary to describe the MDP from
part (f), how could she determine a policy that would maximize the percentage
of the time that the cat is active, on average in the steady state? (You don't
need to do the arithmetic because I'm not giving you the numbers. But describe
a correct algorithm that she could use -- don't worry if it's not the most
efficient one.)