Optimisation And Control
Optimisation And Control
Jump to year
Paper 2, Section II,
comment(a) A ball may be in one of boxes. A search of the box costs and finds the ball with probability if the ball is in that box. We are given initial probabilities that the ball is in the box.
Show that the policy which at time searches the box with the maximal value of minimises the expected searching cost until the ball is found, where is the probability (given everything that has occurred up to time ) that the ball is in box .
(b) Next suppose that a reward is earned if the ball is found in the box. Suppose also that we may decide to stop at any time. Develop the dynamic programming equation for the value function starting from the probability distribution .
Show that if then it is never optimal to stop searching until the ball is found. In this case, is the policy defined in part (a) optimal?
Paper 3, Section II, K
commentThe scalars are related by the equations
where the initial state is normally distributed with mean and variance 1 and is a sequence of independent random variables each normally distributed with mean 0 and variance 1 . The control variable is to be chosen at time on the basis of information , where and
(a) Let be the Kalman filter estimates of , i.e.
where is chosen to minimise . Calculate and show that, conditional on is normally distributed with mean and variance .
(b) Define
Show that , where and .
(c) Show that the minimising control can be expressed in the form and find . How would the expression for be altered if or had variances other than 1?
Paper 4, Section II,
commentConsider the deterministic system
where and are scalars. Here is the state variable and the control variable is to be chosen to minimise, for a fixed , the cost
where is known and for all . Let be the minimal cost from state and time .
(a) By writing the dynamic programming equation in infinitesimal form and taking the appropriate limit show that satisfies
with boundary condition .
(b) Determine the form of the optimal control in the special case where is constant, and also in general.