4.II.29I
Consider the scalar controllable linear system, whose state evolves by
with observations given by
Here, is the control variable, which is to be determined on the basis of the observations up to time , and are independent random variables. You wish to minimize the long-run average expected cost, where the instantaneous cost at time is . You may assume that the optimal control in equilibrium has the form , where is given by a recursion of the form
and where is chosen so that is independent of the observations up to time . Show that , and determine the minimal long-run average expected cost. You are not expected to simplify the arithmetic form of your answer but should show clearly how you have obtained it.
Typos? Please submit corrections to this page on GitHub.