Finite-and infinite-horizon shapley games with nonsymmetric partial observation
Document Type
Article
Publication Title
SIAM Journal On Control And Optimization
Abstract
We consider asymmetric partially observed Shapley-type finite-horizon and infinite-horizon games where the state, a controlled Markov chain {Xt}, is not observable to one player (minimizer) who observes only a state-dependent signal {Yt}. The maximizer observes both. The minimizer is informed of the maximizer's action after (before) choosing his control in the MINMAX (MAXMIN) game. A nontrivial open problem in such situations is how the minimizer can use this knowledge to update his belief about {Xt}. To address this, the maximizer uses off-line control functions which are known to the minimizer. Using these, novel control-parameterized nonlinear filters are constructed which are proved to characterize the conditional distribution of the full path of {Xt} . Using these filters, recursive algorithms are developed which show that saddle-points exist in both behavioral and Markov strategies for the finite-horizon case in both games. These algorithms are extended to prove saddle-points in Markov strategies for both games for the infinite-horizon case. A counterexample shows that the finite-horizon MINMAX value may be greater than the MAXMIN value. We show that the asymptotic limits of these values converge to the corresponding MINMAX and MAXMIN saddle-point values in the infinite-horizon setup. Another counterexample shows that the uniform value need not exist.
DOI Link
Publication Date
1-3-2015
Publisher
Society For Industrial and Applied Mathematics Publications
Volume
Vol.53
Issue
Iss.6
Recommended Citation
Basu, Arnab and Stettner, Lukasz, "Finite-and infinite-horizon shapley games with nonsymmetric partial observation" (2015). Faculty Publications. 724.
https://research.iimb.ac.in/fac_pubs/724