Title:
|
Risk-sensitive Markov stopping games with an absorbing state (English) |
Author:
|
López-Rivero, Jaicer |
Author:
|
Cavazos-Cadena, Rolando |
Author:
|
Cruz-Suárez, Hugo |
Language:
|
English |
Journal:
|
Kybernetika |
ISSN:
|
0023-5954 (print) |
ISSN:
|
1805-949X (online) |
Volume:
|
58 |
Issue:
|
1 |
Year:
|
2022 |
Pages:
|
101-122 |
Summary lang:
|
English |
. |
Category:
|
math |
. |
Summary:
|
This work is concerned with discrete-time Markov stopping games with two players. At each decision time player II can stop the game paying a terminal reward to player I, or can let the system to continue its evolution. In this latter case player I applies an action affecting the transitions and entitling him to receive a running reward from player II. It is supposed that player I has a no-null and constant risk-sensitivity coefficient, and that player II tries to minimize the utility of player I. The performance of a pair of decision strategies is measured by the risk-sensitive (expected) total reward of player I and, besides mild continuity-compactness conditions, the main structural assumption on the model is the existence of an absorbing state which is accessible from any starting point. In this context, it is shown that the value function of the game is characterized by an equilibrium equation, and the existence of a Nash equilibrium is established. (English) |
Keyword:
|
monotone operator |
Keyword:
|
fixed point |
Keyword:
|
equilibrium equation |
Keyword:
|
hitting time |
Keyword:
|
bounded rewards |
Keyword:
|
certainty equivalent |
MSC:
|
60J05 |
MSC:
|
93C55 |
MSC:
|
93E20 |
idZBL:
|
Zbl 07511613 |
idMR:
|
MR4405949 |
DOI:
|
10.14736/kyb-2022-1-0101 |
. |
Date available:
|
2022-04-08T07:54:43Z |
Last updated:
|
2022-08-11 |
Stable URL:
|
http://hdl.handle.net/10338.dmlcz/149604 |
. |
Reference:
|
[1] Alanís-Durán, A., Cavazos-Cadena, R.: An optimality system for finite average Markov decision chains under risk-aversion..Kybernetika 48 (2012), 83-104. MR 2932929 |
Reference:
|
[2] Altman, E., Shwartz, A.: Constrained Markov games: Nash equilibria..In: Annals of Dynamic Games (V. Gaitsgory, J. Filar, and K. Mizukami, eds.), Birkhauser, Boston 2000, pp. 213-221. MR 1764491 |
Reference:
|
[3] Atar, R., Budhiraja, A.: A stochastic differential game for the inhomogeneous Laplace equation..Ann. Probab. 38 (2010), 2, 498-531. MR 2642884, |
Reference:
|
[4] Balaji, S., Meyn, S. P.: Multiplicative ergodicity and large deviations for an irreducible Markov chain..Stoch. Proc. Appl. 90 (2000), 1, 123-144. MR 1787128, |
Reference:
|
[5] Bäuerle, N., Rieder, U.: Markov Decision Processes with Applications to Finance..Springer, New York 2011. Zbl 1236.90004, MR 2808878 |
Reference:
|
[6] Bäuerle, N., Rieder, U.: More risk-sensitive Markov decision processes..Math. Oper. Res. 39 (2014), 1, 105-120. MR 3173005, |
Reference:
|
[7] Bäuerle, N., Rieder, U.: Zero-sum risk-sensitive stochastic games..Stoch. Proc. Appl. 127 (2017), 2, 622-642. MR 3583765, |
Reference:
|
[8] Bielecki, T. R., Hernández-Hernández, D., Pliska, S. R.: Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management..Mathematical Methods of OR 50 (1999), 167-188. Zbl 0959.91029, MR 1732397, |
Reference:
|
[9] Borkar, V. S., Meyn, S. F.: Risk-sensitive optimal control for Markov decision process with monotone cost..Math. Oper. Res. 27 (2002), 1, 192-209. MR 1886226, |
Reference:
|
[10] Cavazos-Cadena, R., Hernández-Hernández, D.: A system of Poisson equations for a non-constant {Varadhan} functional on a finite state space..Appl. Math. Optim. 53 (2006), 101-119. MR 2190228, |
Reference:
|
[11] Cavazos-Cadena, R., Hernández-Hernández, D.: Nash equilibria in a class of Markov stopping games..Kybernetika 48 (2012), 5, 1027-1044. MR 3086867 |
Reference:
|
[12] Cavazos-Cadena, R., Rodríguez-Gutiérrez, L., Sánchez-Guillermo, D. M.: Markov stopping games with an absorbing state and total reward criterion..Kybernetika 57 (2021), 474-492. MR 4299459, |
Reference:
|
[13] Denardo, E. V., Rothblum, U. G.: A turnpike theorem for a risk-sensitive Markov decision process with stopping..SIAM J. Control Optim. 45 (2006), 2, 414-431. MR 2246083, |
Reference:
|
[14] Masi, G. B. Di, Stettner, L.: Risk-sensitive control of discrete time Markov processes with infinite horizon..SIAM J. Control Optim. 38 (1999), 1, 61-78. MR 1740607, |
Reference:
|
[15] Masi, G. B. Di, Stettner, L.: Infinite horizon risk sensitive control of discrete time Markov processes with small risk..Syst. Control Lett. 40 (2000), 15-20. Zbl 0977.93083, MR 1829070, |
Reference:
|
[16] Masi, G. B. Di, Stettner, L.: Infinite horizon risk sensitive control of discrete time Markov processes under minorization property..SIAM J. Control Optim. 46 (2007), 1, 231-252. MR 2299627, |
Reference:
|
[17] A.Filar, J., Vrieze, O. J.: Competitive Markov Decision Processes..Springer, New York 1996. MR 1418636 |
Reference:
|
[18] Hernández-Lerma, O.: Adaptive Markov Control Processes..Springer, New York 1989. Zbl 0677.93073, MR 0995463 |
Reference:
|
[19] Howard, R. A., Matheson, J. E.: Risk-sensitive Markov decision processes..Manage. Sci. 18 (1972), 7, 349-463. MR 0292497, |
Reference:
|
[20] Jaśkiewicz, A.: Average optimality for risk sensitive control with general state space..Ann. Appl. Probab. 17 (2007), 2, 654-675. MR 2308338, |
Reference:
|
[21] Kolokoltsov, V. N., Malafeyev, O. A.: Understanding Game Theory..World Scientific, Singapore 2010. Zbl 1189.91001, MR 2666863 |
Reference:
|
[22] Kontoyiannis, I., Meyn, S. P.: Spectral theory and limit theorems for geometrically ergodic Markov processes..Ann. Appl. Probab. 13 (2003), 1, 304-362. MR 1952001, |
Reference:
|
[23] Martínez-Cortés, V. M.: Bi-personal stochastic transient Markov games with stopping times and total reward criterion..Kybernetika 57 (2021), 1, 1-14. MR 4231853, |
Reference:
|
[24] Peskir, G.: On the American option problem..Math. Finance 15 (2007), 169-181. Zbl 1109.91028, MR 2116800, |
Reference:
|
[25] Peskir, G., Shiryaev, A.: Optimal Stopping and Free-Boundary Problems..Birkhauser, Boston 2006. Zbl 1115.60001, MR 2256030 |
Reference:
|
[26] Pitera, M., Stettner, L.: Long run risk sensitive portfolio with general factors..Math. Meth. Oper. Res. 82 (2016), 2, 265-293. MR 3489700, |
Reference:
|
[27] Puterman, M. L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming..Wiley, New York 1994. Zbl 1184.90170, MR 1270015 |
Reference:
|
[28] Shapley, L. S.: Stochastic games..Proc. National Academy Sci. 39 (1953), 10, 1095-1100. Zbl 1180.91042, MR 0061807 |
Reference:
|
[29] Shiryaev, A.: Optimal Stopping Rules..Springer, New York 2008. Zbl 1138.60008, MR 2374974 |
Reference:
|
[30] Sladký, K.: Growth rates and average optimality in risk-sensitive Markov decision chains..Kybernetika 44 (2008), 2, 205-226. MR 2428220 |
Reference:
|
[31] Sladký, K.: Risk-sensitive average optimality in Markov decision processes..Kybernetika 54 (2018), 6, 1218-1230. MR 3902630, |
Reference:
|
[32] Stettner, L.: Risk sensitive portfolio optimization..Math. Meth. Oper. Res. 50 (1999), 3, 463-474. MR 1731299, |
Reference:
|
[33] Zachrisson, L. E.: Markov Games..Princeton University Press 12, Princeton 1964. MR 0170729 |
. |