Previous |  Up |  Next

Article

Title: Denumerable Markov stopping games with risk-sensitive total reward criterion (English)
Author: Torres-Gomar, Manuel A.
Author: Cavazos-Cadena, Rolando
Author: Cruz-Suárez, Hugo
Language: English
Journal: Kybernetika
ISSN: 0023-5954 (print)
ISSN: 1805-949X (online)
Volume: 60
Issue: 1
Year: 2024
Pages: 1-18
Summary lang: English
.
Category: math
.
Summary: This paper studies Markov stopping games with two players on a denumerable state space. At each decision time player II has two actions: to stop the game paying a terminal reward to player I, or to let the system to continue it evolution. In this latter case, player I selects an action affecting the transitions and charges a running reward to player II. The performance of each pair of strategies is measured by the risk-sensitive total expected reward of player I. Under mild continuity and compactness conditions on the components of the model, it is proved that the value of the game satisfies an equilibrium equation, and the existence of a Nash equilibrium is established. (English)
Keyword: monotone operator
Keyword: fixed point
Keyword: equilibrium equation
Keyword: Nash equilibrium
Keyword: hitting time
Keyword: bounded rewards
MSC: 91A10
MSC: 91A15
DOI: 10.14736/kyb-2024-1-0001
.
Date available: 2024-04-12T10:07:27Z
Last updated: 2024-04-12
Stable URL: http://hdl.handle.net/10338.dmlcz/152341
.
Reference: [1] Alanís-Durán, A., Cavazos-Cadena, R.: An optimality system for finite average Markov decision chains under risk-aversion..Kybernetika 48 (2012), 1, 83-104. MR 2932929
Reference: [2] Bäuerle, N., Rieder, U.: Markov Decision Processes with Applications to Finance..Springer-Verlag, New York 2011. Zbl 1236.90004, MR 2808878
Reference: [3] Bäuerle, N., Rieder, U.: More risk-sensitive Markov decision processes..Math. Oper. Res. 39 (2014), 1, 105-120. MR 3173005,
Reference: [4] Balaji, S., Meyn, S. P.: Multiplicative ergodicity and large deviations for an irreducible Markov chain..Stoch. Proc. Appl. 90 (2000), 1, 123-144. MR 1787128,
Reference: [5] Bielecki, T., Hernández-Hernández, D., Pliska, S. R.: Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management..Math. Methods Oper. Res. 50 (1999), 167-188. Zbl 0959.91029, MR 1732397,
Reference: [6] Borkar, V. S., Meyn, S. P.: Risk-sensitive optimal control for Markov decision process with monotone cost..Math. Oper. Res. 27 (2002), 1, 192-209. MR 1886226,
Reference: [7] Cavazos-Cadena, R., Hernández-Hernández, D.: A system of Poisson equations for a non-constant Varadhan functional on a finite state space..Appl. Math. Optim. 53 (2006), 101-119. MR 2190228,
Reference: [8] Cavazos-Cadena, R., Hernández-Hernández, D.: Nash equilibrium in a class of Markov stopping games..Kybernetika 48 (2012), 1027-1044. MR 3086867
Reference: [9] Cavazos-Cadena, R., Rodríguez-Gutiérrez, L., Sánchez-Guillermo, D. M.: Markov stopping game with an absorbing state..Kybernetika 57 (2021), 3, 474-492. MR 4299459,
Reference: [10] Cavazos-Cadena, R., Cantú-Sifuentes, M., Cerda-Delgado, I.: Nash equilibria in a class of Markov stopping games with total reward criterion..Math. Methods Oper. Res. 94 (2021), 319-340. MR 4338528,
Reference: [11] Denardo, E. V., Rothblum, U. G.: A turnpike theorem for A risk-sensitive Markov decision process with stopping..SIAM J. Control Optim. 45 (2006), 2, 414-431. MR 2246083,
Reference: [12] Masi, G. B. Di, Stettner, L.: Risk-sensitive control of discrete-time Markov processes with infinite horizon..SIAM J. Control Optim. 38 (1999), 1, 61-78. MR 1740607,
Reference: [13] Masi, G. B. Di, Stettner, L.: Infinite horizon risk sensitive control of discrete time Markov processes with small risk..Systems Control Lett. 40 (2000), 1, 305-321. Zbl 0977.93083, MR 1829070,
Reference: [14] Masi, G. B. Di, Stettner, L.: Infinite horizon risk sensitive control of discrete time Markov processes under minorization property..SIAM J. Control Optim. 46 (2007), 1, 231-252. MR 2299627,
Reference: [15] Hernández-Lerma, O.: Adaptive Markov Control Processes..Springer, New York 1988. Zbl 0677.93073, MR 0983898
Reference: [16] Howard, R., Matheson, J.: Risk-sensitive Markov decision processes..Management Science 18 (1972), 356-369. MR 0292497,
Reference: [17] Jaśkiewicz, A.: Average optimality for risk sensitive control with general state space..Ann. App. Probab. 17 (2007), 2, 654-675. MR 2308338,
Reference: [18] Kontoyiannis, I., Meyn, S. P.: Spectral theory and limit theorems for geometrically ergodic Markov processes..Ann. App. Probab. 13 (2003), 1, 304-362. MR 1952001,
Reference: [19] López-Rivero, J., Cavazos-Cadena, R., Cruz-Suárez, H.: Risk-sensitive Markov stopping games with an absorbing state..Kybernetika 58 (2022), 1, 101-122. MR 4405949,
Reference: [20] Martínez-Cortés, V. M.: Bipersonal stochastic transient Markov games with stopping times and total reward criteria..Kybernetika 57 (2021), 1, 1-14. MR 4231853,
Reference: [21] Pitera, M., Stettner, L.: Long run risk sensitive portfolio with general factors..Math. Meth. Oper. Res. 82 (2016), 2, 265-293. MR 3489700,
Reference: [22] Puterman, M.: Markov Decision Processes..Wiley, New York 1994. Zbl 1184.90170, MR 1270015
Reference: [23] Sladký, K.: Ramsey growth model under uncertainty..In: Proc. 27th International Conference Mathematical Methods in Economics 2009 (H. Brozová, ed.), Kostelec nad Cernými lesy 2009, pp. 296-300.
Reference: [24] Sladký, K.: Risk-sensitive Ramsey growth model..In: Proce. 27th International Conference Mathematical Methods in Economics 2010 (M. Houda and J. Friebelová, eds.), Ceské Budějovice 2010, pp. 1-6.
Reference: [25] Sladký, K.: Risk-sensitive average optimality in Markov decision processes..Kybernetika 54 (2018), 1218-1230. MR 3902630,
Reference: [26] Stettner, L.: Risk sensitive portfolio optimization..Math. Meth. Oper. Res. 50 (1999), 3, 463-474. MR 1731299,
.

Files

Files Size Format View
Kybernetika_60-2024-1_1.pdf 479.3Kb application/pdf View/Open
Back to standard record
Partner of
EuDML logo