Title:
|
Markov decision processes on finite spaces with fuzzy total rewards (English) |
Author:
|
Carrero-Vera, Karla |
Author:
|
Cruz-Suárez, Hugo |
Author:
|
Montes-de-Oca, Raúl |
Language:
|
English |
Journal:
|
Kybernetika |
ISSN:
|
0023-5954 (print) |
ISSN:
|
1805-949X (online) |
Volume:
|
58 |
Issue:
|
2 |
Year:
|
2022 |
Pages:
|
180-199 |
Summary lang:
|
English |
Category:
|
math |
Summary:
|
The paper concerns Markov decision processes (MDPs) in which both the state and the decision spaces are finite and the objective function is the total reward. For this class of MDPs, the authors assume that the reward function is fuzzy. Specifically, the fuzzy reward function has a suitable trapezoidal shape that is a function of a standard non-fuzzy reward. The fuzzy control problem consists of determining a control policy that maximizes the fuzzy expected total reward, where the maximization is taken with respect to the partial order on the $\alpha$-cuts of fuzzy numbers. The optimal policy and the optimal value function of the fuzzy optimal control problem are characterized by means of the dynamic programming equation of the standard optimal control problem. The main conclusions are that the optimal policies of the standard problem and the fuzzy one coincide and that the fuzzy optimal value function has a convenient trapezoidal form. As illustrations, fuzzy extensions of an optimal stopping problem and of a red-black gambling model are presented. (English) |
Keyword:
|
Markov decision process |
Keyword:
|
total reward |
Keyword:
|
fuzzy reward |
Keyword:
|
trapezoidal fuzzy number |
Keyword:
|
optimal stopping problem |
Keyword:
|
gambling model |
MSC:
|
90C40 |
MSC:
|
93C40 |
idZBL:
|
Zbl 07584152 |
idMR:
|
MR4467492 |
DOI:
|
10.14736/kyb-2022-2-0180 |
Date available:
|
2022-07-29T12:08:16Z |
Last updated:
|
2023-03-13 |
Stable URL:
|
http://hdl.handle.net/10338.dmlcz/150463 |
Reference:
|
[1] Abbasbandy, S., Hajjari, T.: A new approach for ranking of trapezoidal fuzzy numbers. Comput. Math. Appl. 57 (2009), 413-419. MR 2488614
Reference:
|
[2] Ban, A. I.: Triangular and parametric approximations of fuzzy numbers inadvertences and corrections. Fuzzy Sets and Systems 160 (2009), 3048-3058. MR 2567092
Reference:
|
[3] Bartle, R. G.: The Elements of Integration. Wiley, New York 1995. MR 0200398
Reference:
|
[4] Bellman, R. E., Zadeh, L. A.: Decision-making in a fuzzy environment. Management Sci. 17 (1970), 141-164. MR 0301613
Reference:
|
[5] Cavazos-Cadena, R., Montes-de-Oca, R.: Existence of optimal stationary policies in finite dynamic programs with nonnegative rewards. Probab. Engrg. Inform. Sci. 15 (2001), 557-564. MR 1852975
Reference:
|
[6] Chen, S. H.: Operations of fuzzy numbers with step form membership function using function principle. Information Sci. 108 (1998), 149-155. Zbl 0922.04007, MR 1632503
Reference:
|
[7] Diamond, P., Kloeden, P.: Metric Spaces of Fuzzy Sets: Theory and Applications. World Scientific, Singapore 1994. MR 1337027
Reference:
|
[8] Driankov, D., Hellendoorn, H., Reinfrank, M.: An Introduction to Fuzzy Control. Springer Science and Business Media, New York 2013. MR 3010569
Reference:
|
[9] Efendi, R., Arbaiy, N., Deris, M. M.: A new procedure in stock market forecasting based on fuzzy random auto-regression time series model. Information Sci. 441 (2018), 113-132. MR 3771167
Reference:
|
[10] Fakoor, M., Kosari, A., Jafarzadeh, M.: Humanoid robot path planning with fuzzy Markov decision processes. J. Appl. Res. Tech. 14 (2016), 300-310.
Reference:
|
[11] Furukawa, N.: Parametric orders on fuzzy numbers and their roles in fuzzy optimization problems. Optimization 40 (1997), 171-192. MR 1620380
Reference:
|
[12] Kurano, M., Yasuda, M., Nakagami, J., Yoshida, Y.: Markov decision processes with fuzzy rewards. In: Proc. Int. Conf. on Nonlinear Analysis, Hirosaki 2002, pp. 221-232. MR 1986973
Reference:
|
[13] López-Díaz, M., Ralescu, D. A.: Tools for fuzzy random variables: embeddings and measurabilities. Comput. Statist. Data Anal. 51 (2006), 109-114. MR 2297590
Reference:
|
[14] Pedrycz, W.: Why triangular membership functions? Fuzzy Sets and Systems 64 (1994), 21-30. MR 1281283
Reference:
|
[15] Puri, M. L., Ralescu, D. A.: Fuzzy random variables. J. Math. Anal. Appl. 114 (1986), 402-422. MR 0833596
Reference:
|
[16] Puterman, M. L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. First edition. Wiley-Interscience, California 2005. MR 1270015
Reference:
|
[17] Rezvani, S., Molani, M.: Representation of trapezoidal fuzzy numbers with shape function. Ann. Fuzzy Math. Inform. 8 (2014), 89-112. MR 3214770
Reference:
|
[18] Ross, S.: Dynamic programming and gambling models. Adv. Appl. Probab. 6 (1974), 593-606. MR 0347381
Reference:
|
[19] Ross, S.: Introduction to Stochastic Dynamic Programming. Academic Press, New York 1983. MR 0749232
Reference:
|
[20] Semmouri, A., Jourhmane, M., Belhallaj, Z.: Discounted Markov decision processes with fuzzy costs. Ann. Oper. Res. 295 (2020), 769-786. MR 4181708
Reference:
|
[21] Syropoulos, A., Grammenos, T.: A Modern Introduction to Fuzzy Mathematics. Wiley, New Jersey 2020.
Reference:
|
[22] Zadeh, L.: Fuzzy sets. Inform. Control 8 (1965), 338-353. Zbl 0942.00007, MR 0219427
Reference:
|
[23] Zeng, W., Li, H.: Weighted triangular approximation of fuzzy numbers. Int. J. Approx. Reason. 46 (2007), 137-150. MR 2362230