Markov decision processes on finite spaces with fuzzy total rewards

Carrero-Vera, Karla; Cruz-Suárez, Hugo; Montes-de-Oca, Raúl

About DML-CZ | FAQ | Conditions of Use | Math Archives | Contact Us

Previous | Up | Next

Article

Title:	Markov decision processes on finite spaces with fuzzy total rewards (English)
Author:	Carrero-Vera, Karla
Author:	Cruz-Suárez, Hugo
Author:	Montes-de-Oca, Raúl
Language:	English
Journal:	Kybernetika
ISSN:	0023-5954 (print)
ISSN:	1805-949X (online)
Volume:	58
Issue:	2
Year:	2022
Pages:	180-199
Summary lang:	English
.
Category:	math
.
Summary:	The paper concerns Markov decision processes (MDPs) with both the state and the decision spaces being finite and with the total reward as the objective function. For such a kind of MDPs, the authors assume that the reward function is of a fuzzy type. Specifically, this fuzzy reward function is of a suitable trapezoidal shape which is a function of a standard non-fuzzy reward. The fuzzy control problem consists of determining a control policy that maximizes the fuzzy expected total reward, where the maximization is made with respect to the partial order on the $\alpha$-cuts of fuzzy numbers. The optimal policy and the optimal value function for the fuzzy optimal control problem are characterized by means of the dynamic programming equation of the standard optimal control problem and, as main conclusions, it is obtained that the optimal policy of the standard problem and the fuzzy one coincide and the fuzzy optimal value function is of a convenient trapezoidal form. As illustrations, fuzzy extensions of an optimal stopping problem and of a red-black gambling model are presented. (English)
Keyword:	Markov decision process
Keyword:	total reward
Keyword:	fuzzy reward
Keyword:	trapezoidal fuzzy number
Keyword:	optimal stopping problem
Keyword:	gambling model
MSC:	90C40
MSC:	93C40
idZBL:	Zbl 07584152
idMR:	MR4467492
DOI:	10.14736/kyb-2022-2-0180
.
Date available:	2022-07-29T12:08:16Z
Last updated:	2023-03-13
Stable URL:	http://hdl.handle.net/10338.dmlcz/150463
.
Reference:	[1] Abbasbandy, S., Hajjari, T.: A new approach for ranking of trapezoidal fuzzy numbers..Comput. Math. Appl. 57 (2009), 413-419. MR 2488614,
Reference:	[2] Ban, A. I.: Triangular and parametric approximations of fuzzy numbers inadvertences and corrections..Fuzzy Sets and Systems 160 (2009), 3048-3058. MR 2567092,
Reference:	[3] Bartle, R. G.: The Elements of Integration..Wiley, New York 1995. MR 0200398
Reference:	[4] Bellman, R. E., Zadeh, L. A.: Decision-making in a fuzzy enviroment..Management Sci. 17 (1970), 141-164. MR 0301613,
Reference:	[5] Cavazos-Cadena, R., Montes-de-Oca, R.: Existence of optimal stationary policies in finite dynamic programs with nonnegative rewards..Probab. Engrg. Inform. Sci. 15 (2001), 557-564. MR 1852975,
Reference:	[6] Chen, S. H.: Operations of fuzzy numbers with step form membership function using function principle..Information Sci. 108 (1998), 149-155. Zbl 0922.04007, MR 1632503,
Reference:	[7] Diamond, P., Kloeden, P.: Metric Spaces of Fuzzy Sets: Theory and Applications..World Scientific, Singapore 1994. MR 1337027
Reference:	[8] Driankov, D., Hellendoorn, H., Reinfrank, M.: An Introduction to Fuzzy Control..Springer Science and Business Media, New York 2013. MR 3010569
Reference:	[9] Efendi, R., Arbaiy, N., Deris, M. M.: A new procedure in stock market forecasting based on fuzzy random auto-regression time series model..Information Sci. 441 (2018), 113-132. MR 3771167,
Reference:	[10] Fakoor, M., Kosari, A., Jafarzadeh, M.: Humanoid robot path planning with fuzzy Markov decision processes..J. Appl. Res. Tech. 14 (2016), 300-310.
Reference:	[11] Furukawa, N.: Parametric orders on fuzzy numbers and their roles in fuzzy optimization problems..Optimization 40 (1997), 171-192. MR 1620380,
Reference:	[12] Kurano, M., Yasuda, M., Nakagami, J., Yoshida, Y.: Markov decision processes with fuzzy rewards..In: Proc. Int. Conf. on Nonlinear Analysis, Hirosaki 2002, pp. 221-232. MR 1986973
Reference:	[13] López-Díaz, M., Ralescu, D. A.: Tools for fuzzy random variables: embeddings and measurabilities..Comput. Statist. Data Anal. 51 (2006), 109-114. MR 2297590,
Reference:	[14] Pedrycz, W.: Why triangular membership functions?..Fuzzy Sets and Systems 64 (1994), 21-30. MR 1281283,
Reference:	[15] Puri, M. L., Ralescu, D. A.: Fuzzy random variable..J. Math. Anal. Appl. 114 (1986), 402-422. MR 0833596,
Reference:	[16] Puterman, M. L.: Markov Decision Processes: Discrete Stochastic Dynamic. First edition..Wiley-Interscience, California 2005. MR 1270015
Reference:	[17] Rezvani, S., Molani, M.: Representation of trapezoidal fuzzy numbers with shape function..Ann. Fuzzy Math. Inform. 8 (2014), 89-112. MR 3214770
Reference:	[18] Ross, S.: Dynamic programming and gambling models..Adv. Appl. Probab. 6 (1974), 593-606. MR 0347381,
Reference:	[19] Ross, S.: Introduction to Stochastic Dynamic Programming..Academic Press, New York 1983. MR 0749232
Reference:	[20] Semmouri, A., Jourhmane, M., Belhallaj, Z.: Discounted Markov decision processes with fuzzy costs..Ann. Oper. Res. 295 (2020), 769-786. MR 4181708,
Reference:	[21] Syropoulos, A., Grammenos, T.: A Modern Introduction to Fuzzy Mathematics..Wiley, New Jersey 2020.
Reference:	[22] Zadeh, L.: Fuzzy sets..Inform. Control 8 (1965), 338-353. Zbl 0942.00007, MR 0219427,
Reference:	[23] Zeng, W., Li, H.: Weighted triangular approximation of fuzzy numbers..Int. J. Approx. Reason. 46 (2007), 137-150. MR 2362230,
.

Files

Files	Size	Format	View
Kybernetika_58-2022-2_3.pdf	458.5Kb	application/pdf	View/Open

Back to standard record

Browse
- Collections
- Titles
- Authors
- MSC

About DML-CZ

Partner of

Article

Files

Search

Browse