
Strong Uniform Value in Gambling Houses and Partially Observable Markov Decision Processes
Venel, Xavier; Ziliotto, Bruno (2016), Strong Uniform Value in Gambling Houses and Partially Observable Markov Decision Processes, SIAM Journal on Control and Optimization, 54(4), pp. 1983-2008. doi:10.1137/15M1043340
Type
Article accepted for publication or published
Date
2016
Journal name
SIAM Journal on Control and Optimization
Volume
54
Issue
4
Publisher
SIAM - Society for Industrial and Applied Mathematics
Pages
1983-2008
Author(s)
Venel, Xavier
Centre d'économie de la Sorbonne [CES]
Ziliotto, Bruno
CEntre de REcherches en MAthématiques de la DEcision [CEREMADE]
Abstract (EN)
In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the existence of a robust notion of value for the infinitely repeated problem, namely the strong uniform value. This solves two open problems. First, this shows that for any ε > 0, the decision-maker has a pure strategy σ which is ε-optimal in any n-stage problem, provided that n is big enough (this result was only known for behavior strategies, that is, strategies which use randomization). Second, for any ε > 0, the decision-maker can guarantee the limit of the n-stage value minus ε in the infinite problem where the payoff is the expectation of the inferior limit of the time average payoff.
Keywords
dynamic programming; Markov decision processes; partial observation; uniform value; long-run average payoff