• xmlui.mirage2.page-structure.header.title
    • français
    • English
  • Help
  • Login
  • Language 
    • Français
    • English
View Item 
  •   BIRD Home
  • CEREMADE (UMR CNRS 7534)
  • CEREMADE : Publications
  • View Item
  •   BIRD Home
  • CEREMADE (UMR CNRS 7534)
  • CEREMADE : Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesTypeThis CollectionBy Issue DateAuthorsTitlesType

My Account

LoginRegister

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors
Thumbnail - No thumbnail

Blackwell's Approachability with Time-Dependent Outcome Functions and Dot Products. Application to the Big Match

Kwon, Joon; Ziliotto, Bruno (2023), Blackwell's Approachability with Time-Dependent Outcome Functions and Dot Products. Application to the Big Match. https://basepub.dauphine.psl.eu/handle/123456789/24827

View/Open
2303.04956.pdf (415.9Kb)
Type
Document de travail / Working paper
External document link
https://hal.science/hal-04046399
Date
2023
Series title
Cahier de recherche CEREMADE, Université Paris Dauphine-PSL
Published in
Paris
Pages
16
Metadata
Show full item record
Author(s)
Kwon, Joon cc
Mathématiques et Informatique Appliquées [MIA Paris-Saclay]
Ziliotto, Bruno
CEntre de REcherches en MAthématiques de la DEcision [CEREMADE]
Abstract (EN)
Blackwell's approachability is a very general sequential decision framework where a Decision Maker obtains vector-valued outcomes, and aims at the convergence of the average outcome to a given "target" set. Blackwell gave a sufficient condition for the decision maker having a strategy guaranteeing such a convergence against an adversarial environment, as well as what we now call the Blackwell's algorithm, which then ensures convergence. Blackwell's approachability has since been applied to numerous problems, in online learning and game theory, in particular. We extend this framework by allowing the outcome function and the dot product to be time-dependent. We establish a general guarantee for the natural extension to this framework of Blackwell's algorithm. In the case where the target set is an orthant, we present a family of time-dependent dot products which yields different convergence speeds for each coordinate of the average outcome. We apply this framework to the Big Match (one of the most important toy examples of stochastic games) where an ϵ-uniformly optimal strategy for Player I is given by Blackwell's algorithm in a well-chosen auxiliary approachability problem.

Related items

Showing items related by title and author.

  • Thumbnail
    On a probabilistic approach to the Schrödinger equation with a time-dependent potential 
    Doss, Halim (2011) Article accepté pour publication ou publié
  • Thumbnail
    Global-in-time existence of solutions to the multiconfiguration time-dependent Hartree-Fock equations: A sufficient condition 
    Trabelsi, Saber; Mauser, Norbert; Bardos, Claude; Catto, Isabelle (2009) Article accepté pour publication ou publié
  • Thumbnail
    Time-dependent rescalings and Lyapunov functionals for the Vlasov-Poisson and Euler-Poisson systems, and for related models of kinetic equations, fluid dynamics and quantum physics 
    Dolbeault, Jean; Rein, Gerhard (2001) Article accepté pour publication ou publié
  • Thumbnail
    Setting and analysis of the multi-configuration time-dependent Hartree-Fock equations 
    Bardos, Claude; Catto, Isabelle; Mauser, Norbert; Trabelsi, Saber (2010) Article accepté pour publication ou publié
  • Thumbnail
    Time-Dependent Rescalings and Dispersion for the Boltzmann Equation 
    Dolbeault, Jean (1999) Document de travail / Working paper
Dauphine PSL Bibliothèque logo
Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16
Phone: 01 44 05 40 94
Contact
Dauphine PSL logoEQUIS logoCreative Commons logo