• xmlui.mirage2.page-structure.header.title
    • français
    • English
  • Help
  • Login
  • Language 
    • Français
    • English
View Item 
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesTypeThis CollectionBy Issue DateAuthorsTitlesType

My Account

LoginRegister

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors
Thumbnail - Request a copy

Playout Policy Adaptation with Move Features

Cazenave, Tristan (2016), Playout Policy Adaptation with Move Features, Theoretical Computer Science, 644, p. 43-52. 10.1016/j.tcs.2016.06.024

Type
Article accepté pour publication ou publié
Date
2016
Journal name
Theoretical Computer Science
Volume
644
Publisher
Elsevier
Pages
43-52
Publication identifier
10.1016/j.tcs.2016.06.024
Metadata
Show full item record
Author(s)
Cazenave, Tristan
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Abstract (EN)
Monte Carlo Tree Search (MCTS) is the state of the art algorithm for General Game Playing (GGP). We propose to learn a playout policy online so as to improve MCTS for GGP. We also propose to learn a policy not only using the moves but also according to the features of the moves. We test the resulting algorithms named Playout Policy Adaptation (PPA) and Playout Policy Adaptation with move Features (PPAF) on Atarigo, Breakthrough, Misere Breakthrough, Domineering, Misere Domineering, Knightthrough, MisereKnightthrough and Nogo. The experiments compare PPA and PPAF to Upper Confidence for Trees (UCT) and to the closely related Move-Average Sampling Technique (MAST) algorithm.
Subjects / Keywords
Computer Games; Monte Carlo Tree Search; Reinforcement Learning; Playout policy; Machine learning

Related items

Showing items related by title and author.

  • Thumbnail
    Application of the Nested Rollout Policy Adaptation Algorithm to the Traveling Salesman Problem with Time Windows 
    Cazenave, Tristan; Teytaud, Fabien (2012) Communication / Conférence
  • Thumbnail
    Beam Nested Rollout Policy Adaptation 
    Cazenave, Tristan; Teytaud, Fabien (2012) Communication / Conférence
  • Thumbnail
    Enhancing Playout Policy Adaptation for General Game Playing 
    Sironi, Chiara; Cazenave, Tristan; Winands, Mark (2021) Communication / Conférence
  • Thumbnail
    Stabilized Nested Rollout Policy Adaptation 
    Cazenave, Tristan; Sevestre, Jean-Baptiste; Toulemont, Matthieu (2020) Communication / Conférence
  • Thumbnail
    Stabilized Nested Rollout Policy Adaptation 
    Cazenave, Tristan; Sevestre, Jean-Baptiste; Toulemont, Matthieu (2020) Communication / Conférence
Dauphine PSL Bibliothèque logo
Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16
Phone: 01 44 05 40 94
Contact
Dauphine PSL logoEQUIS logoCreative Commons logo