House allocation with randomly generated preference lists
Benhamou, Eric; Saltiel, David; Ohana, Jean-Jacques; Atif, Jamal (2021), House allocation with randomly generated preference lists, 2020 25th International Conference on Pattern Recognition (ICPR), IEEE - Institute of Electrical and Electronics Engineers : Piscataway, NJ, p. 10050 - 10057. 10.1109/ICPR48806.2021.9412958
Type
Communication / ConférenceExternal document link
https://arxiv.org/abs/2009.07200Date
2021Conference title
2020 25th International Conference on Pattern Recognition (ICPR)Conference date
2021-01Conference city
MilanConference country
ItalyBook title
2020 25th International Conference on Pattern Recognition (ICPR)Publisher
IEEE - Institute of Electrical and Electronics Engineers
Published in
Piscataway, NJ
ISBN
978-1-7281-8808-9
Number of pages
10743Pages
10050 - 10057
Publication identifier
Metadata
Show full item recordAuthor(s)
Benhamou, Eric
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Saltiel, David
Laboratoire d'Informatique Signal et Image de la Côte d'Opale [LISIC]
Ohana, Jean-Jacques
Multi Assets Solutions, Homa Capital
Atif, Jamal
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Abstract (EN)
Deep reinforcement learning (DRL) has reached super human levels in complex tasks like game solving (Go [1], StarCraft II [2]), and autonomous driving [3]. However, it remains an open question whether DRL can reach human level in applications to financial problems and in particular in detecting pattern crisis and consequently dis-investing. In this paper, we present an innovative DRL framework consisting in two sub-networks fed respectively with portfolio strategies past performances and standard deviations as well as additional contextual features. The second sub network plays an important role as it captures dependencies with common financial indicators features like risk aversion, economic surprise index and correlations between assets that allows taking into account context based information. We compare different network architectures either using layers of convolutions to reduce network's complexity or LSTM block to capture time dependency and whether previous allocations is important in the modeling. We also use adversarial training to make the final model more robust. Results on test set show this approach substantially over-performs traditional portfolio optimization methods like Markovitz and is able to detect and anticipate crisis like the current Covid one.Subjects / Keywords
Economics; Training; Adaptation models; Correlation; Reinforcement learning; Resource management; IndexesRelated items
Showing items related by title and author.
-
Benhamou, Éric; Saltiel, David; Ohana, Jean-Jacques; Atif, Jamal; Laraki, Rida Communication / Conférence
-
Ohana, Jean Jacques; Benhamou, Éric; Saltiel, David; Guez, Beatrice (2021) Document de travail / Working paper
-
Benhamou, Éric; Saltiel, David; Ungari, Sandrine; Mukhopadhyay, Abhishek; Atif, Jamal (2020) Document de travail / Working paper
-
Saltiel, David; Benhamou, Eric; Laraki, Rida; Atif, Jamal (2021) Communication / Conférence
-
Benhamou, Éric; Ohana, Jean; Saltiel, David; Guez, Beatrice (2021) Document de travail / Working paper