Experiments with Adaptive Transfer Rate in Reinforcement Learning
Chevaleyre, Yann; Machado Pamponet, Aydano; Zucker, Jean-Daniel (2009), Experiments with Adaptive Transfer Rate in Reinforcement Learning, in Richards, Debbie; Kang, Byeong-Ho, Knowledge Acquisition: Approaches, Algorithms and Applications. Pacific Rim Knowledge Acquisition Workshop, PKAW 2008, Hanoi, Vietnam, December 15-16, 2008, Revised Selected Papers, Springer : Berlin, p. 1-11. http://dx.doi.org/10.1007/978-3-642-01715-5_1
Type
Communication / ConférenceDate
2009Conference title
Pacific Rim Knowledge Acquisition Workshop, PKAW 2008Conference date
2008-12Conference city
HanoiConference country
Viêt NamBook title
Knowledge Acquisition: Approaches, Algorithms and Applications. Pacific Rim Knowledge Acquisition Workshop, PKAW 2008, Hanoi, Vietnam, December 15-16, 2008, Revised Selected PapersBook author
Richards, Debbie; Kang, Byeong-HoPublisher
Springer
Series title
Lecture Notes in Computer ScienceSeries number
5465Published in
Berlin
ISBN
978-3-642-01714-8
Pages
1-11
Publication identifier
Metadata
Show full item recordAbstract (EN)
Transfer algorithms allow the use of knowledge previously learned on related tasks to speed-up learning of the current task. Recently, many complex reinforcement learning problems have been successfully solved by efficient transfer learners. However, most of these algorithms suffer from a severe flaw: they are implicitly tuned to transfer knowledge between tasks having a given degree of similarity. In other words, if the previous task is very dissimilar (resp. nearly identical) to the current task, then the transfer process might slow down the learning (resp. might be far from optimal speed-up). In this paper, we address this specific issue by explicitly optimizing the transfer rate between tasks and answer to the question : “can the transfer rate be accurately optimized, and at what cost ?”. We show that this optimization problem is related to the continuum bandit problem. We then propose a generic adaptive transfer method (AdaTran), which allows to extend several existing transfer learning algorithms to optimize the transfer rate. Finally, we run several experiments validating our approach.Subjects / Keywords
transfer algorithmsRelated items
Showing items related by title and author.
-
Zucker, Jean-Daniel; Machado Pamponet, Aydano; Chevaleyre, Yann (2006) Communication / Conférence
-
Nguyen, Thanh Hai; Prifti, Edi; Chevaleyre, Yann; Sokolovska, Nataliya; Zucker, Jean-Daniel (2018) Communication / Conférence
-
Thanh Hai, Nguyen; Chevaleyre, Yann; Prifti, Edi; Sokolovska, Nataliya; Zucker, Jean-Daniel (2017) Communication / Conférence
-
Belahcene, Khaled; Sokolovska, Nataliya; Chevaleyre, Yann; Zucker, Jean-Daniel (2020) Communication / Conférence
-
Sokolovska, Nataliya; Chevaleyre, Yann; Zucker, Jean-Daniel (2018) Communication / Conférence