Item request has been placed!

Item request cannot be made.

Processing Request

Policy transfer using Value Function as Prior Information

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Aittahar, Samy; Sootla, Aivar
المصدر:
Phd Forum, Riva del Garda, Italy [IT], 19th September 2016 - 23th September 2016
الموضوع:
Transfer Learning; Reinforcement Learning; Engineering; computing & technology; Computer science; Ingénierie; informatique & technologie; Sciences informatiques
نوع التسجيلة:
conference object
اللغة:
English

معلومة اضافية
- Contributors:
  Ernst, Damien
- الموضوع:
  2016
- Collection:
  University of Liège: ORBi (Open Repository and Bibliography)
- نبذة مختصرة :
  This work proposes an approach based on reward shaping techniques in a reinforcement learning setting to approximate the opti- mal decision-making process (also called the optimal policy) in a desired task with a limited amount of data. We extract prior information from an existing family of policies have been used as a heuristic to help the construction of the new one under this challenging condition. We use this approach to study the relationship between the similarity of two tasks and the minimal amount of data needed to compute a near-optimal pol- icy for the second one using the prior information of the existing policy. Preliminary results show that for the least similar existing task consid- ered compared to the desired one, only 10% of the dataset was needed to compute the corresponding near-optimal policy.
- Relation:
  https://orbi.uliege.be/handle/2268/221884; info:hdl:2268/221884; https://orbi.uliege.be/bitstream/2268/221884/1/report_springer.pdf
- الدخول الالكتروني :
  https://orbi.uliege.be/handle/2268/221884
  https://orbi.uliege.be/bitstream/2268/221884/1/report_springer.pdf
- Rights:
  open access ; http://purl.org/coar/access_right/c_abf2 ; info:eu-repo/semantics/openAccess
- الرقم المعرف:
  edsbas.EE9E5C20

تعليقات

No Comments.