Model-Based Relational RL When Object Existence is Partially Observable

Ngo Anh Vien, Marc Toussaint

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

We consider learning and planning in relational MDPs when object existence is uncertain and new objects may appear or disappear depending on previous actions or properties of other objects. Optimal policies actively need to discover objects to achieve a goal; planning in such domains in general amounts to a POMDP problem, where the belief is about the existence and properties of potential not-yet-discovered objects. We propose a computationally efficient extension of model-based relational RL methods that approximates these beliefs using discrete uncertainty predicates. In this formulation the belief update is learned using probabilistic rules and planning in the approximated belief space can be achieved using an extension of existing planners. We prove that the learned belief update rules encode an approximation of the exact belief updates of a POMDP formulation and demonstrate experimentally that the proposed approach successfully learns a set of relational rules appropriate to solve such problems.
Original languageEnglish
Title of host publicationThe 31th International Conference on Machine Learning
Subtitle of host publicationICML
Pages559-567
Number of pages9
Volume32
Publication statusPublished - 2014
EventInternational Conference on Machine Learning - Beijing, China
Duration: 21 Jun 201426 Oct 2017

Conference

ConferenceInternational Conference on Machine Learning
CountryChina
CityBeijing
Period21/06/201426/10/2017

Fingerprint Dive into the research topics of 'Model-Based Relational RL When Object Existence is Partially Observable'. Together they form a unique fingerprint.

  • Cite this

    Anh Vien, N., & Toussaint, M. (2014). Model-Based Relational RL When Object Existence is Partially Observable. In The 31th International Conference on Machine Learning: ICML (Vol. 32, pp. 559-567) http://proceedings.mlr.press/v32/ngo14.pdf