Generalized Policy Learning with Planning

Yang, Ryan P.

dc.contributor.advisor	Kaelbling, Leslie P.
dc.contributor.author	Yang, Ryan P.
dc.date.accessioned	2025-10-06T17:37:02Z
dc.date.available	2025-10-06T17:37:02Z
dc.date.issued	2025-05
dc.date.submitted	2025-06-23T14:04:36.023Z
dc.identifier.uri	https://hdl.handle.net/1721.1/162962
dc.description.abstract	Generalized policy learning seeks to find policies that solve multiple tasks within a planning domain. We introduce methods to search for policies independently in a domain from empty initialized policies. As an extension, we also propose a problem setting to learn satisficing policies between domains. In an independent domain, we propose a score function to guide the policy search. Our approach, Policy-Guided Planning for Generalized Policy Generation (PG3), evaluates policies based on how well it can be used to plan. Empirically, we show that PG3 allows generalized policy learning to occur more efficiently than other baselines with PDDL-based problems and policies represented as lifted decision lists. Finally, our experiments show that policies independently learned are qualitiatively similar, prompting further investigation on the possibilities of further accelerating the policy search process.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright retained by author(s)
dc.rights.uri	https://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Generalized Policy Learning with Planning
dc.type	Thesis
dc.description.degree	M.Eng.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
thesis.degree.name	Master of Engineering in Electrical Engineering and Computer Science

Files in this item

Name:: yang-ryanyang-meng-eecs-2025-t ...
Size:: 4.342Mb
Format:: PDF
Description:: Thesis PDF

View/Open

This item appears in the following Collection(s)

Graduate Theses

Show simple item record