Learning general policies for planning through GPT models

Rossetti, Nicholas

Transformer-based architectures, such as BERT, GPT and T5, have achieved remarkable results across various Natural Language Processing (NLP) tasks. Beyond these linguistic capabilities, these Large Language Models (LLMs) exhibit varying degrees of factual knowledge, common sense reasoning, and even programming capabilities. However, their effectiveness in performing logical inference and automated planning remains an open question. Recent attempts to apply LLMs to Classical Planning have produced mixed results. In this thesis, we tackle this challenge by introducing PlanGPT, a GPT-based model trained from scratch on solved planning instances to learn a general policy for Classical Planning. By leveraging domain-specific training data and incorporating automated planning knowledge, PlanGPT can generate solution plans for unseen problems within the same domain, demonstrating good coverage and performance relative to other deep learning approaches. However, there are no formal guarantees of validity and PlanGPT can produce invalid plans that fail to meet all goals or contain actions with unsatisfied preconditions. To mitigate these problems, we propose two approaches. First, we incorporate a validator directly into the generation process, which allows us to prune invalid partial plans on the fly and generate valid solutions. Second, we combine PlanGPT with a plan-repair planner, LPG, which refines invalid or incomplete candidate plans into fully valid solutions. Our empirical evaluations across diverse Classical Planning domains confirm the efficacy of these strategies. Ultimately, this work demonstrates the potential of integrating learned policies with model-based reasoning.
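The first mitigation in the abstract, checking validity during generation so that invalid partial plans are pruned, can be illustrated with a minimal sketch. This is not PlanGPT's implementation: the STRIPS-style `Action` type, the `generate_valid_plan` function, the toy two-move domain, and the `flawed_ranker` standing in for the learned model are all hypothetical names introduced here for illustration only.

```python
from typing import Callable, FrozenSet, List, NamedTuple, Optional

State = FrozenSet[str]


class Action(NamedTuple):
    name: str
    pre: FrozenSet[str]     # facts that must hold before the action
    add: FrozenSet[str]     # facts the action makes true
    delete: FrozenSet[str]  # facts the action makes false


def applicable(state: State, action: Action) -> bool:
    """Validator check: all preconditions hold in the current state."""
    return action.pre <= state


def progress(state: State, action: Action) -> State:
    """Apply the action's effects to obtain the successor state."""
    return (state - action.delete) | action.add


def generate_valid_plan(
    init: State,
    goal: FrozenSet[str],
    rank_actions: Callable[[State, List[str]], List[Action]],
    max_len: int = 20,
) -> Optional[List[str]]:
    """Greedy generation that discards inapplicable candidates at each step."""
    state, plan = init, []
    for _ in range(max_len):
        if goal <= state:
            return plan
        # Keep the ranker's order, but prune actions the validator rejects.
        candidates = [a for a in rank_actions(state, plan) if applicable(state, a)]
        if not candidates:
            return None  # dead end: no valid continuation was proposed
        best = candidates[0]
        plan.append(best.name)
        state = progress(state, best)
    return plan if goal <= state else None


# Toy domain (assumption): a robot moves from room a to room c via room b.
MOVE_A_B = Action("move_a_b", frozenset({"at_a"}), frozenset({"at_b"}), frozenset({"at_a"}))
MOVE_B_C = Action("move_b_c", frozenset({"at_b"}), frozenset({"at_c"}), frozenset({"at_b"}))


def flawed_ranker(state: State, plan: List[str]) -> List[Action]:
    """Stand-in for a learned policy that always prefers move_b_c first."""
    return [MOVE_B_C, MOVE_A_B]


plan = generate_valid_plan(frozenset({"at_a"}), frozenset({"at_c"}), flawed_ranker)
print(plan)
```

Without the applicability filter, the stand-in ranker would emit `move_b_c` as its first action, which has an unsatisfied precondition in the initial state. With the filter, that candidate is dropped and the next-ranked applicable action is taken instead. A greedy loop like this can still hit dead ends, which is where a search over several candidates or a plan-repair step would come in.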



Faculty/Department: Università degli Studi di Roma "La Sapienza"
Degree programme: Other doctoral programme
Publication date: 30 May 2025
Language: English
Supervisor, Advisor or Tutor: GEREVINI, ALFONSO EMILIO
Co-supervisor, Examiner, Co-tutor or Coordinators: LENZERINI, Maurizio
Publisher: Università degli Studi di Roma "La Sapienza"
Number of pages: 121
Collection: Università degli Studi di Roma La Sapienza

Files in this item:

File	Size	Format
Tesi_dottorato_Rossetti.pdf (open access; license: all rights reserved)	2.86 MB	Adobe PDF

Documents in UNITESI are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14242/212174

The NBN code of this thesis is URN:NBN:IT:UNIROMA1-212174