An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms

Custode, Leonardo Lucio; Caraffini, Fabio; Yaman, Anil; Iacca, Giovanni

doi:10.1145/3638530.3664163

Hyperparameter optimization is a crucial problem in Evolutionary Computation. In fact, the values of the hyperparameters directly impact the trajectory taken by the optimization process, and their choice requires extensive reasoning by human operators. Although a variety of self-adaptive Evolutionary Algorithms have been proposed in the literature, no definitive solution has been found. In this work, we perform a preliminary investigation to automate the reasoning process that leads to the choice of hyperparameter values. We employ two open-source Large Language Models (LLMs), namely Llama2-70b and Mixtral, to analyze the optimization logs online and provide novel real-time hyperparameter recommendations. We study our approach in the context of step-size adaptation for (1 + 1)-ES. The results suggest that LLMs can be an effective method for optimizing hyperparameters in Evolution Strategies, encouraging further research in this direction.

An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms / Custode, L.L., Caraffini, F., Yaman, A., Iacca, G.. - (2024), pp. 1838-1845. (2024 Genetic and Evolutionary Computation Conference Companion, GECCO 2024 Companion Melbourne 14th July- 18th July 2024) [10.1145/3638530.3664163].

An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms

Leonardo Lucio Custode;Fabio Caraffini;Anil Yaman;Giovanni Iacca

2024-01-01

Abstract

Hyperparameter optimization is a crucial problem in Evolutionary Computation. In fact, the values of the hyperparameters directly impact the trajectory taken by the optimization process, and their choice requires extensive reasoning by human operators. Although a variety of self-adaptive Evolutionary Algorithms have been proposed in the literature, no definitive solution has been found. In this work, we perform a preliminary investigation to automate the reasoning process that leads to the choice of hyperparameter values. We employ two open-source Large Language Models (LLMs), namely Llama2-70b and Mixtral, to analyze the optimization logs online and provide novel real-time hyperparameter recommendations. We study our approach in the context of step-size adaptation for (1 + 1)-ES. The results suggest that LLMs can be an effective method for optimizing hyperparameters in Evolution Strategies, encouraging further research in this direction.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2024
			
	Titolo del volume (Proceedings title)
	
				GECCO '24 Companion: Proceedings of the Genetic and Evolutionary Computation Conference Companion
			
	Luogo di edizione (Place of publication)
	
				New York
			
	Casa editrice (Publisher)
	
				Association for Computing Machinery, Inc
			
	ISBN
	
				9798400704956
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85201935509
			
	Codice WOS (WOS identifier)
	
				WOS:001535131300281
			
	Tutti gli autori
	
						Custode, Leonardo Lucio; Caraffini, Fabio; Yaman, Anil; Iacca, Giovanni
					
	Citazione
	
				An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms / Custode, L.L., Caraffini, F., Yaman, A., Iacca, G.. - (2024), pp. 1838-1845. (2024 Genetic and Evolutionary Computation Conference Companion, GECCO 2024 Companion Melbourne 14th July- 18th July 2024) [10.1145/3638530.3664163].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
3638530.3664163.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Creative commons Dimensione 695.49 kB Formato Adobe PDF Visualizza/Apri	695.49 kB	Adobe PDF	Visualizza/Apri