Interactive Groupwise Comparison for Reinforcement Learning from Human Feedback


Reinforcement learning from human feedback (RLHF) has emerged as a key enabling technology for aligning AI behaviour with human preferences. The traditional way to collect data in RLHF is via pairwise comparisons: human raters are asked to indicate which one of two samples they prefer. We present an interactive visualisation that better exploits the human visual ability to compare and explore whole groups of samples. The interface is comprised of two linked views: 1) an exploration view showing a contextual overview of all sampled behaviours organised in a hierarchical clustering structure; and 2) a comparison view displaying two selected groups of behaviours for user queries. Users can efficiently explore large sets of behaviours by iterating between these two views. Additionally, we devised an active learning approach suggesting groups for comparison. As shown by our evaluation in six simulated robotics tasks, our approach increases the final rewards by 69.34%. It leads to lower error rates and better policies. We open-source the code that can be easily integrated into the RLHF training loop, supporting research on human–AI alignment.

Interactive Groupwise Comparison for Reinforcement Learning from Human Feedback / Kompatscher, Jan; Shi, Danqing; Varni, Giovanna; Weinkauf, Tino; Oulasvirta, Antti. - In: COMPUTER GRAPHICS FORUM. - ISSN 0167-7055. - 2025:(2025). [10.1111/cgf.70290]


Kompatscher, Jan; Shi, Danqing; Varni, Giovanna; Weinkauf, Tino; Oulasvirta, Antti

2025-01-01



Date of publication: 2025
Journal title: COMPUTER GRAPHICS FORUM
DOI: https://dx.doi.org/10.1111/cgf.70290
Scopus identifier: 2-s2.0-105021975665
WOS identifier: WOS:001615460600001
All authors: Kompatscher, Jan; Shi, Danqing; Varni, Giovanna; Weinkauf, Tino; Oulasvirta, Antti



Use this identifier to cite or link to this document: https://hdl.handle.net/11572/472691

