Intelligent Reconfigurable Intelligent Surface Control Using Machine Learning for Future Wireless Communications

Hassan, Muhammad Abul

Reconfigurable intelligent surfaces (RIS) have emerged as a key enabler for beyond-5G and 6G wireless networks by enabling programmable propagation environments. However, realizing these gains in practice requires joint optimization of base-station precoding, RIS configuration, power allocation, and channel estimation under hardware and feedback constraints. This dissertation develops learning-based methods to address these challenges in different RIS-assisted communication scenarios. The first part of this dissertation deals with system-level optimization. A three-dimensional RIS-assisted MISO downlink is formulated as a reinforcement learning problem in which a Randomized Ensembled Double Q-learning (REDQ) agent is trained and learns how to adjust the base-station beamforming vectors and RIS phase shifts to maximize the spectral efficiency of multiple users under different RIS elements and wireless signal power levels. Simulation results indicate consistent gains in average spectral efficiency over well-known model-free, off-policy baselines, including Deep Deterministic Policy Gradient (DDPG), Twin Delayed Deep Deterministic Policy Gradient (TD3), and Soft Actor-Critic (SAC). The same idea is then extended to a wideband OFDM RIS-aided NOMA setting with more realistic assumptions: a beyond-diagonal RIS with finite phase resolution, delayed and noisy CSI feedback, and explicit checks on the feasibility of successive interference cancellation. For this scenario, an OFE–REDQ agent is introduced that jointly adapts the RIS configuration and the inter- and intra-cluster power allocation, leading to improved sum rate, faster and more stable learning, and more reliable SIC compared with competing schemes such as REDQ and DDPG. The second part of the dissertation focuses on measurement-based modelling and hardware efficiency. Using real sub-6 GHz RIS measurements collected in anechoic chamber, a one-dimensional convolutional neural network (1D-CNN) is proposed and trained to map beam-steering angles to received power using k-fold cross-validation, to outperform standard feed-forward and recurrent networks such as Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU) and Fully Connected Feedforward Neural Networks (FCN). This is followed by the Row-Vector Transformer (RVT), which treats each sample as a single dense token so that all features are processed jointly; RVT achieves lower prediction errors and more stable training than 1D-CNN, LSTM, GRU, FCN, and Feature Tokenizer (FT)-Transformer across several datasets. To link prediction with actuation, a multi-task architecture based on Symmetric-PCGrad is proposed that simultaneously selects the active group of RIS elements and predicts the corresponding received power, allowing the system to meet quality-of-service targets with a reduced number of active elements. Finally, a sub-terahertz drone scenario is investigated, in which a multi-layer LSTM is trained for channel estimation in a Flying-RIS link supported by a ground base station, and is found to outperform GRU-, CNN-, and fully connected baselines. In summary, the dissertation develops and evaluates a set of learning-based tools for intelligent RIS control, covering system optimization, data-driven power prediction, energy-aware element activation, and Flying-RIS channel estimation. The proposed methods improve spectral efficiency, reduce operational overhead, and increase robustness, and they provide concrete design guidance for future RIS-enabled wireless networks.

Intelligent Reconfigurable Intelligent Surface Control Using Machine Learning for Future Wireless Communications / Hassan, Muhammad Abul. - (2026 Mar 19), pp. 1-139.

Intelligent Reconfigurable Intelligent Surface Control Using Machine Learning for Future Wireless Communications

Hassan, Muhammad Abul

2026-03-19

Abstract

Reconfigurable intelligent surfaces (RIS) have emerged as a key enabler for beyond-5G and 6G wireless networks by enabling programmable propagation environments. However, realizing these gains in practice requires joint optimization of base-station precoding, RIS configuration, power allocation, and channel estimation under hardware and feedback constraints. This dissertation develops learning-based methods to address these challenges in different RIS-assisted communication scenarios. The first part of this dissertation deals with system-level optimization. A three-dimensional RIS-assisted MISO downlink is formulated as a reinforcement learning problem in which a Randomized Ensembled Double Q-learning (REDQ) agent is trained and learns how to adjust the base-station beamforming vectors and RIS phase shifts to maximize the spectral efficiency of multiple users under different RIS elements and wireless signal power levels. Simulation results indicate consistent gains in average spectral efficiency over well-known model-free, off-policy baselines, including Deep Deterministic Policy Gradient (DDPG), Twin Delayed Deep Deterministic Policy Gradient (TD3), and Soft Actor-Critic (SAC). The same idea is then extended to a wideband OFDM RIS-aided NOMA setting with more realistic assumptions: a beyond-diagonal RIS with finite phase resolution, delayed and noisy CSI feedback, and explicit checks on the feasibility of successive interference cancellation. For this scenario, an OFE–REDQ agent is introduced that jointly adapts the RIS configuration and the inter- and intra-cluster power allocation, leading to improved sum rate, faster and more stable learning, and more reliable SIC compared with competing schemes such as REDQ and DDPG. The second part of the dissertation focuses on measurement-based modelling and hardware efficiency. Using real sub-6 GHz RIS measurements collected in anechoic chamber, a one-dimensional convolutional neural network (1D-CNN) is proposed and trained to map beam-steering angles to received power using k-fold cross-validation, to outperform standard feed-forward and recurrent networks such as Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU) and Fully Connected Feedforward Neural Networks (FCN). This is followed by the Row-Vector Transformer (RVT), which treats each sample as a single dense token so that all features are processed jointly; RVT achieves lower prediction errors and more stable training than 1D-CNN, LSTM, GRU, FCN, and Feature Tokenizer (FT)-Transformer across several datasets. To link prediction with actuation, a multi-task architecture based on Symmetric-PCGrad is proposed that simultaneously selects the active group of RIS elements and predicts the corresponding received power, allowing the system to meet quality-of-service targets with a reduced number of active elements. Finally, a sub-terahertz drone scenario is investigated, in which a multi-layer LSTM is trained for channel estimation in a Flying-RIS link supported by a ground base station, and is found to outperform GRU-, CNN-, and fully connected baselines. In summary, the dissertation develops and evaluates a set of learning-based tools for intelligent RIS control, covering system optimization, data-driven power prediction, energy-aware element activation, and Flying-RIS channel estimation. The proposed methods improve spectral efficiency, reduce operational overhead, and increase robustness, and they provide concrete design guidance for future RIS-enabled wireless networks.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di esame finale/Defended on
	
				19-mar-2026
			
	Ciclo
	
				XXXVIII
			
	Anno Accademico
	
				2024-2025
			
	Dipartimento
	
				Ingegneria e scienza dell'Informaz (29/10/12-)
			
	Corso di dottorato
	
				Informatica e telecomunicazioni (fino a.a. 2020-21, 36° ciclo)
			
	Supervisore/Relatore di tesi Unitn (Unitn internal supervisor)
	
				Granelli, Fabrizio
			
	Supervisore/Relatore di tesi esterno (External supervisor)
	
				Bassoli, Riccardo; Astudillo Trujillo, Carlos Alberto;
			
	Supervisore aggiunto/Correlatore esterno (External Co-supervisor)
	
				Ali Hassan Sodhro
			
	Tesi in cotutela (Bi-nationally supervised Doctoral Thesis)
	
				no
			
	Lingua (Language)
	
				Inglese
			
	Appare nelle tipologie:
	
				08.1 Tesi di dottorato (Doctoral Thesis)

File in questo prodotto:

File	Dimensione	Formato
Intelligent_Reconfigurable_Intelligent_Surface_Control_Using_Machine_Learning_for_Future_Wireless_Communications.pdf embargo fino al 15/04/2027 Descrizione: Doctoral Thesis Tipologia: Tesi di dottorato (Doctoral Thesis) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 58.18 MB Formato Adobe PDF Visualizza/Apri	58.18 MB	Adobe PDF	Visualizza/Apri