
Guest Editorial: Introduction to the Special Section on Communication-Efficient Distributed Machine Learning / Chu, X.; Giunchiglia, F.; Neglia, G.; Gregg, D.; Liu, J.. - In: IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING. - ISSN 2327-4697. - 9:4(2022), pp. 1949-1950. [10.1109/TNSE.2022.3181503]

Guest Editorial: Introduction to the Special Section on Communication-Efficient Distributed Machine Learning

Giunchiglia, F.; Liu, J.
2022-01-01

Abstract

The papers in this special section focus on communication-efficient distributed machine learning. Machine learning, especially deep learning, has been successfully applied in a wealth of practical AI applications in fields such as computer vision, natural language processing, healthcare, finance, and robotics. With the increasing size of machine learning models and training data sets, training deep learning models requires a significant amount of computation and may take days to months on a single GPU or TPU. It has therefore become common practice to exploit distributed machine learning to accelerate the training process with multiple processors. Distributed machine learning typically requires the processors to exchange information repeatedly throughout the training process. As the computing power of AI processors grows rapidly, data communication among processors gradually becomes the performance bottleneck and severely limits system scalability, as predicted by Amdahl's law. The ...
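The scalability limit the abstract attributes to Amdahl's law can be made concrete with a small sketch. Assuming communication and other synchronization overheads behave as the serial fraction of training, the attainable speedup on n processors is S(n) = 1 / ((1 - p) + p/n), where p is the parallelizable fraction; the figures below are illustrative, not taken from the editorial.

```python
def amdahl_speedup(parallel_fraction: float, n_processors: int) -> float:
    """Speedup predicted by Amdahl's law when a fraction of the work
    (here, assumed to be communication/synchronization) stays serial."""
    serial = 1.0 - parallel_fraction
    return 1.0 / (serial + parallel_fraction / n_processors)

# Even with 95% of training time parallelizable, speedup saturates
# well below the processor count as n grows:
for n in (8, 64, 1024):
    print(n, round(amdahl_speedup(0.95, n), 2))
```

Note that the speedup is bounded above by 1/(1 - p) no matter how many processors are added, which is why reducing the communication fraction itself is the focus of this special section.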
2022
4
Chu, X.; Giunchiglia, F.; Neglia, G.; Gregg, D.; Liu, J.
Files in this product:
There are no files associated with this product.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/464155
Warning: the data displayed have not been validated by the university.

Citations
  • PMC: ND
  • Scopus: 1
  • Web of Science (ISI): 1
  • OpenAlex: ND