Breadth-First Search (BFS) performance on shared-memory systems is often limited by irregular memory access and cache inefficiencies. This work presents two optimizations for BFS graph traversal: a bitmap-based algorithm designed for small-diameter graphs and MergedCSR, a graph storage format that improves cache locality for large-scale graphs. Experimental results on real-world datasets show an average 1.3x speedup over a state-of-the-art implementation, with MergedCSR reducing RAM accesses by approximately 15%.

Breadth-First Search (BFS) performance on shared-memory systems is often limited by irregular memory access and cache inefficiencies. This work presents two optimizations for BFS graph traversal: a bitmap-based algorithm designed for small-diameter graphs and MergedCSR, a graph storage format that improves cache locality for large-scale graphs. Experimental results on real-world datasets show an average 1.3× speedup over a state-of-the-art implementation, with MergedCSR reducing RAM accesses by approximately 15%.

Cache-optimized BFS on multi-core CPUs / Domenico Andaloro, Salvatore; Pasquali, Thomas; Vella, Flavio. - (2025), pp. 23-27. ( 1st FastCode Programming Challenge, FCPC 2025 usa 2025) [10.1145/3711708.3723452].

Cache-optimized BFS on multi-core CPUs

Thomas Pasquali
;
Flavio Vella
2025-01-01

Abstract

Breadth-First Search (BFS) performance on shared-memory systems is often limited by irregular memory access and cache inefficiencies. This work presents two optimizations for BFS graph traversal: a bitmap-based algorithm designed for small-diameter graphs and MergedCSR, a graph storage format that improves cache locality for large-scale graphs. Experimental results on real-world datasets show an average 1.3x speedup over a state-of-the-art implementation, with MergedCSR reducing RAM accesses by approximately 15%.
2025
FCPC '25: Proceedings of the 1st FastCode Programming Challenge
1601 Broadway, 10th Floor, NEW YORK, NY, UNITED STATES
Association for Computing Machinery, Inc
9798400714467
Settore INF/01 - Informatica
Settore INFO-01/A - Informatica
Domenico Andaloro, Salvatore; Pasquali, Thomas; Vella, Flavio
Cache-optimized BFS on multi-core CPUs / Domenico Andaloro, Salvatore; Pasquali, Thomas; Vella, Flavio. - (2025), pp. 23-27. ( 1st FastCode Programming Challenge, FCPC 2025 usa 2025) [10.1145/3711708.3723452].
File in questo prodotto:
File Dimensione Formato  
3711708.3723452.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 732.54 kB
Formato Adobe PDF
732.54 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/474134
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex 0
social impact