We introduce GMOT-Mamba, a novel Mamba-based model prediction framework for Generic Multiple Object Tracking (GMOT) in video sequences. Our approach features a Weighted Feature Pooling (WFP) layer, which processes encoded target states, and an innovative encoder-decoder architecture that leverages Vision-Mamba (ViM) to predict filter weights. We train our model on combinations of large-scale datasets to capture strong priors and discriminative features necessary for generic object tracking. Through extensive experiments and ablation studies, we demonstrate the effectiveness of our approach, showcasing its competitive performance against state-of-the-art GMOT methods while outperforming SOT methods in both accuracy and inference speed. Our findings underscore the potential of Mamba for enhancing model prediction in visual tracking applications.

GMOT-Mamba: Mamba-Based Model Prediction For Generic Multiple Object Tracking / Verma, Shashikant; Sebe, Nicu; Raman, Shanmuganathan. - (2025), pp. 1253-1258. ( ICIP Anchorage, AK, USA 14-17 September 2025) [10.1109/icip55913.2025.11084714].

GMOT-Mamba: Mamba-Based Model Prediction For Generic Multiple Object Tracking

Sebe, Nicu;
2025-01-01

Abstract

We introduce GMOT-Mamba, a novel Mamba-based model prediction framework for Generic Multiple Object Tracking (GMOT) in video sequences. Our approach features a Weighted Feature Pooling (WFP) layer, which processes encoded target states, and an innovative encoder-decoder architecture that leverages Vision-Mamba (ViM) to predict filter weights. We train our model on combinations of large-scale datasets to capture strong priors and discriminative features necessary for generic object tracking. Through extensive experiments and ablation studies, we demonstrate the effectiveness of our approach, showcasing its competitive performance against state-of-the-art GMOT methods while outperforming SOT methods in both accuracy and inference speed. Our findings underscore the potential of Mamba for enhancing model prediction in visual tracking applications.
2025
2025 IEEE International Conference on Image Processing (ICIP)
New York
IEEE
979-8-3315-2379-4
Verma, Shashikant; Sebe, Nicu; Raman, Shanmuganathan
GMOT-Mamba: Mamba-Based Model Prediction For Generic Multiple Object Tracking / Verma, Shashikant; Sebe, Nicu; Raman, Shanmuganathan. - (2025), pp. 1253-1258. ( ICIP Anchorage, AK, USA 14-17 September 2025) [10.1109/icip55913.2025.11084714].
File in questo prodotto:
File Dimensione Formato  
GMOT-Mamba_Mamba-Based_Model_Prediction_For_Generic_Multiple_Object_Tracking.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 3.47 MB
Formato Adobe PDF
3.47 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/467774
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 1
social impact