Most recent approaches to 3D pose estimation from RGB-D images address the problem in a two-stage pipeline. First, they learn a classifier-typically a random forest-to predict the position of each input pixel on the object surface. These estimates are then used to define an energy function that is minimized w.r.t. the object pose. In this paper, we focus on the first stage of the problem and propose a novel classifier based on a depth-aware Convolutional Neural Network. This classifier is able to learn a scale-adaptive regression model that yields very accurate pixel-level predictions, allowing to finally estimate the pose using a simple RANSAC-based scheme, with no need to optimize complex ad hoc energy functions. Our experiments on publicly available datasets show that our approach achieves remarkable improvements over state-of-the-art methods.

Depth-aware convolutional neural networks for accurate 3D pose estimation in RGB-D images / Porzi, Lorenzo; Penate-Sanchez, Adrian; Ricci, Elisa; Moreno-Noguer, Francesc. - 2017-:(2017), pp. 5777-5783. ( 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2017 Vancouver, Canada December 2017) [10.1109/IROS.2017.8206469].

Depth-aware convolutional neural networks for accurate 3D pose estimation in RGB-D images

Ricci, Elisa;
2017-01-01

Abstract

Most recent approaches to 3D pose estimation from RGB-D images address the problem in a two-stage pipeline. First, they learn a classifier-typically a random forest-to predict the position of each input pixel on the object surface. These estimates are then used to define an energy function that is minimized w.r.t. the object pose. In this paper, we focus on the first stage of the problem and propose a novel classifier based on a depth-aware Convolutional Neural Network. This classifier is able to learn a scale-adaptive regression model that yields very accurate pixel-level predictions, allowing to finally estimate the pose using a simple RANSAC-based scheme, with no need to optimize complex ad hoc energy functions. Our experiments on publicly available datasets show that our approach achieves remarkable improvements over state-of-the-art methods.
2017
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems
Porzi, Lorenzo
Piscataway, New Jersey, US
IEEE
978-1-5386-2682-5
Porzi, Lorenzo; Penate-Sanchez, Adrian; Ricci, Elisa; Moreno-Noguer, Francesc
Depth-aware convolutional neural networks for accurate 3D pose estimation in RGB-D images / Porzi, Lorenzo; Penate-Sanchez, Adrian; Ricci, Elisa; Moreno-Noguer, Francesc. - 2017-:(2017), pp. 5777-5783. ( 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2017 Vancouver, Canada December 2017) [10.1109/IROS.2017.8206469].
File in questo prodotto:
File Dimensione Formato  
08206469.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 3.13 MB
Formato Adobe PDF
3.13 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/194349
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 8
  • OpenAlex ND
social impact