New regulations on transparency and the recent policy for privacy force the public administration (PA) to make their documents available, but also to limit the diffusion of personal data. The present work displays a first approach to the extraction of sensitive data from PA documents in terms of named entities and semantic relations among them, speeding up the process of extraction of these personal data in order to easily select those which need to be hidden. We also present the process of collection and annotation of the dataset.

REDIT: A Tool and Dataset for Extraction of Personal Data in Documents of the Public Administration Domain / Paccosi, Teresa; Palmero, Alessio. - 3033:(2022). (Intervento presentato al convegno CLiC-it - 2021 Italian Conference on Computational Linguistics 2021 tenutosi a Milano, Italia nel 29th June - 1st July 2021).

REDIT: A Tool and Dataset for Extraction of Personal Data in Documents of the Public Administration Domain

Paccosi, Teresa;Palmero, Alessio
2022-01-01

Abstract

New regulations on transparency and the recent policy for privacy force the public administration (PA) to make their documents available, but also to limit the diffusion of personal data. The present work displays a first approach to the extraction of sensitive data from PA documents in terms of named entities and semantic relations among them, speeding up the process of extraction of these personal data in order to easily select those which need to be hidden. We also present the process of collection and annotation of the dataset.
2022
Proceedings of the Eighth Italian Conference on Computational Linguistics
Milano, Italia
CEUR
Paccosi, Teresa; Palmero, Alessio
REDIT: A Tool and Dataset for Extraction of Personal Data in Documents of the Public Administration Domain / Paccosi, Teresa; Palmero, Alessio. - 3033:(2022). (Intervento presentato al convegno CLiC-it - 2021 Italian Conference on Computational Linguistics 2021 tenutosi a Milano, Italia nel 29th June - 1st July 2021).
File in questo prodotto:
File Dimensione Formato  
paper58.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 426.58 kB
Formato Adobe PDF
426.58 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/412716
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact