A preliminary release of the Italian Parliamentary Corpus

IRIS

Political debates have been used for years in political and social studies on languages and their cultures. In this paper, we release a preliminary version of the Italian Parliamentary Corpus, a dataset containing 1.2 billion words that includes the political debates in the Italian Parliament from 1848 to 2018. The data has been collected applying an Optical Character Recognition (OCR) software to the original documents, available in PDF format on the websites of Camera dei Deputati and Senato della Repubblica

A preliminary release of the Italian Parliamentary Corpus / Frasnelli, V., Palmero Aprosio, A.. - 3596:(2023). (9th Italian Conference on Computational Linguistics, CLiC-it 2023 Venezia, Italia 30th November - 2nd December 2023).

A preliminary release of the Italian Parliamentary Corpus

Frasnelli, Valentino^Co-primo;Palmero Aprosio, Alessio^Co-primo

2023-01-01

Abstract

Political debates have been used for years in political and social studies on languages and their cultures. In this paper, we release a preliminary version of the Italian Parliamentary Corpus, a dataset containing 1.2 billion words that includes the political debates in the Italian Parliament from 1848 to 2018. The data has been collected applying an Optical Character Recognition (OCR) software to the original documents, available in PDF format on the websites of Camera dei Deputati and Senato della Repubblica

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the 9th Italian Conference on Computational Linguistics
			
	Luogo di edizione (Place of publication)
	
				Venezia, Italia
			
	Casa editrice (Publisher)
	
				CEUR-WS
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85181166764
			
	Tutti gli autori
	
						Frasnelli, Valentino; Palmero Aprosio, Alessio
					
	Citazione
	
				A preliminary release of the Italian Parliamentary Corpus / Frasnelli, V., Palmero Aprosio, A.. - 3596:(2023). (9th Italian Conference on Computational Linguistics, CLiC-it 2023 Venezia, Italia 30th November - 2nd December 2023).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
short11.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Creative commons Dimensione 1.06 MB Formato Adobe PDF Visualizza/Apri	1.06 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/412712

Citazioni

ND

0

ND

ND

social impact