According to a recent trend of research, there is a growing interest in applications of machine learning techniques to business analytics. In this work, both supervised and unsupervised machine learning techniques are applied to the analysis of a dataset made of both family and non-family firms. This is worth investigating, because the two kinds of firms typically differ in some aspects related to performance, which can be reflected in balance sheet data. First, binary classification techniques are applied to discriminate the two kinds of firms, by combining an unlabeled dataset with the labels provided by a survey. The most important features for performing such binary classification are identified. Then, clustering is applied to highlight why supervised learning can be effective in the previous task, by showing that most of the largest clusters found are quite unequally populated by the two classes.
Machine Learning Application to Family Business Status Classification / Gnecco, G.; Amato, S.; Patuelli, A.; Lattanzi, N.. - 12565:(2020), pp. 25-36. (Intervento presentato al convegno LOD 2022 tenutosi a Certosa di Pontignano nel 19-23 luglio 2020) [10.1007/978-3-030-64583-0_3].
Machine Learning Application to Family Business Status Classification
Amato S.;
2020-01-01
Abstract
According to a recent trend of research, there is a growing interest in applications of machine learning techniques to business analytics. In this work, both supervised and unsupervised machine learning techniques are applied to the analysis of a dataset made of both family and non-family firms. This is worth investigating, because the two kinds of firms typically differ in some aspects related to performance, which can be reflected in balance sheet data. First, binary classification techniques are applied to discriminate the two kinds of firms, by combining an unlabeled dataset with the labels provided by a survey. The most important features for performing such binary classification are identified. Then, clustering is applied to highlight why supervised learning can be effective in the previous task, by showing that most of the largest clusters found are quite unequally populated by the two classes.File | Dimensione | Formato | |
---|---|---|---|
8. Gnecco2020_Chapter.pdf
Solo gestori archivio
Descrizione: GAPL_LOD2020
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
238.42 kB
Formato
Adobe PDF
|
238.42 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione