Text classification plays an important role in natural language processing and has many real-world applications. Short text classification, one of its subtopics, has attracted increasing interest from researchers because it is more challenging owing to semantic sparsity and insufficient labeled data. Recent studies attempt to combine graph learning and contrastive learning to alleviate these problems. Despite their success, several inherent limitations remain. First, the generation of augmented views may disrupt the semantic structure within the text and introduce negative effects due to noise permutation. Second, they ignore the clustering-friendly features in unlabeled data and fail to further exploit the prior information in the few valuable labeled data. To this end, we propose a novel model that utilizes improved Graph contrastIve learning for short text classiFicaTion (GIFT). Specifically, we construct a heterogeneous graph containing several component graphs by mining an internal corpus and introducing an external knowledge graph. Then, we use singular value decomposition to generate augmented views for graph contrastive learning. Moreover, we employ constrained k-means on labeled texts to learn clustering-friendly features, which facilitate cluster-oriented contrastive learning and help obtain better category boundaries. Extensive experimental results show that GIFT significantly outperforms previous state-of-the-art methods. Our code is available at https://github.com/KEAML-JLU/GIFT.
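
The abstract mentions singular value decomposition as the source of augmented views. A minimal sketch of one common way to do this, a low-rank reconstruction of an adjacency matrix from its truncated SVD, is given below. This record does not describe the authors' actual procedure, so the helper name `svd_augmented_view`, the `rank` parameter, and the toy graph are all assumptions made for illustration; the authors' repository is the place to look for the real implementation.

```python
import numpy as np


def svd_augmented_view(adj: np.ndarray, rank: int) -> np.ndarray:
    """Rank-`rank` reconstruction of `adj` from its truncated SVD (hypothetical helper)."""
    # Thin SVD: adj = u @ diag(s) @ vt, singular values sorted in descending order.
    u, s, vt = np.linalg.svd(adj, full_matrices=False)
    k = max(1, min(rank, s.shape[0]))
    # Keep only the k largest singular values and their singular vectors.
    return (u[:, :k] * s[:k]) @ vt[:k, :]


# Toy adjacency matrix (assumption): two triangles joined by the edge 2-3.
adj = np.zeros((6, 6))
for i, j in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]:
    adj[i, j] = adj[j, i] = 1.0

# Dense, low-rank "view" of the graph that keeps its dominant structure.
view = svd_augmented_view(adj, rank=2)
```

The reconstructed matrix is dense and real-valued rather than binary, so it can act as a smoothed second view of the same graph for a contrastive objective without dropping or shuffling individual edges. A smaller `rank` gives a coarser view.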

Improved Graph Contrastive Learning for Short Text Classification / Liu, Yonghao; Huang, Lan; Giunchiglia, Fausto; Feng, Xiaoyue; Guan, Renchu. - 38:17(2024), pp. 18716-18724. (38th AAAI Conference on Artificial Intelligence, AAAI 2024, Canada, 2024) [10.1609/aaai.v38i17.29835].

Improved Graph Contrastive Learning for Short Text Classification

Fausto Giunchiglia
2024-01-01

2024
Proceedings of the AAAI Conference on Artificial Intelligence
2275 E BAYSHORE RD, STE 160, PALO ALTO, CA 94303 USA
Association for the Advancement of Artificial Intelligence
Liu, Yonghao; Huang, Lan; Giunchiglia, Fausto; Feng, Xiaoyue; Guan, Renchu
Files in this item:
2024 02 AAAI renchu.pdf

open access

Type: Refereed author's manuscript (post-print)
License: All rights reserved
Size: 442.58 kB
Format: Adobe PDF
29835-Article Text-33889-1-2-20240324.pdf

open access

Type: Publisher's layout
License: All rights reserved
Size: 433.89 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/441009
Citations
  • PMC: not available
  • Scopus: 26
  • ISI: 20
  • OpenAlex: 15