Modern Chinese characters evolved from 3,000 years ago. Up to now, tens of thousands of glyphs of ancient characters have been discovered, which must be deciphered by experts to interpret unearthed documents. Experts usually need to compare each ancient character to be examined with similar known ones in whole historical periods. However, it is inevitably limited by human memory and experience, which often cost a lot of time but associations are limited to a small scope. To help researchers discover glyph similar characters, this paper introduces ZiNet, the first diachronic knowledge base describing relationships and evolution of Chinese characters and words. In addition, powered by the knowledge of radical systems in ZiNet, this paper introduces glyph similarity measurement between ancient Chinese characters, which could capture similar glyph pairs that are potentially related in origins or semantics. Results show strong positive correlations between scores from the method and from human experts. Finally, qualitative analysis and implicit future applications are presented.

ZiNet: Linking Chinese Characters Spanning Three Thousand Years / Chi, Yang; Giunchiglia, Fausto; Shi, Daqian; Diao, Xiaolei; Li, Chuntao; Xu, Hao. - (2022), pp. 3061-3070. (Intervento presentato al convegno ACL tenutosi a Dublin, Ireland nel 2022) [10.18653/v1/2022.findings-acl.242].

ZiNet: Linking Chinese Characters Spanning Three Thousand Years

Giunchiglia, Fausto;Shi, Daqian;Diao, Xiaolei;Xu, Hao
2022-01-01

Abstract

Modern Chinese characters evolved from 3,000 years ago. Up to now, tens of thousands of glyphs of ancient characters have been discovered, which must be deciphered by experts to interpret unearthed documents. Experts usually need to compare each ancient character to be examined with similar known ones in whole historical periods. However, it is inevitably limited by human memory and experience, which often cost a lot of time but associations are limited to a small scope. To help researchers discover glyph similar characters, this paper introduces ZiNet, the first diachronic knowledge base describing relationships and evolution of Chinese characters and words. In addition, powered by the knowledge of radical systems in ZiNet, this paper introduces glyph similarity measurement between ancient Chinese characters, which could capture similar glyph pairs that are potentially related in origins or semantics. Results show strong positive correlations between scores from the method and from human experts. Finally, qualitative analysis and implicit future applications are presented.
2022
ACL
USA
ACL
978-1-955917-25-4
Chi, Yang; Giunchiglia, Fausto; Shi, Daqian; Diao, Xiaolei; Li, Chuntao; Xu, Hao
ZiNet: Linking Chinese Characters Spanning Three Thousand Years / Chi, Yang; Giunchiglia, Fausto; Shi, Daqian; Diao, Xiaolei; Li, Chuntao; Xu, Hao. - (2022), pp. 3061-3070. (Intervento presentato al convegno ACL tenutosi a Dublin, Ireland nel 2022) [10.18653/v1/2022.findings-acl.242].
File in questo prodotto:
File Dimensione Formato  
2022.findings-acl.242.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 1.21 MB
Formato Adobe PDF
1.21 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/369580
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
social impact