Complex Entity Recognition Based on Prior Semantic Knowledge and Type Embedding

Jiang Xiao-Bo; He Kun<sup>*</sup>; Yan Guang-Yu

doi:10.13328/j.cnki.jos.006750

Abstract

Entity recognition is a key task in information extraction. As information extraction technology has developed, research has shifted from recognizing simple entities to recognizing complex ones. Complex entities usually lack explicit features and have more complicated syntactic constructions and parts of speech, which makes them hard to recognize. In addition, existing models widely use span-based methods to identify nested entities and, as a result, suffer from ambiguity in detecting entity boundaries, which hurts recognition performance. To address this challenge and this problem, this study proposes GIA-2DPE, an entity recognition model based on prior semantic knowledge and type embedding. The model uses keyword sequences of entity categories as prior semantic knowledge to improve its understanding of entities, uses type embedding to capture latent features of different entity types, and then combines the prior knowledge with the entity-type features through a gated interactive attention mechanism to assist in recognizing complex entities. Moreover, the model uses 2D probability encoding to predict entity boundaries and combines boundary features with contextual features to make boundary detection more accurate, thereby improving nested entity recognition. Extensive experiments on seven English datasets and two Chinese datasets show that GIA-2DPE outperforms state-of-the-art models and achieves a 10.4% F1 improvement over the baseline on the ScienceIE dataset. © 2023 Chinese Academy of Sciences.
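The abstract names two mechanisms without giving their formulas: a gate that blends prior-knowledge features with entity-type features, and a 2D table of span probabilities used to predict boundaries. The sketch below is only an illustration of those general ideas, not the GIA-2DPE implementation. The function names `gated_fuse` and `decode_spans`, the single scalar gate, the 0.5 threshold, and the toy probability table are all assumptions made for this example.

```python
import math


def gated_fuse(prior, type_feat, gate_logit):
    """Blend two equal-length feature vectors with one scalar sigmoid gate.

    Assumption: a real model would learn the gate from the inputs; here
    the gate logit is passed in directly to keep the sketch minimal.
    """
    g = 1.0 / (1.0 + math.exp(-gate_logit))
    return [g * p + (1.0 - g) * t for p, t in zip(prior, type_feat)]


def decode_spans(table, threshold=0.5):
    """Read entity spans off a 2D table where table[start][end] is the
    (hypothetical) probability that tokens start..end form an entity.

    Only cells with start <= end are inspected. Because every cell is
    judged independently, nested spans can be returned together.
    """
    spans = []
    for start, row in enumerate(table):
        for end in range(start, len(row)):
            if row[end] > threshold:
                spans.append((start, end, row[end]))
    return spans


# Toy example with made-up probabilities (not results from the paper).
tokens = ["University", "of", "California", "campus"]
table = [
    [0.1, 0.2, 0.9, 0.3],
    [0.0, 0.1, 0.1, 0.0],
    [0.0, 0.0, 0.8, 0.1],
    [0.0, 0.0, 0.0, 0.2],
]

for start, end, prob in decode_spans(table):
    print(" ".join(tokens[start:end + 1]), prob)
```

With this toy table the decoder returns both "University of California" and the nested "California", which is the kind of overlap a flat tagging scheme cannot express.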

Affiliation
South China University of Technology

Last updated: 2024-11-28 22:02
