Citation: | ZHANG Chunxiang, TANG Libo, GAO Xueyao. Word Sense Disambiguation Based on Semi-Supervised Convolutional Neural Networks[J]. Journal of Southwest Jiaotong University, 2022, 57(1): 11-17, 27. doi: 10.3969/j.issn.0258-2724.20200105 |
In order to solve the difficulty of acquiring tagged corpus, a Chinese word sense disambiguation method is proposed on the basis of semi-supervised learning convolutional neural networks (CNN). Firstly, the word, part of speech and semantic category are extracted as discriminative features, which are acquired from 2 word units on the both left and right adjacent to ambiguous word. Word vector tool is used to denote discriminative features as vector. Secondly, tagged corpus is preprocessed to obtain initialized clustering centers and thresholds. At the same time, it is used to train convolutional neural networks. The optimized CNN is applied for determining the semantic categories of ambiguous words in the untagged corpus. Corpus with high confidence that meets threshold conditions is selected into the training corpus. The above process is repeated until the training corpus is no longer expanded. In the last, SemEval-2007: Task#5 is used as the tagged corpus, and the unannotated corpus from Harbin Institute of Technology is used as the untagged corpus. Experimental results show that the proposed method improve disambiguation accuracy of CNN by 3.1%.
[1] |
LESK M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine code from an ice cream[C]//The Figth Annual International Conference on Systems Documentation. Toronto: ACM Press, 1986: 24-26
|
[2] |
杨安,李素建,李芸. 基于领域知识和词向量的词义消歧方法[J]. 北京大学学报(自然科学版),2017,53(2): 204-210.
YANG An, LI Sujian, LI Yun. Word sense disambiguation based on domain knowledge and word vector model[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2017, 53(2): 204-210.
|
[3] |
FRANCO R L, IVAN L A , PINTO D, et al. Context expansion for domain-specific word sense disambiguation[J]. IEEE Latin America Transactions, 2015, 13(3): 784-789. doi: 10.1109/TLA.2015.7069105
|
[4] |
唐共波,于东,荀恩东. 基于知网义原词向量表示的无监督词义消歧方法[J]. 中文信息学报,2015,29(6): 23-29. doi: 10.3969/j.issn.1003-0077.2015.06.004
TANG Gongbo, YU Dong, XUN Endong. An unsupervised word sense disambiguation method based on sememe vector in HowNet[J]. Journal of Chinese Information Processing, 2015, 29(6): 23-29. doi: 10.3969/j.issn.1003-0077.2015.06.004
|
[5] |
ARAB M, JAHROMI M Z, FAKHRAHMAD S M. A graph-based approach to word sense disambiguation. An unsupervised method based on semantic relatedness[C]//2016 24th Iranian Conference on Electrical Engineering (CEE). Shiraz: IEEE, 2016: 250-255.
|
[6] |
孟禹光,周俏丽,张桂平,等. 引入词性标记的基于语境相似度的词义消歧[J]. 中文信息学报,2018,32(8): 9-18. doi: 10.3969/j.issn.1003-0077.2018.08.003
MENG Yuguang, ZHOU Qiaoli, ZHANG Guiping, et al. Word sense disambiguation based on context simila- rity with POS tagging[J]. Journal of Chinese Information Processing, 2018, 32(8): 9-18. doi: 10.3969/j.issn.1003-0077.2018.08.003
|
[7] |
鹿文鹏,黄河燕,吴昊. 基于领域知识的图模型词义消歧方法[J]. 自动化学报,2014,40(12): 2836-2850.
LU Wenpeng, HUANG Heyan, WU Hao. Word sense disambiguation with graph model based on domain knowledge[J]. Acta Automatica Sinica, 2014, 40(12): 2836-2850.
|
[8] |
DUQUE A, STEVENSON M, MARTINEZ-ROMO J, et al. Co-occurrence graphs for word sense disambiguation in the biomedical domain[J]. Artificial Intelligence in Medicine, 2018, 87: 9-19. doi: 10.1016/j.artmed.2018.03.002
|
[9] |
TRIPODI R, PELILLO M. A game-theoretic approach to word sense disambiguation[J]. Computational Linguistics, 2017, 43(1): 31-70. doi: 10.1162/COLI_a_00274
|
[10] |
XU Xueping, YU Jianping, PIAO Xiaoyu. Contribution of governors to word sense disambiguation of English preposition[J]. ICIC Express Letters, 2015, 6(3): 723-730.
|
[11] |
杨陟卓. 基于上下文翻译的有监督词义消歧研究[J]. 计算机科学,2017,44(4): 252-255, 280. doi: 10.11896/j.issn.1002-137X.2017.04.053
YANG Zhizhuo. Supervised WSD method based on context translation[J]. Computer Science, 2017, 44(4): 252-255, 280. doi: 10.11896/j.issn.1002-137X.2017.04.053
|
[12] |
CARDELLINO C, ALONSO ALEMANY L. Exploring the impact of word embeddings for disjoint semisupervised Spanish verb sense disambiguation[J]. Inteligencia Artificial, 2018, 21(61): 67-81. doi: 10.4114/intartif.vol21iss61pp67-81
|
[13] |
HUANG Z H, CHEN Y D, SHI X D. A novel word sense disambiguation algorithm based on semi-supervised statistical learning[J]. International Journal of Applied Mathematics and Statistics, 2013, 43(13): 452-458.
|
[14] |
MAHMOODVAND M, HOURALI M. Semi-supervised approach for Persian word sense disambiguation[C]// 2017 7th International Conference on Computer and Knowledge Engineering (ICCKE). Mashhad: IEEE, 2017: 104-110.
|
[15] |
刘子图,全紫薇,毛如柏,等. NT-EP:一种无拓扑结构的社交消息传播范围预测方法[J]. 计算机研究与发展,2020,57(6): 1312-1322. doi: 10.7544/issn1000-1239.2020.20190584
LIU Zitu, QUAN Ziwei, MAO Rubai, et al. NT-EP:a non-topology method for predicting the scope of social message propogation[J]. Journal of Computer Research and Development, 2020, 57(6): 1312-1322. doi: 10.7544/issn1000-1239.2020.20190584
|
[16] |
刘勇,谢胜男,仲志伟,等. 社会网中基于主题兴趣的影响最大化算法[J]. 计算机研究与发展,2018,55(11): 2406-2418. doi: 10.7544/issn1000-1239.2018.20170672
LIU Yong, XIE Shengnan, ZHONG Zhiwei, et al. Topic-interest based influence maximization algorithm in social networks[J]. Journal of Computer Research and Development, 2018, 55(11): 2406-2418. doi: 10.7544/issn1000-1239.2018.20170672
|
[17] |
薛涛,王雅玲,穆楠. 基于词义消歧的卷积神经网络文本分类模型[J]. 计算机应用研究,2018,35(10): 2898-2903. doi: 10.3969/j.issn.1001-3695.2018.10.004
XUE Tao, WANG Yaling, MU Nan. Convolutional neural network based on word sense disambiguation for text classification[J]. Application Research of Computers, 2018, 35(10): 2898-2903. doi: 10.3969/j.issn.1001-3695.2018.10.004
|
[18] |
PESARANGHADER A, MATWIN S, SOKOLOVA M, et al. DeepBioWSD:effective deep neural word sense disambiguation of biomedical text data[J]. Journal of the American Medical Informatics Association, 2019, 26(5): 438-446. doi: 10.1093/jamia/ocy189
|
[19] |
BORDES A, GLOROT X, WESTON J, et al. A semantic matching energy function for learning with multi-relational data[J]. Machine Learning, 2014, 94(2): 233-259. doi: 10.1007/s10994-013-5363-6
|
[20] |
CHEN S J, HUNG C. Word sense disambiguation based sentiment lexicons for sentiment classification[J]. Knowledge-Based Systems, 2016, 110: 224-232. doi: 10.1016/j.knosys.2016.07.030
|