| Electronics | |
| Word Sense Disambiguation Using Prior Probability Estimation Based on the Korean WordNet | |
| Minho Kim1  Hyuk-Chul Kwon2  | |
| [1] Department of Software, Catholic University of Pusan, Busan 46252, Korea;School of Computer Science and Engineering, Pusan National University, Busan 46241, Korea; | |
| 关键词: word sense disambiguation; Korean WordNet; knowledge-based model; data mining; information extraction; | |
| DOI : 10.3390/electronics10232938 | |
| 来源: DOAJ | |
【 摘 要 】
Supervised disambiguation using a large amount of corpus data delivers better performance than other word sense disambiguation methods. However, it is not easy to construct large-scale, sense-tagged corpora since this requires high cost and time. On the other hand, implementing unsupervised disambiguation is relatively easy, although most of the efforts have not been satisfactory. A primary reason for the performance degradation of unsupervised disambiguation is that the semantic occurrence probability of ambiguous words is not available. Hence, a data deficiency problem occurs while determining the dependency between words. This paper proposes an unsupervised disambiguation method using a prior probability estimation based on the Korean WordNet. This performs better than supervised disambiguation. In the Korean WordNet, all the words have similar semantic characteristics to their related words. Thus, it is assumed that the dependency between words is the same as the dependency between their related words. This resolves the data deficiency problem by determining the dependency between words by calculating the
【 授权许可】
Unknown