期刊论文详细信息
Information Research: An International Electronic Journal
Stemming and N-gram matching for term conflation in Turkish texts
关键词: free text;    indexing;    retrieval;    information retrieval;    word forms;    spelling errors;    alternative spellings;    multi-word concepts;    transliteration;    affixes;    abbreviations;    conflation algorithm;    Turkish;   
DOI  :  
来源: DOAJ
【 摘 要 】

One of the main problems involved in the use of free text for indexing and retrieval is the variation in word forms that is likely to be encountered. The most common type of variations are spelling errors, alternative spellings, multi-word concepts, transliteration, affixes and abbreviations. One way to alleviate this problem is to use a conflation algorithm, a computational procedure that is designed to bring together words that are semantically related, and to reduce them to a single form for retrieval purposes. In this paper, we discuss the use of conflation techniques for Turkish text databases.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次