期刊论文

【摘要】

One of the most effective factors on the natural language researches is the data set which plays a significant role in designing, improving and evaluation the information retrieval systems and other applications for natural language processing. Unfortunately, building a proper data set consume time, labor and effort, in particular the query extraction from the data set documents. In this study, a novel algorithm for query extraction from any collection of documents was suggested, the algorithm elaborate the similarity thesaurus for query extraction, which leads to the ability of using the algorithm on any language, to evaluate the suggested algorithm a data set that consist of 242 Arabic documents and 60 queries was used, 48 queries was extracted 20 of them appeared in manual data set and all of them was relevant with more than one document in the used collection.

【授权许可】

Unknown

【预览】

附件列表
Files	Size	Format	View
RO201911300879432ZK.pdf	108KB	PDF	download

American Journal of Applied Sciences
Novel Automatic Query Building Algorithm Using Similarity Thesaurus \| Science Publications

Ashraf Odeh¹ Aymen Abu-Errub¹ Hayel Khafajeh¹ Nidal Yousef¹
关键词: Information retrieval; natural language processing; Arabic corpora; similarity thesaurus;
DOI : 10.3844/ajassp.2012.1373.1377
学科分类：自然科学（综合）
来源: Science Publications
PDF


	文献评价指标
	下载次数：0次	浏览次数：5次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】