期刊论文详细信息
Cognitive Studies | Études cognitives 卷:0
Extraction and Presentation of Bilingual Correspondences from Slovak-Bulgarian Parallel Corpus
Radovan Garabík1  Ludmila Dimitrova2 
[1] Jazykovedný ústav Ľudovíta Štúra, Slovenská akadémia vied [Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences], BratislavaSlovakia;
[2] Институт по математикa и информатика, Българска академия на науките [Institute of Mathematics and Informatics, Bulgarian Academy of Sciences], София [Sofia];
关键词: translation equivalents;    GIZA++;    parallel corpora;    aligned text;    Slovak;    Bulgarian;   
DOI  :  10.11649/cs.2015.022
来源: DOAJ
【 摘 要 】

Extraction and Presentation of Bilingual Correspondences from Slovak-Bulgarian Parallel CorpusIn this paper the results of the automatic extraction and presentation of bilingual correspondences from Slovak-Bulgarian Parallel corpus are described. The equivalent phrases are extracted from sentence and word level automatically aligned corpus, filtered, indexed and presented in a dictionary-like interface. The bilingual dictionary database contains 80 thousand phrase pairs consisting of approximately 350 thousand words (per each language). Counting unique word forms, the size is 31 thousand in the Slovak part of the dictionary, 26 thousand in the Bulgarian part.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次