Journal article details
Proceedings of the Romanian Academy, Series A: Mathematics, Physics, Technical Sciences, Information Science
Computing distributed representations of words using the CoRoLa corpus
Vasile PĂIS
Keywords: neural networks; word embeddings; vector representation; CoRoLa
Subject classification: Computer Science (General)
Source: Editura Academiei Romane
【 Abstract 】

We investigate the usability of the CoRoLa corpus for generating high-quality vector representations of words for the Romanian language. Different model parameters are tested, and model quality is compared in three test cases: two word analogy data sets and a word similarity correlation with human judgment. Furthermore, we prove that CoRoLa provides superior word representations compared to other known Romanian corpora, such as the Wikipedia corpus.

【 License 】

Unknown   
