Journal article details
Proceedings of the Romanian Academy Series A-Mathematics Physics Technical Sciences Information Science
Computing distributed representations of words using the CoRoLa corpus
Vasile PĂIS
Keywords: neural networks; word embeddings; vector representation; CoRoLa
DOI:
Subject classification: Computer Science (General)
Source: Editura Academiei Romane
【 Abstract 】
We investigate the usability of the CoRoLa corpus for generating high-quality vector representations of words for the Romanian language. Different model parameters are tested, and model quality is compared in three test cases: two word analogy data sets and a word similarity correlation with human judgment. Furthermore, we prove that CoRoLa provides superior word representations compared to other known Romanian corpora, such as the Wikipedia corpus.
【 License 】
Unknown
【 Preview 】
Files | Size | Format | View |
---|---|---|---|
RO201910284824014ZK.pdf | 400KB | PDF | download |