期刊论文详细信息
Frontiers in Psychology
Long-Range Correlation Underlying Childhood Language and Generative Models
Kumiko Tanaka-Ishii1 
关键词: long-range correlation;    fluctuation analysis;    CHILDES;    generative models;    Simon Model;    Pitman-Yor model;   
DOI  :  10.3389/fpsyg.2018.01725
学科分类:心理学(综合)
来源: Frontiers
PDF
【 摘 要 】

Long-range correlation, a property of time series exhibiting relevant statistical dependence between two distant subsequences, is mainly studied in the statistical physics domain and has been reported to exist in natural language. By using a state-of-the-art method for such analysis, long-range correlation is first shown to occur in long CHILDES data sets. To understand why, generative stochastic models of language, originally proposed in the cognitive scientific domain, are investigated. Among representative models, the Simon model is found to exhibit surprisingly good long-range correlation, but not the Pitman-Yor model. Because the Simon model is known not to correctly reflect the vocabulary growth of natural languages, a simple new model is devised as a conjunct of the Simon and Pitman-Yor models, such that long-range correlation holds with a correct vocabulary growth rate. The investigation overall suggests that uniform sampling is one cause of long-range correlation and could thus have some relation with actual linguistic processes.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO201904026994383ZK.pdf 2201KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:5次