期刊论文详细信息
Pramana: Journal of physics
A statistical probe into the word frequency and length distributions prevalent in the translations of Bhagavad Gita
NIKHIL KUMAR RAJPUT^11  BHAVYA AHUJA^12 
[1]Department of Computer Science, Ramanujan College, University of Delhi, New Delhi 110 019, India^1
[2]Department of Physics, Veer Chandra Singh Garhwali Uttarakhand University of Horticulture and Forestry, Tehri Garhwal 246 123, India^2
关键词: Shannon entropy;    power law;    word frequency distribution;    vocabulary quotient;    Kullback–Leibler divergence;   
DOI  :  
学科分类:物理(综合)
来源: Indian Academy of Sciences
PDF
【 摘 要 】
A statistical study has been conducted on Bhagavad Gita. Four measures have been derived for the original text in Sanskrit and its translations in Hindi, English and French. First, word frequency distributions for the documents were modelled. Power law was observed with the longest tail in the case of Sanskrit. For other versions, the distributions well replicated the Zipf–Mandelbrot pattern. Second, the Kullback–Leibler (KL) divergence betweenthe documents has been computed with the highest value recorded in all three translations from the Sanskrit text. Next, a Shannon entropy-based measure: vocabulary quotient has been calculated, which estimates the vocabulary richness the texts offer; the highest being in the case of Bhagavad Gita in Sanskrit. Finally, word-length distributions were obtained with the longest word length in Sanskrit. The results attribute to the inflectional nature of Sanskrit.
【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO201910250728070ZK.pdf 591KB PDF download
  文献评价指标  
  下载次数:20次 浏览次数:27次