Zeitschrift für Sprachwissenschaft
Quantifying graphemic variation via large text corpora
Lüschow, Hanna [1]
[1] Institut für Germanistik, Carl von Ossietzky Universität Oldenburg, Ammerländer Heerstr. 114–118, 26111 Oldenburg, Germany
Keywords: historical graphematics; spelling variation; German text archive; methodology; entropy
DOI: 10.1515/zfs-2021-2038
Source: DOAJ
【 Abstract 】

Some basic computer science concepts could expand the possibilities of (manual) graphematic text corpus analysis. Using them, it can be shown that graphematic variation decreases steadily in printed German texts from 1600 to 1900. While variability is consistently lower at the text-internal level, it decreases faster across the whole available writing system of individual decades. But which changes took place exactly? Which types of variation disappeared more quickly, and which persisted? How do we deal with amounts of data too large to be processed manually? Which aspects are of special importance, or get lost, when working with a large textual base?
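
The abstract does not say how variation is measured. Given the keyword "entropy", one plausible reading is Shannon entropy over the competing spellings of a word, and the minimal sketch below illustrates that idea only. The function name `variant_entropy` and the sample spellings are hypothetical, invented for illustration; they are not taken from the article or from the German Text Archive.

```python
from collections import Counter
from math import log2


def variant_entropy(variants):
    """Shannon entropy (in bits) of a list of observed spellings.

    Assumption: variation for one word is modelled as the entropy of
    the relative frequencies of its spelling variants. 0 means a single
    spelling is used throughout; higher values mean more variation.
    """
    counts = Counter(variants)
    total = sum(counts.values())
    # Sum p * log2(1/p) over all variants; an empty input yields 0.
    return sum((n / total) * log2(total / n) for n in counts.values())


# Invented example data: spellings of one word in two periods.
early = ["vnd", "und", "unnd", "vnd"]
late = ["und", "und", "und", "und"]

print(variant_entropy(early))  # three competing variants
print(variant_entropy(late))   # one variant only
```

Under this reading, a decline in the value from the earlier to the later sample would correspond to the decrease in variation the abstract describes. Comparing the measure within single texts against the pooled texts of a decade would be one way to approach the text-internal versus system-wide contrast.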
