Journal of Data Mining and Digital Humanities | |
An interactive visualization of Google Books Ngrams with R and Shiny: Exploring a(n) historical increase in onset strength in a(n) huge database | |
关键词: r; data visualization; shiny; google books ngrams; google books; corpus linguistics; historical phonology; historical linguistics; n-grams; [shs.langue]humanities and social sciences/linguistics; | |
DOI : | |
来源: DOAJ |
【 摘 要 】
International audience Using the re-emergence of the /h/ onset from Early Modern to Present-Day English as a case study, we illustrate the making and the functions of a purpose-built web application named (an:a) lyzer for the interactive visualization of the raw n-gram data provided by Google Books Ngrams (GBN). The database has been compiled from the full text of over 4.5 million books in English, totalling over 468 billion words and covering roughly five centuries. We focus on bigrams consisting of words beginning with graphic
【 授权许可】
Unknown