@article{jbp:/content/journals/10.1075/jul.00002.nor, author = "Norvik, Miina and Jing, Yingqi and Dunn, Michael and Forkel, Robert and Honkola, Terhi and Klumpp, Gerson and Kowalik, Richard and Metslang, Helle and Pajusalu, Karl and Piha, Minerva and Saar, Eva and Saarinen, Sirkka and Vesakoski, Outi", title = "Uralic typology in the light of a new comprehensive dataset", journal= "Journal of Uralic Linguistics", year = "2022", volume = "1", number = "1", pages = "4-42", doi = "https://doi.org/10.1075/jul.00002.nor", url = "https://www.jbe-platform.com/content/journals/10.1075/jul.00002.nor", publisher = "John Benjamins", issn = "2772-3720", type = "Journal Article", keywords = "typology", keywords = "syntax", keywords = "areal linguistics", keywords = "quantitative linguistics", keywords = "Uralic languages", keywords = "morphology", keywords = "phonology", abstract = "Abstract

This paper presents the Uralic Areal Typology Online (UraTyp 1.0), a typological dataset of 35 Uralic languages and a total of 360 features, mainly covering the levels of morphology, syntax, and phonology. The features belong to two different datasets: 195 features’ definitions originate from the Grambank (GB) database, developed for comparison of world language typology, whereas 165 features (UT) have been designed specifically to describe the typological variation within the Uralic language family. We present a series of analyses of the dataset demonstrating its scope and possibilities. The complete data set correctly identifies the main Uralic subgroups in a Principal Components Analysis, whereas GB data alone is insufficiently granular to detect this family-internal structure. Similar analyses limited to various typological subdomains also give variable results. A model-based admixture analysis identifies four distinct areas of historical interaction: Saami, Finnic, the Volga area and Ob-Ugric.", }