From the extraction of continuous features in parallel texts to visual analytics of heterogeneous areal-typological datasets
The aim of this paper is twofold. First, we show that functionally motivated procedural approaches may help to automatically extract typological features from texts. This idea is illustrated with measuring cross-linguistic variation in the domain of morphological typology based on parallel texts. Second, we demonstrate that the methodology developed in the field of visual analytics allows for detecting patterns or regularities in the automatically extracted features. At the heart of our approach lies an extended sunburst visualization, which enables a cross-comparison of a large number of features within the context of language genealogy and areal information. We provide evidence of the usefulness of the present approach with case studies where the visualizations of the extracted features reveal interesting insights.