%0 Journal Article
%A Diemer, Stefan
%A Brunner, Marie-Louise
%A Schmidt, Selina
%T Compiling computer-mediated spoken language corpora: Key issues and recommendations
%D 2016
%J International Journal of Corpus Linguistics
%V 21
%N 3
%P 348-371
%@ 1384-6655
%R https://doi.org/10.1075/ijcl.21.3.03die
%K spoken language corpora
%K best practice
%K data compilation and transcription
%K Computer-mediated communication (CMC)
%I John Benjamins
%X This paper discusses key issues in the compilation of spoken language corpora in a computer-mediated communication (CMC) environment, using data from the Corpus of Academic Spoken English (CASE), a corpus of Skype conversations currently being compiled at Saarland University, Germany, in cooperation with European and US partners. Based on first findings, Skype is presented as a suitable tool for collecting informal spoken data. In addition, new recommendations concerning data compilation and transcription are put forward to supplement existing best practice as presented in Wynne (2005). We recommend the preservation of multimodal features during anonymisation, and the addition of annotation elements already at the transcription stage, particularly CMC-related discourse features, English as a Lingua Franca (ELF) features (e.g. non-standard language and code-switching), as well as the inclusion of prosodic, paralinguistic, and non-verbal annotation. Additionally, we propose a layered corpus design in order to allow researchers to focus on specific annotation features.
%U https://www.jbe-platform.com/content/journals/10.1075/ijcl.21.3.03die