Phonoprosodic corpus of spoken Catalan (PhonCAT)
This article describes the corpus of spoken Catalan elaborated within the research project “Phonoprosodic development of Catalan in its current bilingual context”. The corpus contains 174 interviews with speakers from three districts of Barcelona varying on the presence of Spanish. The subjects belong to three age groups: children aged 3 to 5, young people aged 19 to 23 and adults aged 32 to 40. The collected data consist of semi-spontaneous speech, free conversations, a role-play, a reading task and a sociolinguistic questionnaire. The goals of the project include auditory and acoustic analyses of Catalan segments (exemplified here by some results on vowels), the study of loan words and of cognates with different gender across Catalan and Spanish, as well as prosodic analyses of intonational phrasing of declaratives and interrogatives.