1887
image of Time-based integration of Mandarin medial /j/

Abstract

This study investigates how listeners integrate dynamic cues to perceive the Mandarin medial glide /j/ over time, and how this process is modulated by speech rate and second-language (L2) background. We tested 30 adults in three groups: Mandarin native speakers, intermediate Japanese learners, and beginning Japanese learners. A female Beijing Mandarin talker produced disyllabic carrier phrases of the form + target syllable, containing the four target syllables and , at three speech rates (fast, mid, slow). For each token, the first syllable was kept intact, while the second syllable was progressively revealed across ten gates spanning the interval from the onset of the second syllable to the early steady-state portion of the vowel. On each trial, participants heard a two-syllable gated sequence and rated how much the second syllable sounded as if it began with an /i/-like quality on a five-point scale. Cumulative-link and linear mixed-effects models revealed robust effects of gate, speech rate, syllable structure, and group. Ratings increased steadily with gate level, were highest in slow speech, and followed a stable gradient of MNS > JP_INT > JP_BEG; CGVX syllables, especially , were easier and faster than the GVX syllable . The results highlight time-based cue integration as a locus of L2 difficulty, consistent with dynamic cue-integration accounts of speech perception and contemporary models of L2 phonology.

Available under the CC BY 4.0 license.
Loading

Article metrics loading...

/content/journals/10.1075/jslp.25074.niu
2026-05-27
2026-06-07
Loading full text...

Full text loading...

/deliver/fulltext/10.1075/jslp.25074.niu/jslp.25074.niu.html?itemId=/content/journals/10.1075/jslp.25074.niu&mimeType=html&fmt=ahah

References

  1. Anwyl-Irvine, A. L., Massonnié, J., Flitton, A., Kirkham, N. Z., & Evershed, J. K.
    (2020) Gorilla in our midst: An online behavioral experiment builder. Behavior Research Methods, (), –. 10.3758/s13428‑019‑01237‑x
    https://doi.org/10.3758/s13428-019-01237-x [Google Scholar]
  2. Baayen, R. H.
    (2008) Analyzing linguistic data: A practical introduction to statistics using R. Cambridge University Press. 10.1017/CBO9780511801686
    https://doi.org/10.1017/CBO9780511801686 [Google Scholar]
  3. Baayen, R. H., Davidson, D. J., & Bates, D. M.
    (2008) Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language, (), –. 10.1016/j.jml.2007.12.005
    https://doi.org/10.1016/j.jml.2007.12.005 [Google Scholar]
  4. Bates, D., Mächler, M., Bolker, B., & Walker, S.
    (2015) Fitting linear mixed-effects models using lme4. Journal of Statistical Software, (), –. 10.18637/jss.v067.i01
    https://doi.org/10.18637/jss.v067.i01 [Google Scholar]
  5. Bard, E. G., Shillcock, R. C., & Altmann, G. T. M.
    (1988) The recognition of words after their acoustic offsets in spontaneous speech: Effects of subsequent context. Perception & Psychophysics, (), –. 10.3758/BF03210424
    https://doi.org/10.3758/BF03210424 [Google Scholar]
  6. Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J.
    (2013) Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, (), –. 10.1016/j.jml.2012.11.001
    https://doi.org/10.1016/j.jml.2012.11.001 [Google Scholar]
  7. Best, C. T., & Tyler, M. D.
    (2007) Nonnative and second-language speech perception: Commonalities and complementarities. InO.-S. Bohn & M. J. Munro (Eds.), Language experience in second language speech learning: In honor of James Emil Flege (pp.–). John Benjamins. 10.1075/lllt.17.07bes
    https://doi.org/10.1075/lllt.17.07bes [Google Scholar]
  8. Bosker, H. R.
    (2017) How our own speech rate influences our perception of speech. Journal of Experimental Psychology: Learning, Memory, and Cognition, (), –. 10.1037/xlm0000364
    https://doi.org/10.1037/xlm0000364 [Google Scholar]
  9. Christensen, R. H. B.
    (2019) ordinal: Regression models for ordinal data (R package version 2019.12–10) [Computer software]. https://CRAN.R-project.org/package=ordinal
    [Google Scholar]
  10. Cutler, A., & Otake, T.
    (1994) Mora or phoneme? Further evidence for language-specific listening. Journal of Memory and Language, (), –. 10.1006/jmla.1994.1039
    https://doi.org/10.1006/jmla.1994.1039 [Google Scholar]
  11. Dupoux, E., Pallier, C., Kakehi, K., & Mehler, J.
    (1999) Epenthetic vowels in Japanese: A perceptual illusion?Journal of Experimental Psychology: Human Perception and Performance, (), –. 10.1037/0096‑1523.25.6.1568
    https://doi.org/10.1037/0096-1523.25.6.1568 [Google Scholar]
  12. Flege, J. E.
    (1995) Second language speech learning: Theory, findings, and problems. InW. Strange (Ed.), Speech perception and linguistic experience: Issues in cross-language research (pp.–). York Press.
    [Google Scholar]
  13. Grosjean, F.
    (1980) Spoken word recognition processes and the gating paradigm. Perception & Psychophysics, (), –. 10.3758/BF03204386
    https://doi.org/10.3758/BF03204386 [Google Scholar]
  14. Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B.
    (2017) lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, (), –. 10.18637/jss.v082.i13
    https://doi.org/10.18637/jss.v082.i13 [Google Scholar]
  15. Lee-Kim, S.-I.
    (2014) Revisiting Mandarin “apical vowels”: An articulatory and acoustic study. Journal of the International Phonetic Association, (), –. 10.1017/S0025100314000267
    https://doi.org/10.1017/S0025100314000267 [Google Scholar]
  16. Lin, Y. J.
    (2019) Where do the glides glide into?Perceiving Mandarin syllable structure (Doctoral dissertation). Indiana University.
    [Google Scholar]
  17. McMurray, B., & Jongman, A.
    (2011) What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations. Psychological Review, (), –. 10.1037/a0022325
    https://doi.org/10.1037/a0022325 [Google Scholar]
  18. Miller, J. L.
    (1981) Some effects of speaking rate on phonetic perception. Phonetica, (), –. 10.1159/000260021
    https://doi.org/10.1159/000260021 [Google Scholar]
  19. Miller, J. D.
    (1989) Auditory-perceptual interpretation of the vowel. Journal of the Acoustical Society of America, (), –. 10.1121/1.397862
    https://doi.org/10.1121/1.397862 [Google Scholar]
  20. Otake, T., Hatano, G., Cutler, A., & Mehler, J.
    (1993) Mora or syllable? Speech segmentation in Japanese. Journal of Memory and Language, (), –. 10.1006/jmla.1993.1014
    https://doi.org/10.1006/jmla.1993.1014 [Google Scholar]
  21. Pisoni, D. B.
    (1997) Some thoughts on “normalization” in speech perception. InK. Johnson & J. W. Mullennix (Eds.), Talker variability in speech processing (pp.–). Academic Press.
    [Google Scholar]
  22. R Core Team
    R Core Team (2024) R: A language and environment for statistical computing (Version 4.x) [Computer software]. R Foundation for Statistical Computing. https://www.r-project.org/
    [Google Scholar]
  23. Sanker, C., Babinski, S., Burns, R., Evans, M., Johns, J., Kim, J., Smith, S., Weber, N., & Bowern, C.
    (2021) (Don’t) try this at home! The effects of recording devices and software on phonetic analysis. Language, (), –. 10.1353/lan.2021.0075
    https://doi.org/10.1353/lan.2021.0075 [Google Scholar]
  24. Toscano, J. C., & McMurray, B.
    (2010) Cue integration with categories: Weighting acoustic cues in speech using unsupervised learning and distributional statistics. Cognitive Science, (), –. 10.1111/j.1551‑6709.2009.01077.x
    https://doi.org/10.1111/j.1551-6709.2009.01077.x [Google Scholar]
  25. Wang, M.
    (2020) The acoustic effect of speaking rate, focus and prosodic position on syllables in Chinese. Journal of Chinese Linguistics, (), –. 10.1353/jcl.2020.0004
    https://doi.org/10.1353/jcl.2020.0004 [Google Scholar]
  26. Xu, Y., & Liu, F.
    (2006) Tonal alignment, syllable structure and coarticulation: Toward an integrated model. Italian Journal of Linguistics / Rivista di Linguistica, (), –.
    [Google Scholar]
  27. (2007) Determining the temporal interval of segments with the help of F0 contours. Journal of Phonetics, (), –. 10.1016/j.wocn.2006.06.002
    https://doi.org/10.1016/j.wocn.2006.06.002 [Google Scholar]
  28. Yuan, J.
    (2013) The spectral dynamics of vowels in Mandarin Chinese. InProceedings of Interspeech 2013 (pp.–). International Speech Communication Association. 10.21437/Interspeech.2013‑18
    https://doi.org/10.21437/Interspeech.2013-18 [Google Scholar]
/content/journals/10.1075/jslp.25074.niu
Loading
/content/journals/10.1075/jslp.25074.niu
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error