1887
Volume 15, Issue 4
  • ISSN 1879-9264
  • E-ISSN: 1879-9272

Abstract

Abstract

This research addresses two major challenges in studying second language acquisition and bilingualism: reducing overlap in predictor variables and correctly classifying participants into language proficiency levels. Too many relevant predictors can harm statistical analysis due to an increased chance of overlap, known as multicollinearity. To tackle this, we use Principal Component Analysis (PCA) on selected predictors to identify proficiency indicators, combining the length of stay in the UK and language test scores. Additionally, traditional methods, especially IELTS-based proficiency classifications, often miss subtle differences in language skills, particularly when they fail to consider how long participants have been exposed to the target language. We counter this by using non-hierarchical Cluster Analysis (NCA) for a grounded, data-driven way of detecting distinct language proficiency groups. This new approach is demonstrated on a dataset of eye movements from reading tasks, collected from Chinese–English bilinguals in the UK.

Available under the CC BY 4.0 license.
Loading

Article metrics loading...

/content/journals/10.1075/lab.24019.he
2025-04-28
2026-01-23
Loading full text...

Full text loading...

/deliver/fulltext/lab.24019.he.html?itemId=/content/journals/10.1075/lab.24019.he&mimeType=html&fmt=ahah

References

  1. Aizawa, I., Rose, H., Thompson, G., & Curle, S.
    (2020) Beyond the threshold: Exploring English language proficiency, linguistic challenges, and academic language skills of Japanese students in an English medium instruction programme. Language Teaching Research, 27(4), 837–861. 10.1177/1362168820965510
    https://doi.org/10.1177/1362168820965510 [Google Scholar]
  2. Ashby, J., Rayner, K., & Clifton, C.
    (2005) Eye movements of highly skilled and average readers: Differential effects of frequency and predictability. The Quarterly Journal of Experimental Psychology Section A, 58(61), 1065–1086. 10.1080/02724980443000476
    https://doi.org/10.1080/02724980443000476 [Google Scholar]
  3. Baayen, R. H., & Milin, P.
    (2010) Analyzing reaction times. International Journal of Psychological Research, 3(2), 12–28. 10.21500/20112084.807
    https://doi.org/10.21500/20112084.807 [Google Scholar]
  4. Baayen, R. H., Milin, P., & Ramscar, M.
    (2016) Frequency in lexical processing. Aphasiology, 30(11), 1174–1220. 10.1080/02687038.2016.1147767
    https://doi.org/10.1080/02687038.2016.1147767 [Google Scholar]
  5. Barton, J. J., Hanif, H. M., Eklinder Björnström, L., & Hills, C.
    (2014) The word-length effect in reading: A review. Cognitive Neuropsychology, 31(5–6), 378–412. 10.1080/02643294.2014.895314
    https://doi.org/10.1080/02643294.2014.895314 [Google Scholar]
  6. Bialystok, E.
    (2011) Coordination of executive functions in monolingual and bilingual children. Journal of Experimental Child Psychology, 110(3), 461–468. 10.1016/j.jecp.2011.05.005
    https://doi.org/10.1016/j.jecp.2011.05.005 [Google Scholar]
  7. BNC Consortium
    BNC Consortium (2007) The British National Corpus, XML Edition [Data set]. Oxford Text Archive. hdl.handle.net/20.500.14106/2554 (Oxford Text Archive)
    [Google Scholar]
  8. Brown, J. I., Fishco, V. V., & Hanna, G. S.
    (1993) Nelson-Denny reading test: Technical report, formsG & H. Riverside Publishing Company.
    [Google Scholar]
  9. Cattell, R. B.
    (1966) The scree test for the number of factors. Multivariate Behavioral Research, 1(2), 245–276. 10.1207/s15327906mbr0102_10
    https://doi.org/10.1207/s15327906mbr0102_10 [Google Scholar]
  10. Chaouch-Orozco, A., Alonso, J. G., & Rothman, J.
    (2021) Individual differences in bilingual word recognition: The role of experiential factors and word frequency in cross-language lexical priming. Applied Psycholinguistics, 42(2), 447–474. 10.1017/S014271642000082X
    https://doi.org/10.1017/S014271642000082X [Google Scholar]
  11. Cop, U., Drieghe, D., & Duyck, W.
    (2015) Eye movement patterns in natural reading: A comparison of monolingual and bilingual reading of a novel. PLOS One, 10(8), Articlee0134008. 10.1371/journal.pone.0134008
    https://doi.org/10.1371/journal.pone.0134008 [Google Scholar]
  12. Coretta, S., & Casillas, J. V.
    (2024) A tutorial on generalised additive mixed effects models for bilingualism research. Linguistic Approaches to Bilingualism, this issue. Advance online publication. 10.1075/lab.23076.cor
    https://doi.org/10.1075/lab.23076.cor [Google Scholar]
  13. Divjak, D., & Milin, P.
    (2020) Exploring and exploiting uncertainty: Statistical learning ability affects how we learn to process language along multiple dimensions of experience. Cognitive Science, 44(5), Articlee12835. 10.1111/cogs.12835
    https://doi.org/10.1111/cogs.12835 [Google Scholar]
  14. Dussias, P. E.
    (2010) Uses of eye-tracking data in second language sentence processing research. Annual Review of Applied Linguistics, 301, 149–166. 10.1017/S026719051000005X
    https://doi.org/10.1017/S026719051000005X [Google Scholar]
  15. Dussias, P. E., & Sagarra, N.
    (2007) The effect of exposure on syntactic parsing in Spanish–English bilinguals. Bilingualism: Language and Cognition, 10(1), 101–116. 10.1017/S1366728906002847
    https://doi.org/10.1017/S1366728906002847 [Google Scholar]
  16. Findelsberger, E., Hutzler, F., & Hawelka, S.
    (2019) Spill the load: Mixed evidence for a foveal load effect, reliable evidence for a spillover effect in eye-movement control during reading. Attention, Perception, & Psychophysics, 811, 1442–1453. 10.3758/s13414‑019‑01689‑5
    https://doi.org/10.3758/s13414-019-01689-5 [Google Scholar]
  17. Wickham, H., François, R., Henry, L., Müller, K., & Vaughan, D.
    (2023) dplyr: A grammar of data manipulation (R package version 1.1.4) [Computer software]. https://CRAN.R-project.org/package=dplyr9
    [Google Scholar]
  18. He, S.
    (2024) Learning to optimise reading: an exploratory study of eye movement transfer across languages [Doctoral dissertation, University of Birmingham]. University of Birmingham eTheses Repository. etheses.bham.ac.uk/id/eprint/14849
  19. He, S., Divjak, D., & Milin, P.
    (2022, June6–8). Examining the role of L2 proficiency in eye-movement patterns [Conference session]. University of Edinburgh Linguistics and English Language Postgraduate Conference, Edinburgh, United Kingdom.
    [Google Scholar]
  20. Heister, J., Würzner, K. M., & Kliegl, R.
    (2012) Analysing large datasets of eye movements during reading. Visual Word Recognition, 21, 102–130. Psychology Press.
    [Google Scholar]
  21. Hersch, J., & Andrews, S.
    (2012) Lexical quality and reading skill: Bottom-up and top-down contributions to sentence processing. Scientific Studies of Reading, 161, 240–262. 10.1080/10888438.2011.564244
    https://doi.org/10.1080/10888438.2011.564244 [Google Scholar]
  22. Hofweber, J., Marinis, T., & Treffers-Daller, J.
    (2016) Effects of dense code-switching on executive control. Linguistic Approaches to Bilingualism, 6(5), 648–668. 10.1075/lab.15052.hof
    https://doi.org/10.1075/lab.15052.hof [Google Scholar]
  23. Hulstijn, J. H.
    (2012) The construct of language proficiency in the study of bilingualism from a cognitive perspective. Bilingualism: Language and Cognition, 15(2), 422–433. 10.1017/S1366728911000678
    https://doi.org/10.1017/S1366728911000678 [Google Scholar]
  24. Jia, G., Aaronson, D., & Wu, Y.
    (2002) Long-term language attainment of bilingual immigrants: Predictive variables and language group differences. Applied Psycholinguistics, 23(4), 599–621. 10.1017/S0142716402004058
    https://doi.org/10.1017/S0142716402004058 [Google Scholar]
  25. Just, M. A., & Carpenter, P. A.
    (1980) A theory of reading: From eye fixations to comprehension. Psychological Review, 87(4), 329–353. 10.1037/0033‑295X.87.4.329
    https://doi.org/10.1037/0033-295X.87.4.329 [Google Scholar]
  26. Kaiser, H. F.
    (1960) The application of electronic computers to factor analysis. Educational and Psychological Measurement, 20(1), 141–151. 10.1177/001316446002000116
    https://doi.org/10.1177/001316446002000116 [Google Scholar]
  27. Keating, G. D.
    (2024) Normalization of timed measures in bilingualism research: Make it optimal with the Box-Cox transformation. Linguistic Approaches to Bilingualism. Advance online publication. 10.1075/lab.24017.kea
    https://doi.org/10.1075/lab.24017.kea [Google Scholar]
  28. Kliegl, R., Nuthmann, A., & Engbert, R.
    (2006) Tracking the mind during reading: the influence of past, present, and future words on fixation durations. Journal of Experimental Psychology: General, 135(1), 12–35. 10.1037/0096‑3445.135.1.12
    https://doi.org/10.1037/0096-3445.135.1.12 [Google Scholar]
  29. Kroll, J. F., & Bialystok, E.
    (2013) Understanding the consequences of bilingualism for language processing and cognition. Journal of Cognitive Psychology, 25(5), 497–514. 10.1080/20445911.2013.799170
    https://doi.org/10.1080/20445911.2013.799170 [Google Scholar]
  30. Kroll, J. F., Dussias, P. E., Bogulski, C. A., & Kroff, J. R. V.
    (2012) Juggling two languages in one mind: What bilinguals tell us about language processing and its consequences for cognition. InPsychology of Learning and Motivation (Vol.561, pp.229–262). Academic press. 10.1016/B978‑0‑12‑394393‑4.00007‑8
    https://doi.org/10.1016/B978-0-12-394393-4.00007-8 [Google Scholar]
  31. Lantolf, J. P., & Poehner, M. E.
    (2011) Dynamic assessment in the classroom: Vygotskian praxis for second language development. Language teaching research, 15(1), 11–33. 10.1177/1362168810383328
    https://doi.org/10.1177/1362168810383328 [Google Scholar]
  32. Li, P., Sepanski, S., & Zhao, X.
    (2006) Language history questionnaire: A web-based interface for bilingual research. Behavior Research Methods, 38(2), 202–210. 10.3758/BF03192770
    https://doi.org/10.3758/BF03192770 [Google Scholar]
  33. Linck, J. A., Kroll, J. F., & Sunderman, G.
    (2009) Losing access to the native language while immersed in a second language: Evidence for the role of inhibition in second-language learning. Psychological Science, 20(12), 1507–1515. 10.1111/j.1467‑9280.2009.02480.x
    https://doi.org/10.1111/j.1467-9280.2009.02480.x [Google Scholar]
  34. Marian, V., & Fausey, C. M.
    (2006) Language‐dependent memory in bilingual learning. Applied Cognitive Psychology: The Official Journal of the Society for Applied Research in Memory and Cognition, 20(8), 1025–1047. 10.1002/acp.1242
    https://doi.org/10.1002/acp.1242 [Google Scholar]
  35. Mishra, R. K.
    (2018) Bilingualism and cognitive control (Vol.61).Springer International. 10.1007/978‑3‑319‑92513‑4
    https://doi.org/10.1007/978-3-319-92513-4 [Google Scholar]
  36. Nilsson, M.
    (2012) Computational Models of Eye Movements in Reading: A Data-driven Approach to the Eye-mind Link [Doctoral dissertation, Acta Universitatis Upsaliensis]. Retrieved fromhttps://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-167403
    [Google Scholar]
  37. Park, H. I., & Ziegler, N.
    (2014) Cognitive shift in the bilingual mind: Spatial concepts in Korean–English bilinguals. Bilingualism: Language and Cognition, 17(2), 410–430. 10.1017/S1366728913000400
    https://doi.org/10.1017/S1366728913000400 [Google Scholar]
  38. Poehner, M.
    (2008) Dynamic assessment: A Vygotskian approach to understanding and promoting L2 development (Vol.91). Springer Science & Business Media. 10.1007/978‑0‑387‑75775‑9
    https://doi.org/10.1007/978-0-387-75775-9 [Google Scholar]
  39. R Core Team
    R Core Team (2023) R: A language and environment for statistical computing (Version 4.3.2) [Computer software]. https://www.R-project.org/
    [Google Scholar]
  40. Ramanujan, K., & Weekes, B. S.
    (2020) Predictors of lexical retrieval in Hindi–English bilingual speakers. Bilingualism: Language and Cognition, 23(2), 265–273. 10.1017/S1366728918001177
    https://doi.org/10.1017/S1366728918001177 [Google Scholar]
  41. Rayner, K.
    (1998) Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, 124(3), 372–422. 10.1037/0033‑2909.124.3.372
    https://doi.org/10.1037/0033-2909.124.3.372 [Google Scholar]
  42. Rayner, K., & Liversedge, S. P.
    (2011) Linguistic and cognitive influences on eye movements during reading. InS. P. Liversedge, I. D. Gilchrist, & S. Everling (Eds.), The Oxford Handbook of Eye Movements (pp.751–766). Oxford University Press.
    [Google Scholar]
  43. Schotter, E. R., Angele, B., & Rayner, K.
    (2012) Parafoveal processing in reading. Attention, Perception, & Psychophysics, 74(1), 5–35. 10.3758/s13414‑011‑0219‑2
    https://doi.org/10.3758/s13414-011-0219-2 [Google Scholar]
  44. Schotter, E. R., Reichle, E. D., & Rayner, K.
    (2014) Rethinking parafoveal processing in reading: Serial-attention models can explain semantic preview benefit and N+ 2 preview effects. Visual Cognition, 22(3–4), 309–333. 10.1080/13506285.2013.873508
    https://doi.org/10.1080/13506285.2013.873508 [Google Scholar]
  45. Schroeder, S. R., Lam, T. Q., & Marian, V.
    (2017) Linguistic predictors of cultural identification in bilinguals. Applied Linguistics, 38(4), 463–488. 10.1093/applin/amv049
    https://doi.org/10.1093/applin/amv049 [Google Scholar]
  46. Shabani, K.
    (2023) Diagnostic and developmental potentials of computerized dynamic assessment (C-DA) for L2 vocabulary. Interdisciplinary Studies in English Language Teaching, 1(2), 130–149. 10.22080/iselt.2022.23570.1035
    https://doi.org/10.22080/iselt.2022.23570.1035 [Google Scholar]
  47. Tempelaar, D., Rienties, B., & Nguyen, Q.
    (2020) Subjective data, objective data and the role of bias in predictive modelling: Lessons from a dispositional learning analytics application. PLOS One, 15(6), Articlee0233977. 10.1371/journal.pone.0233977
    https://doi.org/10.1371/journal.pone.0233977 [Google Scholar]
  48. Tomoschuk, B., Ferreira, V. S., & Gollan, T. H.
    (2019) When a seven is not a seven: Self-ratings of bilingual language proficiency differ between and within language populations. Bilingualism: Language and Cognition, 22(3), 516–536. 10.1017/S1366728918000421
    https://doi.org/10.1017/S1366728918000421 [Google Scholar]
  49. Whitford, V., & Joanisse, M. F.
    (2021) Eye movement measures of within-language and cross-language activation during reading in monolingual and bilingual children and adults: A focus on neighborhood density effects. Frontiers in Psychology, 121, Article674007. 10.3389/fpsyg.2021.674007
    https://doi.org/10.3389/fpsyg.2021.674007 [Google Scholar]
  50. Whitford, V., & Titone, D.
    (2012) Second-language experience modulates first-and second-language word frequency effects: Evidence from eye movement measures of natural paragraph reading. Psychonomic Bulletin & Review, 19(1), 73–80. 10.3758/s13423‑011‑0179‑5
    https://doi.org/10.3758/s13423-011-0179-5 [Google Scholar]
  51. Wurm, L. H., & Fisicaro, S. A.
    (2014) What residualizing predictors in regression analyses does (and what it does not do). Journal of Memory and Language, 721, 37–48. 10.1016/j.jml.2013.12.003
    https://doi.org/10.1016/j.jml.2013.12.003 [Google Scholar]
  52. Yan, M., Kliegl, R., Shu, H., Pan, J., & Zhou, X.
    (2010) Parafoveal load of word N+ 1 modulates preprocessing effectiveness of word N+ 2 in Chinese reading. Journal of Experimental Psychology: Human Perception and Performance, 36(6), 1669. 10.1037/a0019329
    https://doi.org/10.1037/a0019329 [Google Scholar]
  53. Yen, M. H., Radach, R., Tzeng, O. J. L., Hung, D. L., & Tsai, J. L.
    (2009) Early parafoveal processing in reading Chinese sentences. Acta Psychologica, 131(1), 24–3. 10.1016/j.actpsy.2009.02.005
    https://doi.org/10.1016/j.actpsy.2009.02.005 [Google Scholar]
/content/journals/10.1075/lab.24019.he
Loading
/content/journals/10.1075/lab.24019.he
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error