1887
image of Normalization of timed measures in bilingualism research
USD
Buy:$35.00 + Taxes

Abstract

Abstract

The time it takes an individual to respond to a probe (e.g., a word, picture, or question) or to read a word or phrase provides useful insights into cognitive processes. Consequently, timed measures are a staple in bilingualism research. However, timed measures usually violate assumptions of linear models, one being normal distribution of the residuals. Power transformations are a common solution but which of the many possible transformations to apply is often guesswork. Box and Cox (1964) developed a procedure to estimate the best-fitting normalizing transformation, coefficient lambda (λ), that is easy to run using standard R packages. This practical primer demonstrates how to perform the Box-Cox transformation in R using as a testbed the distractor items from a recent eye-tracking study on sentence reading in speakers of Spanish as a majority and a heritage language. The analyses show (a) that the exponents selected via the Box-Cox procedure reduce positive skewness as well as or better than the natural log; (b) that the best-fitting value of λ varies based on factors such as group and, in the case of eye-movement data, the measure of interest; and (c) that the choice of transformation sometimes impacts values for model estimates.

Loading

Article metrics loading...

/content/journals/10.1075/lab.24017.kea
2024-09-25
2024-10-13
Loading full text...

Full text loading...

References

  1. Baayen, R. H.
    (2008) Analyzing linguistic data: A practical introduction to statistics. CUP. 10.1017/CBO9780511801686
    https://doi.org/10.1017/CBO9780511801686 [Google Scholar]
  2. Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J.
    (2013) Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, (), –. 10.1016/j.jml.2012.11.001
    https://doi.org/10.1016/j.jml.2012.11.001 [Google Scholar]
  3. Bates, D., Mächler, M., Bolker, B., & Walker, S.
    (2015) Fitting linear mixed-effects models using lme4. Journal of Statistical Software, (), –. 10.18637/jss.v067.i01
    https://doi.org/10.18637/jss.v067.i01 [Google Scholar]
  4. Box, G. E. P., & Cox, D. R.
    (1964) An analysis of transformations. Journal of the Royal Statistical Society. Series B (Methodological), , –. 10.1111/j.2517‑6161.1964.tb00553.x
    https://doi.org/10.1111/j.2517-6161.1964.tb00553.x [Google Scholar]
  5. Burchill, Z. J., & Jaeger, T. F.
    (2024) How reliable are standard reading time analyses? Hierarchical bootstrap reveals substantial power over-optimism and scale-dependent Type I error inflation. Journal of Memory and Language, , Article 104494. 10.1016/j.jml.2023.104494
    https://doi.org/10.1016/j.jml.2023.104494 [Google Scholar]
  6. Cuetos, F., Glez-Nosti, M., Barbón, A., & Brysbaert, M.
    (2011) SUBTLEX-ESP: Spanish word frequencies based on film subtitles. Psicológica, (), –. hdl.handle.net/10651/6265
    [Google Scholar]
  7. Drummer, J.-D., & Felser, C.
    (2018) Cataphoric pronoun resolution in native and non-native sentence comprehension. Journal of Memory and Language, , –. 10.1016/j.jml.2018.04.001
    https://doi.org/10.1016/j.jml.2018.04.001 [Google Scholar]
  8. Fox, J., & Weisberg, S.
    (2019) An R companion to applied regression (3rd ed.). Sage.
    [Google Scholar]
  9. Keating, G. D.
    (2022) The effect of age of onset of bilingualism on gender agreement processing in Spanish as a heritage language. Language Learning, (), –. 10.1111/lang.12510
    https://doi.org/10.1111/lang.12510 [Google Scholar]
  10. (2024) Morphological markedness and the temporal dynamics of gender agreement processing in Spanish as a majority and a heritage language. Language Learning. Advance online publication. 10.1111/lang.12662
    https://doi.org/10.1111/lang.12662 [Google Scholar]
  11. Nicklin, C., & Plonsky, L.
    (2020) Outliers in L2 research in applied linguistics: A synthesis and data re-analysis. Annual Review of Applied Linguistics, , –. 10.1017/S0267190520000057
    https://doi.org/10.1017/S0267190520000057 [Google Scholar]
  12. Osborne, J.
    (2010) Improving your data transformations: Applying the Box-Cox transformation. Practical Assessment, Research, and Evaluation, , Article 12. https://scholarworks.umass.edu/pare/vol15/iss1/12
    [Google Scholar]
  13. R Core Team
    R Core Team (2023) R: A language and environment for statistical computing (Version 4.3.1) [Computer software]. R Foundation for Statistical Computing. Retrieved fromhttps://www.R-project.org/
    [Google Scholar]
  14. Ratcliff, R.
    (1993) Methods for dealing with reaction time outliers. Psychological Bulletin, (), –. 10.1037/0033‑2909.114.3.510
    https://doi.org/10.1037/0033-2909.114.3.510 [Google Scholar]
  15. Rayner, K.
    (1998) Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, (), –. 10.1037/0033‑2909.124.3.372
    https://doi.org/10.1037/0033-2909.124.3.372 [Google Scholar]
  16. Revelle, W.
    (2024) Psych: Procedures for psychological, psychometric, and personality research. Northwestern University, Evanston, Illinois.
    [Google Scholar]
  17. SR Research
    SR Research (2005) EyeLink 1000 [Apparatus and software]. https://www.sr-research.com/eyelink-1000-plus/
    [Google Scholar]
  18. Venables, W. N., & Ripley, B. D.
    (2002) Modern applied statistics with S (4th ed.). Springer. 10.1007/978‑0‑387‑21706‑2
    https://doi.org/10.1007/978-0-387-21706-2 [Google Scholar]
/content/journals/10.1075/lab.24017.kea
Loading
/content/journals/10.1075/lab.24017.kea
Loading

Data & Media loading...

  • Article Type: Research Article
Keywords: skewness ; outliers ; response times ; Box-Cox transformation ; reading times
This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error