Volume 68, Issue 2
  • ISSN 0521-9744
  • E-ISSN: 1569-9668



Technological developments, in particular innovations in natural language processing and tools for exploring big data, have influenced many aspects of the training of translators and interpreters. This paper investigates how learner corpora and research on them contribute to the teaching and learning of translation and interpreting. It starts with a review of the evolution of learner corpora in translator and interpreter training. Drawing on data from the Chinese/English Translation and Interpreting Learner Corpus (CETILC), a learner corpus developed for the study of lexical cohesion, the paper introduces three case studies that illustrate ways of exploring learner data: human annotation, machine-facilitated human annotation, and human-supervised/edited machine annotation. The findings of the case studies point to the complexity of learner language and its intricate relationships with various factors concerning the learner, the text, and the task. The paper ends with a discussion of the great potential of purpose-built learner corpora such as the CETILC in translator and interpreter training, and of the application of learner corpora in the (semi-)automatic processing of learner texts.




