1887
image of Evaluating the extraction of Italian institutional terminology
USD
Buy:$35.00 + Taxes

Abstract

Abstract

In institutional settings, terminology management is essential to ensure efficient communication. Traditionally, this task has been carried out manually or through the use of corpus analysis tools. However, recent advances in generative artificial intelligence have opened new avenues for automating terminographic tasks. In this context, the use of generative models is proposed for the extraction of specialised terminology in academic institutions. Specifically, this study compares two approaches to terminology extraction. On the one hand, a corpus-based approach using the Sketch Engine tool and, on the other hand, an approach based on generative artificial intelligence. To this end, UniPDTerms was implemented — a chatbot designed with ChatGPT-4o specialised in institutional terminology of the University of Padua (Italy) and fed with an corpus. The evaluation of both systems was performed using a reference list and analysing precision, recall, F-score and MRR metrics for each model. The results indicate that Sketch Engine and UniPDTerms performed at a comparable level under identical evaluation conditions. Although the two systems use different extraction mechanisms, their outputs produce similar results: Sketch Engine extracts relevant term candidates using frequency-based corpus analysis, whereas UniPDTerms draws on contextual and semantic relations. These results highlight the potential of incorporating generative artificial intelligence into terminographic workflows, offering new possibilities for improving efficiency and supporting end-users in terminology management.

Loading

Article metrics loading...

/content/journals/10.1075/term.25041.ort
2026-02-26
2026-03-06
Loading full text...

Full text loading...

References

  1. Bowker, Lynne, and Jennifer Pearson
    2002Working with Specialized Language: A Practical Guide to Using Corpora. London: Routledge. 10.4324/9780203469255
    https://doi.org/10.4324/9780203469255 [Google Scholar]
  2. Castillo-Pérez, Esther, and Silvia Montero-Martínez
    2021 “Internationalisation and Terminology Management in Higher Education.” InLinguistic, Educational and Intercultural Research 2021 (LEIC Research 2021), –. https://www.researchgate.net/publication/360672845
    [Google Scholar]
  3. Chiocchetti, Elena, Vesna Lušicky, and Tanja Wissik
    2023 “Multilingual Legal Terminology Databases: Workflows and Roles.” InHandbook of Terminology: Vol. 3. Legal Terminology, edited byŁucja Biel and Hendrik J. Kockaert, –. Amsterdam: John Benjamins Publishing Company. 10.1075/hot.3.mul1
    https://doi.org/10.1075/hot.3.mul1 [Google Scholar]
  4. Curry, Neil, Paul Baker, and Gavin Brookes
    2024 “Generative AI for Corpus Approaches to Discourse Studies: A Critical Evaluation of ChatGPT.” Applied Corpus Linguistics (): . 10.1016/j.acorp.2023.100082
    https://doi.org/10.1016/j.acorp.2023.100082 [Google Scholar]
  5. Daille, Béatrice
    2017Term Variation in Specialised Corpora: Characterisation, Automatic Discovery and Applications. Amsterdam: John Benjamins Publishing Company. 10.1075/tlrp.19
    https://doi.org/10.1075/tlrp.19 [Google Scholar]
  6. de Wit, Hans
    2011 “Globalisation and Internationalisation of Higher Education.” Revista de Universidad y Sociedad del Conocimiento (RUSC) (): –. rusc.uoc.edu/ojs/index.php/rusc/article/view/v8n2-dewit/v8n2-dewit-eng
    [Google Scholar]
  7. Dobrina, Claudia
    2015 “Getting to the Core of a Terminological Project.” InHandbook of Terminology, vol., edited byHendrik J. Kockaert and Frieda Steurs, –. Amsterdam: John Benjamins. 10.1075/hot.1.get1
    https://doi.org/10.1075/hot.1.get1 [Google Scholar]
  8. Drouin, Patrick
    2003 “Term Extraction Using Non-Technical Corpora as a Point of Leverage.” Terminology (): –. 10.1075/term.9.1.06dro
    https://doi.org/10.1075/term.9.1.06dro [Google Scholar]
  9. Gao, Yuan, Ruili Wang, and Feng Hou
    2023 “How to Design Translation Prompts for ChatGPT: An Empirical Study.” arXiv. arxiv.org/abs/2304.02182
    [Google Scholar]
  10. Hazem, Amir, Mérième Bouhandi, Florian Boudin, and Béatrice Daille
    2020 “TermEval 2020: TALN-LS2N System for Automatic Term Extraction.” InProceedings of the 6th International Workshop on Computational Terminology (COMPUTERM 2020), –. Marseille: European Language Resources Association (ELRA).
    [Google Scholar]
  11. Heylen, Kris, and Dirk De Hertog
    2015 “Automatic Term Extraction.” InHandbook of Terminology, vol., edited byHendrik J. Kockaert and Frieda Steurs, –. Amsterdam: John Benjamins Publishing Company. 10.1075/hot.1.aut1
    https://doi.org/10.1075/hot.1.aut1 [Google Scholar]
  12. Irrera, Ornella, Stefano Marchesin, and Gianmaria Silvello
    2024 “MetaTron: Advancing Biomedical Annotation Empowering Relation Annotation and Collaboration.” BMC Bioinformatics (): . 10.1186/s12859‑024‑05730‑9
    https://doi.org/10.1186/s12859-024-05730-9 [Google Scholar]
  13. Jiao, Wenxiang, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, and Zhaopeng Tu
    2023 “Is ChatGPT a Good Translator? Yes with GPT-4 as the Engine.” arXiv. arxiv.org/abs/2301.08745
    [Google Scholar]
  14. Kilgarriff, Adam, Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovář, Jan Michelfeit, Pavel Rychlý, and Vít Suchomel
    2014 “The Sketch Engine: Ten Years On.” Lexicography: –. 10.1007/s40607‑014‑0009‑9
    https://doi.org/10.1007/s40607-014-0009-9 [Google Scholar]
  15. Kilgarriff, Adam
    2009 “Simple Maths for Keywords.” InProceedings of Corpus Linguistics Conference CL2009 (University of Liverpool, UK, July 2009), edited byM. Mahlberg, V. González-Díaz, and C. Smith. https://www.sketchengine.eu/wp-content/uploads/2015/04/2009-Simple-maths-for-keywords.pdf
    [Google Scholar]
  16. Macken, Lieve, Els Lefever, and Véronique Hoste
    2013 “TExSIS: Bilingual Terminology Extraction from Parallel Corpora Using Chunk-Based Alignment.” Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication (): –. 10.1075/term.19.1.01mac
    https://doi.org/10.1075/term.19.1.01mac [Google Scholar]
  17. Montero-Martínez, Silvia
    2025Terminología en instituciones académicas: Modelos de gestión en un contexto internacional. Granada: Comares. ISBN: 978-84-1369-765-9.
    [Google Scholar]
  18. 2023a “UGRTerm®: Gestión Terminológica para el Ámbito Académico.” InTraducción e Interpretación Especializadas en Ámbito Panhispánico, vol., edited byPilar Sorbet and Verónica D. Valle Cacela, –. Berlin: Peter Lang GmbH, Internationaler Verlag der Wissenschaften. ebookcentral.proquest.com/lib/ugr/detail.action?docI
    [Google Scholar]
  19. 2023b “Training Corporate and Institutional Terminologists: A Case Study at the University of Granada.” The Interpreter and Translator Trainer (): –. 10.1080/1750399X.2023.2237326
    https://doi.org/10.1080/1750399X.2023.2237326 [Google Scholar]
  20. Montero-Martínez, Silvia, Pamela Faber-Benítez, and Miriam Buendía-Castro
    2011Terminología para Traductores e Intérpretes: Una Perspectiva Integradora. 2nd ed. Granada: Tragacanto.
    [Google Scholar]
  21. Ortiz-Garduño, Helena, and Daniel Torres-Salinas
    2025 “GPTBot Development for Translation Purposes: Flowchart, Practical Case and Future Prospects.” Journal of Language and Education (): –. 10.17323/jle.2025.21727
    https://doi.org/10.17323/jle.2025.21727 [Google Scholar]
  22. Ortiz-Garduño, Helena, and Victoria Di Césare
    2025 “Generative artificial intelligence applied to terminological variation: The concept of LOCAL RESEARCH as a case study for the development of a ChatGPT bot.” InProceedings of the III International Conference on Digital Linguistics (CILIDI), , Valencia, Spain.
    [Google Scholar]
  23. OpenAI
    OpenAI 2024 “Hello GPT-4-o.” OpenAI. May 13, 2024. https://openai.com/es-ES/index/hello-gpt-4o/
    [Google Scholar]
  24. OpenAI
    OpenAI 2023 “GPT-4 Technical Report.” arXiv. arxiv.org/abs/2303.08774
    [Google Scholar]
  25. Palomar-González, Virginia
    2004 “La Importancia de la Normalización Terminológica.” InActas del II Congreso El Español, Lengua de Traducción: Las Palabras del Traductor, edited byLuis González and Pollux Hernúñez, –. Madrid: Esletra.
    [Google Scholar]
  26. Pavel, Silvia
    2001Handbook of Terminology. Ottawa: Translation Bureau of Canada. www.translationbureau.gc.ca
    [Google Scholar]
  27. Pazienza, Maria Teresa, Marco Pennacchiotti, and Fabio Massimo Zanzotto
    2005 “Terminology Extraction: An Analysis of Linguistic and Statistical Approaches.” InStudies in Fuzziness and Soft Computing, vol., –. Berlin: Springer-Verlag. https://www.springerlink.com
    [Google Scholar]
  28. Repar, Andraž, Vid Podpečan, Anže Vavpetič, Nada Lavrač, and Senja Pollak
    2022 “TermEnsembler.” Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication (): –. 10.1075/term.00029.rep
    https://doi.org/10.1075/term.00029.rep [Google Scholar]
  29. Rigouts-Terryn, Ayla, Véronique Hoste, and Els Lefever
    2020 “In No Uncertain Terms: A Dataset for Monolingual and Multilingual Automatic Term Extraction from Comparable Corpora.” Language Resources and Evaluation (): –. 10.1007/s10579‑019‑09453‑9
    https://doi.org/10.1007/s10579-019-09453-9 [Google Scholar]
  30. Rigouts-Terryn, Ayla, Véronique Hoste, Patrick Drouin, and Els Lefever
    2020 “TermEval 2020: Shared Task on Automatic Term Extraction Using the Annotated Corpora for Term Extraction Research (ACTER) Dataset.” InProceedings of the 6th International Workshop on Computational Terminology (COMPUTERM 2020), –. Marseille: European Language Resources Association (ELRA).
    [Google Scholar]
  31. Sager, Juan C.
    1990A Practical Course in Terminology Processing. Amsterdam: John Benjamins Publishing Company. 10.1075/z.44
    https://doi.org/10.1075/z.44 [Google Scholar]
  32. Sahari, Yousef, Abdu M. T. Al-Kadi, and Jamak K. M. Ali
    2023 “A Cross-Sectional Study of ChatGPT in Translation: Magnitude of Use, Attitudes, and Uncertainties.” Journal of Psycholinguistic Research (): –. 10.1007/s10936‑023‑10031‑y
    https://doi.org/10.1007/s10936-023-10031-y [Google Scholar]
  33. San Martín, Antonio
    2024 “What Generative Artificial Intelligence Means for Terminological Definitions.” Paper presented at the3rd International Conference on Multilingual Digital Terminology Today: Design, Representation Formats and Management Systems, 27–28 June 2024, Granada, Spain.
    [Google Scholar]
  34. Schmitz, Klaus-Dirk, and Daniela Straub
    2018Successful Terminology Management in Companies. Stuttgart: tcworld GmbH.
    [Google Scholar]
  35. Schweizerische Bundeskanzlei
    Schweizerische Bundeskanzlei 2018Terminologiearbeit. DES — Konferenz der Übersetzungsdienste Europäischer Staaten. www.cotsoes.org/de/publications
    [Google Scholar]
  36. Sinclair, John
    1991Corpus, Concordance, Collocation. Oxford: Oxford University Press.
    [Google Scholar]
  37. TerminOrgs
    TerminOrgs 2016Terminology Starter Guide. Edited byKara Karburton. https://www.terminorgs.net/downloads/TerminOrgs_StarterGuide_V2.pdf
    [Google Scholar]
  38. UNESCO
    UNESCO 2019Preliminary Study on the Ethics of Artificial Intelligence. COMEST — World Commission on the Ethics of Scientific Knowledge and Technology. https://unesdoc.unesco.org/ark:/48223/pf0000367823
    [Google Scholar]
  39. Varantola, Krista
    2002 “Disposable Corpora as Intelligent Tools in Translation.” Cuadernos de Traducción: –. Available at: www.cadernos.ufsc.br/
    [Google Scholar]
  40. Vezzani, Federica
    2022Terminologie Numérique: Conception, Représentation et Gestion. Vol.. Berlin: Peter Lang Verlag. 10.3726/b19407
    https://doi.org/10.3726/b19407 [Google Scholar]
  41. Warburton, Kara
    2021The Corporate Terminologist. Vol.. Amsterdam: John Benjamins. ebookcentral.proquest.com/lib/ugr/detail.action?docID=6507002. 10.1075/tlrp.21
    https://doi.org/10.1075/tlrp.21 [Google Scholar]
  42. Zhao, Biao, Weiqiang Jin, Javier Del Ser, and Guang Yang
    2023 “ChatAgri: Exploring Potentials of ChatGPT on Cross-Linguistic Agricultural Text Classification.” Neurocomputing: . 10.1016/j.neucom.2023.126708
    https://doi.org/10.1016/j.neucom.2023.126708 [Google Scholar]
/content/journals/10.1075/term.25041.ort
Loading
/content/journals/10.1075/term.25041.ort
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error