1887
Volume 25, Issue 2
  • ISSN 0929-9971
  • E-ISSN: 1569-9994
USD
Buy:$35.00 + Taxes

Abstract

Abstract

Reutilization and interoperability are major issues in the fields of knowledge representation and extraction, as reflected in initiatives such as the Semantic Web and the Linked Open Data Cloud. This paper shows how terminological resources can be integrated and reused within different types of application. EcoLexicon is a multilingual terminological knowledge base (TKB) on environmental science that integrates conceptual, linguistic and visual information. It has led to the following by-products: (i) the EcoLexicon English Corpus; (ii) EcoLexiCAT, a terminology-enhanced translation tool; and (iii) Manzanilla, an image annotation tool. This paper explains EcoLexicon and its by-products, and shows how the latter exploit and enhance the data in the TKB.

Loading

Article metrics loading...

/content/journals/10.1075/term.00037.leo
2019-11-26
2020-09-29
Loading full text...

Full text loading...

References

  1. Barriuso, Adela Barriuso, and Antonio Torralba
    2012Notes on Image Annotation. arXiv:1210.3448 [cs.CV].
    [Google Scholar]
  2. Buendía Castro, Miriam
    2013 Phraseology in Specialized Language and its Representation in Environmental Knowledge Resources. PhD Thesis. Granada, Spain: University of Granada.
  3. Bodenreider, Olivier
    2004 “The Unified Medical Language System (UMLS): Integrating Biomedical Terminology.” Nucleic Acids Research32 (1): 267–270. doi:  10.1093/nar/gkh061
    https://doi.org/10.1093/nar/gkh061 [Google Scholar]
  4. Christensen, Tina Paulsen, and Anne Schjoldager
    2016 “Computer-aided Translation Tools: The Uptake and Use by Danish Service Providers.” Journal of Specialized Translation25: 89–105.
    [Google Scholar]
  5. Durán Muñoz, Isabel
    2012 “Specialized Lexicographical Resources: A Survey of Translators’ Needs.” IneLexicography in the 21st Century: New Challenges, New applications, ed. byS. Granger and M. Paquot, 55–66. Proceedings of ELEX2009. Cahiers du Cental. Vol.7, 55–66. Louvain-La-Neuve: Presses Universitaires de Louvain.
    [Google Scholar]
  6. Faber, Pamela
    (ed) 2012A Cognitive Linguistics View of Terminology and Specialized Language. Berlin/Boston: De Gruyter Mouton. 10.1515/9783110277203
    https://doi.org/10.1515/9783110277203 [Google Scholar]
  7. Faber, Pamela, Pilar León-Araúz, Juan Antonio Prieto Velasco, and Arianne Reimerink
    2007 “Linking Images and Words: The Description of Specialized Concepts.” International Journal of Lexicography, 20 (1): 39–65. doi:  10.1093/ijl/ecl038
    https://doi.org/10.1093/ijl/ecl038 [Google Scholar]
  8. Faber, Pamela, Pilar León-Araúz, and Arianne Reimerink
    2014 “Representing Environmental Knowledge in EcoLexicon.” InLanguages for Specific Purposes in the Digital Era, ed. byE. Bárcena, T. Read and J. Arhus. Educational Linguistics 19, 267–301. Berlin, Heidelberg: Springer. 10.1007/978‑3‑319‑02222‑2_13
    https://doi.org/10.1007/978-3-319-02222-2_13 [Google Scholar]
  9. 2016 “EcoLexicon: New Features and Challenges.” InGLOBALEX 2016: Lexicographic Resources for Human Language Technology in conjunction with the 10th edition of the Language Resources and Evaluation Conference, ed. byI. Kernerman, I. Kosem Trojina, S. Krek, and L. Trap-Jensen, 73–80, Portorož, Slovenia.
    [Google Scholar]
  10. Federico, Marcello, Nicola Bertoldi, Mauro Cettolo, Matteo Negri, Marco Turchi, Marco Trombetti, Alessandro Cattelan, Antonio Farina, Domenico Lupinetti, Andrea Martines, Alberto Massidda, Holger Schwenk, Loic Barrault, Frederic Blain, Philipp Koehn, Christian Buck, and Ulrich Germann
    2014 “The Matecat Tool.” InProceedings of COLING 2014: System Demonstrations, 129–132. Dublin, Ireland.
    [Google Scholar]
  11. Fellbaum, Christiane
    (ed.) 1998WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press. 10.7551/mitpress/7287.001.0001
    https://doi.org/10.7551/mitpress/7287.001.0001 [Google Scholar]
  12. Fillmore, Charles J.
    1982 “Frame Semantics.” InLinguistics in the Morning Calm, ed. byThe Linguistic Society of Korea, 111–137. Seoul: Hanshin.
    [Google Scholar]
  13. Fillmore, Charles J., and Beryl T. Sue Atkins
    1992 “Toward a Frame-based Lexicon: The Semantics of RISK and its Neighbors.” InFrames, Fields and Contrasts: New Essays in Semantic and Lexical Organization, ed. byA. Lehrer, and E. Kittay, 75–102. Hillsdale NJ: Erlbaum.
    [Google Scholar]
  14. Fontenelle, Thierry, and Dieter Rummel
    2014 “Term Banks.” InInternational Handbook of Lexis and Lexicography, ed. byP. Hanks, and G.-M. De Schrijver, 1–12. Berlin-Heidelberg: Springer. 10.1007/978‑3‑642‑45369‑4_21‑1
    https://doi.org/10.1007/978-3-642-45369-4_21-1 [Google Scholar]
  15. Gornostay, Tatiana, and Andrejs Vasiljevs
    2014 “Terminology Resources and Terminology Work Benefit from Cloud Services.” InProceedings of the 9th International Language Resources and Evaluation Conference (LREC2014), 1942–1948. Reykjavik, Iceland.
    [Google Scholar]
  16. HarperCollins Publishers L.L.C.
    HarperCollins Publishers L.L.C.The Collins English Dictionary. (https://www.collinsdictionary.com). Accessed26 June 2019.
    [Google Scholar]
  17. International Organization for Standarization (ISO)
    International Organization for Standarization (ISO) (1999) ISO 12620. Computer applications in terminology – Data categories. Geneva: ISO.
    [Google Scholar]
  18. International Organization for Standarization (ISO)
    International Organization for Standarization (ISO) (2001) ISO 9126. Software quality characteristics. Geneva: ISO.
    [Google Scholar]
  19. Jakubíček, Miloš, Adam Kilgarriff, Dianna McCarthy, and Pavel Rychlý
    2010 “Fast Syntactic Searching in Very Large Corpora for Many Languages.” InProceedings of the PACLIC24: 741–747. Sendai, Japan.
    [Google Scholar]
  20. Kellogg, M.
  21. Kilgarriff, Adam, Pavel Rychly, Pavel Smrz, and David Tugwell
    2004 “The Sketch Engine.” InProceedings of the 11th EURALEX International Congress, 105–116. Lorient, France.
    [Google Scholar]
  22. Lemon – Lexicon Model for Ontologies
    Lemon – Lexicon Model for Ontologies. (https://lemon-model.net/). Accessed26 June 2019.
  23. León-Aráuz, Pilar, Pamela Faber, and Silvia Montero Martínez
    2012 “Specialized Language Semantics.” InA cognitive linguistics view of terminology and specialized language, ed. byP. Faber, 95–175. Berlin, Boston: De Gruyter Mouton.
    [Google Scholar]
  24. León-Araúz, Pilar, and Arianne Reimerink
    2018 “Evaluating EcoLexiCAT: a Terminology-Enhanced CAT Tool.” InProceedings of the 11th International Language Resources and Evaluation Conference (LREC2018), 2374–2381. Miyazaki, Japan.
    [Google Scholar]
  25. 2016 “Evaluation of EcoLexicon Images.” InJoint Second Workshop on Language and Ontology & Terminology and Knowledge Structures (LangOnto2 + TermiKS) in conjunction with the 10th edition of the Language Resources and Evaluation Conference, ed. byF. Khan , 16–22, Portorož, Slovenia.
    [Google Scholar]
  26. León-Araúz, Pilar, Arianne Reimerink, and Alejandro García Aragón
    2013 “Dynamism and Context in Specialized Knowledge.” Terminology19(1): 31–61. doi:  10.1075/term.19.1.02leo
    https://doi.org/10.1075/term.19.1.02leo [Google Scholar]
  27. León-Araúz, Pilar, Arianne Reimerink, and Pamela Faber
    2017 “EcoLexiCAT: A Terminology-Enhanced Translation Tool for Texts on the Environment.” InElectronic Lexicography in the 21st century. Proceedings of eLex 2017 conference, ed. byI. Kosem, J. Kallas, C. Tiberius, S. Krek, M. Jakubíček, and V. Baisa, 321–341. Brno: Lexical Computing CZ s.r.o.
    [Google Scholar]
  28. 2009 “Knowledge Extraction on Multidimensional Concepts: Corpus Pattern Analysis (CPA) and Concordances.” In8ème Conférence Internationale Terminologie et Intelligence Artificielle. Toulouse. (ceur-ws.org/Vol-578/paper24.pdf). Accessed26 June 2019.
    [Google Scholar]
  29. León-Araúz, Pilar, and Antonio San Martín
    2018 “The EcoLexicon Semantic Sketch Grammar: from Knowledge Patterns to Word Sketches.” InProceedings of the LREC 2018 Workshop Globalex 2018 – Lexicography & WordNets, ed. byI. Kerneman and S. Krek, 94–99. Miyazaki: Globalex.
    [Google Scholar]
  30. León-Araúz, Pilar, Antonio San Martín, and Pamela Faber
    2016 “Pattern-based Word Sketches for the Extraction of Semantic Relations.” InProceedings of the 5th International Workshop on Computational Terminology (Computerm2016), 73–82. Osaka, Japan.
    [Google Scholar]
  31. León-Araúz, Pilar, Antonio San Martín, and Arianne Reimerink
    2018 “The EcoLexicon English Corpus as an Open Corpus in Sketch Engine.” InProceedings of the 18th EURALEX International Congress, ed. byJ. Čibej, V. Gorjanc, I. Kosem, and S. Krek, 893–901. Ljubljana, Slovenia.
    [Google Scholar]
  32. Mayer, Richard, and Richard B. Anderson
    1992 “The Instructive Animation: Helping Students Build Connections between Words and Pictures in Multimedia Learning.” Journal of Educational Psychology84 (4): 715–726. 10.1037/0022‑0663.84.4.444
    https://doi.org/10.1037/0022-0663.84.4.444 [Google Scholar]
  33. McCrae, John, Guadalupe Aguado de Cea, Paul Buitelaar, Philipp Cimiano, Thierry Declerck, Asunción Gómez-Pérez, Jorge Gracia, Laura Hollink, Elena Montiel-Ponsoda, Dennis Spohr, and Tobias Wunner
    2012 “Interchanging Lexical Resources on the Semantic Web.” Language Resources and Evaluation46 (6): 701–709. 10.1007/s10579‑012‑9182‑3
    https://doi.org/10.1007/s10579-012-9182-3 [Google Scholar]
  34. McCray, Alexa T., and Stuart J. Nelson
    1995 “The Representation of Meaning in the UMLS.” Methods of Information in Medicine34: 193–201. 10.1055/s‑0038‑1634592
    https://doi.org/10.1055/s-0038-1634592 [Google Scholar]
  35. Moro, Andrea, Francesco Cecconi, and Roberto Navigli
    2014 “Multilingual Word Sense Disambiguation and Entity Linking for Everybody.” InProceedings of the 2014 International Conference on Posters & Demonstrations Track1272, 25–28. Riva del Garda, Italy.
    [Google Scholar]
  36. Moro, Andrea, Alessandro Raganato, and Roberto Navigli
    2014 “Entity Linking Meets Word Sense Disambiguation: A Unified Approach.” Transactions of the Association for Computational Linguistics2: 231–244. 10.1162/tacl_a_00179
    https://doi.org/10.1162/tacl_a_00179 [Google Scholar]
  37. Navigli, Roberto, and Simone Paolo Ponzetto
    2012 “BabelNet: The Automatic Construction, Evaluation and Application of a Wide-coverage Multilingual Semantic Network.” Artificial Intelligence193: 217–250. 10.1016/j.artint.2012.07.001
    https://doi.org/10.1016/j.artint.2012.07.001 [Google Scholar]
  38. Paivio, Allan
    1971Imagery and Verbal Processes. New York: Holt, Rinehart & Winston.
    [Google Scholar]
  39. 1986Mental Representations: A Dual-Coding Approach. New York: Oxford University Press.
    [Google Scholar]
  40. Poignant, Johann, Mateusz Budnik, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau, Gilles Adda, Laurent Besacier, Hazim Ekenel, Gil Francopoulo, Javier Hernando, Joseph Mariani, Ramon Morros, Georges Quénot, Sophie Rosset, and Thomas Tamisier
    2016 “The CAMOMILE Collaborative Annotation Platform for Multi-modal, Multi-lingual and Multi-media Documents.” InProceedings of the 10th edition of the Language Resources and Evaluation Conference (LREC’16), ed. byN. Calzolari , 1421–1425, Portorož, Slovenia.
    [Google Scholar]
  41. Prieto Velasco, Juan Antonio
    2008 Información gráfica y grados de especialidad en el discurso científico-técnico: un estudio de corpus. PhD Thesis, University of Granada.
  42. Prieto Velasco, Juan Antonio, and Pamela Faber
    2012 “Graphical Information.” InA Cognitive Linguistics View of Terminology and Specialized Language, ed. byP. Faber, 225–248. Berlin/Boston: De Gruyter Mouton.
    [Google Scholar]
  43. Reimerink, Arianne, and Pilar León-Araúz
    2018 “Manzanilla: An Image Annotation Tool for TKB Building.” InProceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), ed. byN. Calzolari, K. Choukri, C. Cieri, T. Declerck, S. Goggi, K. Hasida, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis, and T. Tokunaga. Miyazaki, Japan (www.lrec-conf.org/proceedings/lrec2018/index.html). Accessed2 July 2019.
    [Google Scholar]
  44. Reimerink, Arianne, Pilar León-Araúz, and Pamela Faber
    2016 “Image Selection and Annotation for an Environmental Knowledge Base.” Language Resources and Evaluation50: 1–32. 10.1007/s10579‑016‑9345‑8
    https://doi.org/10.1007/s10579-016-9345-8 [Google Scholar]
  45. San Martín, Antonio, Melania Cabezas-García, Miriam Buendía Castro, Beatriz Sánchez-Cárdenas, Pilar León-Araúz, and Pamela Faber
    2017 “Recent Advances in EcoLexicon.” Dictionaries: Journal of the Dictionary Society of North America38 (1): 96–115. 10.1353/dic.2017.0004
    https://doi.org/10.1353/dic.2017.0004 [Google Scholar]
  46. Tudhope, Douglas, Traugott Koch, and Rachel Heery
    2006Terminology Services and Technology: JISC state of the art review. (www.ukoln.ac.uk/terminology/JISC-review2006.html). Accessed26 June 2019).
    [Google Scholar]
  47. Vossen, Piek
    2004 “EUROWORDNET: A Multilingual Database of Autonomous and Language-specific Wordnets Connected via an Inter-lingual Index.” International Journal of Lexicography17 (2): 161–173. doi:  10.1093/ijl/17.2.161
    https://doi.org/10.1093/ijl/17.2.161 [Google Scholar]
  48. DeepL GmbH
    DeepL GmbH. Linguee. (https://www.linguee.es/). Accessed26 June 2019.
    [Google Scholar]
  49. EcoLexicon: Terminological Knowledge Base on the Environment
    EcoLexicon: Terminological Knowledge Base on the Environment. (https://ecolexicon.ugr.es). Accessed26 June 2019.
  50. European Environment Information and Observation Network
    European Environment Information and Observation Network. GEneral Multilingual Environmental Thesaurus (GEMET). (https://www.eionet.europa.eu/gemet/en/themes/). Accessed26 June 2019.
    [Google Scholar]
  51. TaaS Consortium
    TaaS Consortium. Terminology as a Service (TaaS). (www.taas-project.eu/). Accessed26 June 2019.
    [Google Scholar]
  52. Translation Centre for the Bodies of the European Union in Luxembourg
    Translation Centre for the Bodies of the European Union in Luxembourg. Inter-Active Terminology for Europe (IATE). (https://iate.europa.eu/home). Accessed26 June 2019).
    [Google Scholar]
  53. United States Department of Health & Human Services
    United States Department of Health & Human Services. National Library of Medicine (NLM). (https://www.nlm.nih.gov/). Accessed26 June 2019.
    [Google Scholar]
  54. United Stated National Library of Medicine
    United Stated National Library of Medicine. Unified Medical Language System (UMLS). (https://www.nlm.nih.gov/research/umls/). Accessed26 June 2019.
  55. Wikimedia Foundation
    Wikimedia Foundation. Wikipedia. (https://www.wikipedia.org/). Accessed26 June 2019.
    [Google Scholar]
  56. World Wide Web Consortium
    World Wide Web Consortium. Simple Knowledge Organization System (SKOS). (https://www.w3.org/2004/02/skos/). Accessed26 June 2019.
    [Google Scholar]
http://instance.metastore.ingenta.com/content/journals/10.1075/term.00037.leo
Loading
/content/journals/10.1075/term.00037.leo
Loading

Data & Media loading...

  • Article Type: Research Article
Keyword(s): CAT tool , image annotation , interoperability , specialized corpus and terminological knowledge base
This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error