Volume 31, Issue 1
  • ISSN 0929-9971
  • E-ISSN: 1569-9994

Abstract

The interoperability of language resources is crucial for effective communication and data exchange across computational systems. In this context, two ISO/TC 37 standards, the Lexical Markup Framework (LMF) and the Terminological Markup Framework (TMF), play a vital role by providing a common framework for the modelling, representation, and exchange of lexical and terminological data. LMF has been deliberately aligned with TMF to facilitate close coordination between the two standards. This paper explores the convergence between LMF and TMF and argues for a Unified Markup Framework (UMF) that enhances interoperability and effective resource management. Through comparative analysis and real-world examples, we propose a unified meta-model that integrates these frameworks and facilitates the development of advanced language processing applications and multilingual lexicographic and terminology management. The study highlights the opportunities and challenges of such coordination and lays the groundwork for future research on the harmonisation of lexicographic and terminology resources.

Available under the CC BY-NC 4.0 license.
/content/journals/10.1075/term.00084.vez
2025-05-23
2025-06-24