Volume 28, Issue 1
  • ISSN 0929-9971
  • E-ISSN: 1569-9994

Abstract

We describe the creation of a knowledge base in the field of karstology using the frame-based approach. Apart from providing a new multilingual resource that uses manually annotated definitions as the source of structured information, our main focus is on exploring text mining methods to identify targeted knowledge structures in specialised corpora. The first stage of this process is the design of a domain model and its implementation in a definition annotation task. Once annotation is completed, an analysis of typical co-occurrence patterns between semantic categories and the relations describing them allows us to discern ideal definition templates. We demonstrate that such templates not only contribute to more comprehensive and structured representations of concepts, but also help us design targeted text mining experiments to retrieve new semantic relations from text. Two such experiments are presented: the first uses intersections of word embeddings to identify words expressing a specific semantic relation, and the second uses the embedding of the semantic relation to extract multiword units which contain the target relation. Results suggest that the proposed methods are promising for capturing the semantic properties of relations in frame-based knowledge modelling.
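The first experiment can be pictured with a minimal sketch. It is only one plausible reading of "intersections of word embeddings", not the procedure used in the article: for each seed word known to express a relation, take its k nearest neighbours by cosine similarity, then keep the words that all neighbour sets share. The vocabulary, the three-dimensional vectors, the seed words, the value of k and the function names are all invented for illustration.

```python
import math

# Toy embeddings: values are invented for illustration, not taken from any trained model.
vectors = {
    "consists": (1.0, 0.1, 0.0),
    "composed": (0.9, 0.2, 0.1),
    "made": (0.95, 0.15, 0.05),
    "formed": (0.8, 0.3, 0.0),
    "cave": (0.0, 1.0, 0.2),
    "river": (0.1, 0.2, 1.0),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_neighbours(word, vectors, k):
    """Return the set of the k words most similar to `word` (excluding itself)."""
    scored = [
        (cosine(vectors[word], vec), other)
        for other, vec in vectors.items()
        if other != word
    ]
    scored.sort(reverse=True)
    return {other for _, other in scored[:k]}


def neighbour_intersection(seeds, vectors, k):
    """Words that appear among the k nearest neighbours of every seed word."""
    neighbour_sets = [nearest_neighbours(seed, vectors, k) for seed in seeds]
    return set.intersection(*neighbour_sets) - set(seeds)


# Hypothetical seeds for a COMPOSITION-like relation.
candidates = neighbour_intersection(["consists", "composed"], vectors, k=3)
print(sorted(candidates))
```

With real embeddings, the candidate list would still need manual review, since words close to all seeds are not guaranteed to express the target relation.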

/content/journals/10.1075/term.21005.vin
2022-01-27
2022-05-23
  • Article Type: Research Article
Keyword(s): frame-based terminology; karstology; semantic relations; word embeddings