Bilingual term recognition revisited: The bag-of-equivalents term alignment approach and its evaluation

Spela Vintar

doi:10.1075/term.16.2.01vin

ISSN 0929-9971
E-ISSN: 1569-9994


Source: Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication, Volume 16, Issue 2, Jan 2010, p. 141 - 158
DOI: https://doi.org/10.1075/term.16.2.01vin


Abstract

The paper describes LUIZ, a bilingual term recognition system developed for the Slovene-English language pair. The system is a hybrid term extractor that uses morphosyntactic patterns and statistical ranking to propose domain-specific expressions for each of the two languages, whereupon translation equivalents between the languages are identified using the innovative bag-of-equivalents approach. This simple but effective method uses the Twente word aligner to obtain a lexicon of single-word translation pairs and their probability scores, which is then used to identify correspondences between multi-word terms. The bilingual term recognition system has been tested and evaluated on three parallel subcorpora from the tourism, accounting and military domains. The average precision of the term alignment component is 0.83, whereby only fully equivalent and domain-relevant terms were counted as positives. A further advantage of the described approach is that it successfully detects term variants and multiple translations of a candidate multi-word term. Since our term alignment method does not require sentence-aligned corpora, it can be used with comparable corpora, provided we already have a domain-specific lexicon or dictionary of single-word correspondences. The paper concludes with some thoughts on the users of term recognition systems and their needs, based on our observations from the online version of the system.
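The general idea of matching multi-word terms through a single-word lexicon can be illustrated with a minimal Python sketch. It is not the LUIZ implementation: the abstract does not give the scoring formula, so the pooling by maximum probability, the length-normalised score, the threshold, the function names and the toy lexicon values are all assumptions made for illustration. Words are assumed to be already lemmatised and are matched as plain strings.

```python
from collections import defaultdict

# Hypothetical toy lexicon: source word -> {target word: translation probability}.
# In the paper such a lexicon is obtained from a word aligner; these values are invented.
LEXICON = {
    "letno": {"annual": 0.8, "yearly": 0.1},
    "poročilo": {"report": 0.7, "statement": 0.2},
    "računovodski": {"accounting": 0.9},
    "izkaz": {"statement": 0.6, "report": 0.1},
}


def bag_of_equivalents(source_term, lexicon):
    """Pool the lexicon equivalents of every word in the source term.

    Assumption: if a target word is proposed by several source words,
    the highest probability is kept.
    """
    bag = defaultdict(float)
    for word in source_term.lower().split():
        for target_word, prob in lexicon.get(word, {}).items():
            bag[target_word] = max(bag[target_word], prob)
    return dict(bag)


def score(source_term, target_term, lexicon):
    """Average probability of the target term's words found in the bag.

    Assumption: target words missing from the bag contribute 0.
    """
    bag = bag_of_equivalents(source_term, lexicon)
    target_words = target_term.lower().split()
    if not target_words:
        return 0.0
    return sum(bag.get(w, 0.0) for w in target_words) / len(target_words)


def align(source_term, target_candidates, lexicon, threshold=0.6):
    """Return all target candidates scoring at or above the threshold, best first.

    Returning a list rather than a single best match allows several
    translations or variants of one source term to be kept.
    """
    scored = [(t, score(source_term, t, lexicon)) for t in target_candidates]
    kept = [(t, s) for t, s in scored if s >= threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    candidates = ["accounting statement", "annual report"]
    for term, value in align("letno poročilo", candidates, LEXICON):
        print(term, round(value, 2))
```

With this toy lexicon, "letno poročilo" is matched to "annual report", while "accounting statement" falls below the threshold. The functions work on lists of term candidates only and never consult sentence alignment, which is the property that would let such a method run on comparable corpora once a single-word lexicon is available.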


Article Type: Research Article

Keyword(s): ATR evaluation; bilingual term recognition; comparable corpora; parallel corpora; term alignment; word alignment
