1887
Volume 13, Issue 1
  • ISSN 0929-9971
  • E-ISSN: 1569-9994
USD
Buy:$35.00 + Taxes

Abstract

The recognition and extraction of terms and their variants in texts are crucial processes in text mining. We use the ILC platform, an automatic controlled indexing platform, to perform these linguistic processes. We present a methodology for enhancing the recognition of syntactic term variation in English, using syntactic and morpho-syntactic features. Principal spurious variants of terms are ascribed to incorrect word dependencies. To overcome these problems, we consider each term variant as a window on the sentence and introduce two criteria: an internal syntactic criterion which checks that the dependencies between words in the window are respected, and an external criterion which defines boundaries, making it possible to ensure that the window is well positioned in the sentence. The use of these criteria improves filtering of the variants and assists the expert in validating the indexing.

Loading

Article metrics loading...

/content/journals/10.1075/term.13.1.03vil
2007-01-01
2025-02-13
Loading full text...

Full text loading...

/content/journals/10.1075/term.13.1.03vil
Loading
This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error