Full text loading...
-
An application and e aluation of the C/NC-value approach for the automatic term recognition of multi-word units in Japanese
- Source: Terminology. International Journal of Theoretical and Applied Issues in Specialized Communication, Volume 6, Issue 2, Jan 2000, p. 175 - 194
Abstract
Technical terms are important for knowledge mining, especially as vast amounts of multi-lingual documents are available over the Internet. Thus, a domain and language-independent method for term recognition is necessary to automatically recognize terms from Internet documents.The C-/NC-value method is an efficient domain-independent multi-word term recognition method which combines linguistic and statistical knowledge. Although the C-value/NC-value method is originally based on the recognition of nested terms in English, our aim is to evaluate the application of the method to other languages and to show its feasibility for multi-language environments.In this article, we describe the application of the C/NC-value method to Japanese texts. Several experiments analysing the performance of the method using the NACSIS Japanese AI-domain corpus demonstrate that the method can be utilized to realize a practical domain-and language-independent term rec-ognition system.