Full text loading...
USD
-
The Linguistic Annotation of Corpora: The TOSCA Analysis System
- Source: International Journal of Corpus Linguistics, Volume 3, Issue 2, Jan 1998, p. 189 - 210
- Previous Article
- Table of Contents
- Next Article
Abstract
The article discusses the role of linguistic annotation in corpus linguistics as opposed to annotation in natural language processing. In corpus linguistics, annotation is an integral part of the process of linguistic interpretation and description of the data. Tagging and parsing are discussed as the automatic counterparts of, respectively, the paradigmatic and the syntagmatic description of corpus data. The requirements for a corpus linguistic annotation system are considered. An account is given of the TOSCA analysis system as representative of such an annotation system. Performance results of the system are given, and an evaluation is made.
© 1998 John Benjamins Publishing Company