1887
Volume 12, Issue 3
  • ISSN 1384-6655
  • E-ISSN: 1569-9811
USD
Buy:$35.00 + Taxes

Abstract

COMPARA is a bidirectional parallel corpus of English and Portuguese, currently with 3 million words. The corpus was launched in 2000 and at present it is possibly the largest edited parallel corpus publicly available on the Web, with roughly 6,000 corpus queries per month. This paper summarizes an analysis of six years of corpus use. We begin by looking at user studies for language resources, especially corpora, and then we provide a snapshot of COMPARA’s users and their behaviour based on log analysis. Particular emphasis is given to the language interface preferred by users (Portuguese and English are possible), the choice between the Simple and Complex Search modes, the reasons underlying null-results and behaviour after restricted output. The data has pointed us to cases where COMPARA’s Web interface can be improved, and provided insights about our users and the problems they face, although further studies that distinguish between different kinds of users remain necessary.

Loading

Article metrics loading...

/content/journals/10.1075/ijcl.12.3.03san
2007-01-01
2019-08-23
Loading full text...

Full text loading...

References

http://instance.metastore.ingenta.com/content/journals/10.1075/ijcl.12.3.03san
Loading
  • Article Type: Research Article
Keyword(s): English , error analysis , evaluation , interface design , log analysis , parallel corpora , Portuguese and usability
This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error