Named Entities: Recognition, classification and use
  • ISSN 0378-4169
  • E-ISSN: 1569-9927

Abstract

Conditional random fields are state-of-the-art models for sequencing tasks such as named entity recognition. However, being globally conditioned, they have a tendency to overfit to a greater extent than other sequencing models. We introduce an approach to combat this overfitting called a logarithmic opinion pool (LOP). A LOP consists of a weighted combination of constituent models. We present the theory behind LOPs, and show that effective LOPs require constituent models that are diverse from one another. We examine different ways to introduce such diversity, including an approach that involves training the constituent models together, interactively. Our results show that, as expected from the underlying theory, explicitly optimising for constituent model diversity can improve performance over standard approaches to regularisation.
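As an illustration of the weighted combination described above, here is a minimal sketch of a logarithmic opinion pool over discrete label distributions, assuming the standard definition in which the pooled probability of a label is proportional to the product of each constituent model's probability raised to its weight. The function name `log_opinion_pool`, the toy distributions, and the equal weights are hypothetical. The article applies LOPs to conditional random fields over whole label sequences, which this single-label toy does not attempt to reproduce.

```python
import math


def log_opinion_pool(distributions, weights):
    """Pool discrete distributions over the same labels.

    p_lop(y) is proportional to prod_a p_a(y) ** w_a.
    Assumes every probability is strictly positive (log(0) is undefined).
    """
    labels = list(distributions[0].keys())
    # Unnormalised log score: weighted sum of log-probabilities per label.
    log_scores = {
        y: sum(w * math.log(p[y]) for p, w in zip(distributions, weights))
        for y in labels
    }
    # Normalise with log-sum-exp for numerical stability.
    m = max(log_scores.values())
    log_z = m + math.log(sum(math.exp(s - m) for s in log_scores.values()))
    return {y: math.exp(s - log_z) for y, s in log_scores.items()}


# Hypothetical label distributions for one token from two constituent models.
model_a = {"PER": 0.7, "ORG": 0.2, "O": 0.1}
model_b = {"PER": 0.4, "ORG": 0.5, "O": 0.1}

pooled = log_opinion_pool([model_a, model_b], weights=[0.5, 0.5])
```

With equal weights the pool is a normalised geometric mean, so a label needs support from both models to score highly. Putting all the weight on one model recovers that model's distribution unchanged.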

/content/journals/10.1075/li.30.1.04smi
2007-01-01
2025-04-20
  • Article Type: Research Article