Conference papers

Handling Concept Drift in Text Data Stream Constrained by High Labelling Cost

Patrick Lindstrom, Technological University DublinFollow
Sarah Jane Delany, Technological University DublinFollow
Brian Mac Namee, Technological University DublinFollow

Document Type

Conference Paper

Rights

Available under a Creative Commons Attribution Non-Commercial Share Alike 4.0 International Licence

Disciplines

Computer Sciences

Publication Details

The paper was presented at Florida Artificial Intelligence Research Society Conference (FLAIRS) 2010, http://www.flairs-23.info

Abstract

In many real-world classification problems the concept being modelled is not static but rather changes over time - a situation known as concept drift. Most techniques for handling concept drift rely on the true classifications of test instances being available shortly after classification so that classifiers can be retrained to handle the drift. However, in applications where labelling instances with their true class has a high cost this is not reasonable. In this paper we present an approach for keeping a classifier up-to-date in a concept drift domain which is constrained by a high cost of labelling. We use an active learning type approach to select those examples for labelling that are most useful in handling changes in concept. We show how this approach can adequately handle concept drift in a text filtering scenario requiring just 15% of the documents to be manually categorised and labelled.

DOI

https://doi.org/10.21427/D7B022

Recommended Citation

Lindstrom, Patrick et al. (2010) Handling Concept Drift in Text Data Stream Constrained by High Labelling Cost. Florida Artificial Intelligence Research Society Conference (FLAIRS). Florida, 19-21, May. doi:10.21427/D7B022

Funder

ABBEST

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Conference papers

Handling Concept Drift in Text Data Stream Constrained by High Labelling Cost

Document Type

Rights

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Funder

Included in

Search

Browse

Author Corner

Links

Conference papers

Handling Concept Drift in Text Data Stream Constrained by High Labelling Cost

Authors

Document Type

Rights

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Funder

Included in

Share

Search

Browse

Author Corner

Links