Conference papers

How Short is a Piece of String?: the Impact of Text Length and Text Augmentation on Short-text Classification Accuracy

Austin McCartney, Technological University DublinFollow
Svetlana Hensman, Technological University DublinFollow
Luca Longo, Technological University DublinFollow

Document Type

Conference Paper

Rights

Available under a Creative Commons Attribution Non-Commercial Share Alike 4.0 International Licence

Disciplines

Computer Sciences

Publication Details

https://www.semanticscholar.org/paper/How-Short-is-a-Piece-of-String-%3A-The-Impact-of-Text-Mccartney-Hensman/61165f6fdf445a2706578ba950202cde9a03cfb6

Abstract

Recent increases in the use and availability of short messages have created opportunities to harvest vast amounts of information through machine-based classification. However, traditional classification methods have failed to yield accuracies comparable to classification accuracies on longer texts. Several approaches have previously been employed to extend traditional methods to overcome this problem, including the enhancement of the original texts through the construction of associations with external data supplementation sources. Existing literature does not precisely describe the impact of text length on classification performance. This work quantitatively examines the changes in accuracy of a small selection of classifiers using a variety of enhancement methods, as text length progressively decreases. Findings, based on ANOVA testing at a 95% confidence interval, suggest that the performance of classifiers using simple enhancements decreases with decreasing text length, but that the use of more sophisticated enhancements risks over-supplementation of the text and consequent concept drift and classification performance decrease as text length increases.

DOI

https://doi.org/10.21427/D7151M

Recommended Citation

McCartney, A., Hensman, S. & Longo, L. (2017). How short is a piece of string: the impact of text length and text augmentation on short-text classification accuracy. Proceedings of the 25th Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2017), 7-8 December, Dublin, Ireland.

Download

Included in

Computer Sciences Commons

COinS

Conference papers

How Short is a Piece of String?: the Impact of Text Length and Text Augmentation on Short-text Classification Accuracy

Document Type

Rights

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Included in

Search

Browse

Author Corner

Links

Conference papers

How Short is a Piece of String?: the Impact of Text Length and Text Augmentation on Short-text Classification Accuracy

Authors

Document Type

Rights

Disciplines

Publication Details

Abstract

DOI

Recommended Citation

Included in

Share

Search

Browse

Author Corner

Links