The Combination of Text Classifiers Using Reliability Indicators

Paul N. Bennett, Susan T. Dumais, Eric Horvitz

Abstract:

The intuition that different text classifiers behave in qualitatively different ways has long motivated attempts to build a better metaclassifier via some combination of classifiers. We introduce a probabilistic method for combining classifiers that considers the context-sensitive reliabilities of contributing classifiers. The method harnesses reliability indicators: variables that provide signals about the performance of classifiers in different situations. We provide background, present procedures for building metaclassifiers that take into consideration both reliability indicators and classifier outputs, and review a set of comparative studies undertaken to evaluate the methodology.

Keywords: Text classification, classifier combination, metaclassifiers, feature selection, reliability indicators.

In: P. N. Bennett, S. T. Dumais, and E. Horvitz. The Combination of Text Classifiers Using Reliability Indicators. Information Retrieval.

Author Email: pbennett+www@cs.cmu.edu, sdumais@microsoft.com, horvitz@microsoft.com