Discretization of Ordinal Attributes and Feature Selection

No Thumbnail Available
Date
1995-04-01T00:00:00Z
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The performance of classification algorithms may deteriorate due to irrelevant attributes. Numeric attributes make the situation worse, since many classification algorithms require that the training data contain only discrete attributes. Discretization can turn numeric attributes into discrete ones. Feature selection can eliminate some irrelevant attributes. This paper describes Chi2, a simple and general algorithm that uses the $\chi^2$ statistic to discretize numeric attributes repeatedly until some inconsistencies are found in the data, and achieves feature selection via discretization. In addition, it can handle mixed attributes and multi-class data naturally. %data, and achieves feature selection and discretization in one go. Experiments are conducted on the real and synthetic data sets. The empirical results demonstrate that Chi2 is effective in feature selection and discretization of numeric and ordinal attributes.
Description
Keywords
Citation