Discriminative Sparse Neighbor Approximation for Imbalanced Learning

2016-02-03
1602.01197 | cs.CV
Data imbalance is common in many vision tasks where one or more classes are rare. Without addressing this issue, conventional methods tend to be biased toward the majority class with poor predictive accuracy for the minority class. These methods further deteriorate on small, imbalanced data that has a large degree of class overlap. In this study, we propose a novel discriminative sparse neighbor approximation (DSNA) method to ameliorate the effect of class imbalance during prediction. Specifically, given a test sample, we first traverse it through a cost-sensitive decision forest to collect a good subset of training examples in its local neighborhood. Then we generate from this subset several class-discriminating but overlapping clusters and model each as an affine subspace. From these subspaces, the proposed DSNA iteratively seeks an optimal approximation of the test sample and outputs an unbiased prediction. We show that our method not only effectively mitigates the imbalance issue, but also allows the prediction to extrapolate to unseen data. The latter capability is crucial for achieving accurate prediction on small datasets with limited samples. The proposed imbalanced learning method can be applied to both classification and regression tasks at a wide range of imbalance levels. It significantly outperforms the state-of-the-art methods that do not possess an imbalance handling mechanism, and is found to perform comparably or even better than recent deep learning methods while using only hand-crafted features.
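The affine-subspace idea in the abstract can be illustrated with a much simpler stand-in. The sketch below is not the DSNA algorithm: it omits the cost-sensitive decision forest and the iterative sparse approximation, and simply fits one affine subspace per cluster and predicts the label of the closest one. The function names (`fit_affine_subspace`, `residual`, `predict`), the `dim` parameter, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def fit_affine_subspace(points, dim):
    """Fit a dim-dimensional affine subspace; return (mean, orthonormal basis)."""
    points = np.asarray(points, dtype=float)
    mean = points.mean(axis=0)
    # Right singular vectors of the centered data give the principal directions.
    _, _, vt = np.linalg.svd(points - mean, full_matrices=False)
    return mean, vt[:dim].T  # basis has shape (n_features, dim)


def residual(x, mean, basis):
    """Euclidean distance from x to the affine subspace mean + span(basis)."""
    centered = np.asarray(x, dtype=float) - mean
    projection = basis @ (basis.T @ centered)
    return float(np.linalg.norm(centered - projection))


def predict(x, clusters, dim=1):
    """clusters is a list of (label, points); return the label of the closest subspace."""
    scored = [
        (residual(x, *fit_affine_subspace(points, dim)), label)
        for label, points in clusters
    ]
    return min(scored, key=lambda pair: pair[0])[1]


# Hypothetical toy data: four majority samples, only two minority samples.
clusters = [
    ("majority", [[0, 0], [1, 0], [2, 0], [3, 0]]),
    ("minority", [[0, 5], [1, 6]]),
]

# The query lies well outside the minority samples but on the line through them.
query = [4, 9]
label = predict(query, clusters)
```

In this toy setup the minority subspace is fitted from only two samples, yet the query is matched to it because distance is measured to the whole affine subspace rather than to individual training points. That is one plausible reading of how subspace models allow extrapolation beyond the observed samples, and the class sizes play no role in the comparison.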

Related Articles

2018-02-25

Novelty detection is the process of identifying the observation(s) that differ in some respect from …

2017-06-03

In many real-world scenarios, labeled data for a specific machine learning task is costly to obtain.…

2019-04-24
1904.11093 | cs.CV

We present a transductive deep learning-based formulation for the sparse representation-based classi…

2017-05-01

This paper presents a new supervised classification algorithm for remotely sensed hyperspectral imag…

2015-06-16

In histopathological image analysis, feature extraction for classification is a challenging task due…

2018-08-29

Deep learning has revolutionized the performance of classification, but meanwhile demands sufficient…

2018-09-13

In the real world, a learning system could receive an input that looks nothing like anything it has …

2018-06-01

Data for face analysis often exhibit highly-skewed class distribution, i.e., most data belong to a f…

2017-11-02

Learning from class-imbalanced data continues to be a common and challenging problem in supervised l…

2017-12-08
1712.03162 | cs.CV

Recognising detailed facial or clothing attributes in images of people is a challenging task for com…

2019-04-08

Noisy labels often occur in vision datasets, especially when they are issued from crowdsourcing or W…

2015-05-07
1505.01658 | cs.LG

Many real world data mining applications involve obtaining predictive models using data sets with st…

2016-11-06

Convolutional Neural Networks (ConvNets) have achieved excellent recognition performance in various …

2014-03-13
1403.3378 | stat.ML

The vast majority of real world classification problems are imbalanced, meaning there are far fewer …

2017-04-25
1704.07657 | cs.LG

Various modifications of decision trees have been extensively used during the past years due to thei…