References

SMOTE: Synthetic Minority Over-sampling Technique

Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, & W. Philip Kegelmeyer (2002)

Journal of Artificial Intelligence Research.

DOI: https://doi.org/10.1613/jair.953

Abstract. Introduces the Synthetic Minority Over-sampling Technique (SMOTE), the standard method for handling class imbalance in tabular classification. For each minority example, SMOTE selects a $k$-nearest minority neighbour and synthesises a new example along the line segment between them, sampling without exact duplication. SMOTE substantially outperforms naive minority oversampling and reweighting on imbalanced benchmarks. Two decades on, SMOTE and its variants (Borderline-SMOTE, ADASYN) remain the default class-imbalance baseline in imbalanced-learn.

Tags: classification imbalanced-data

Cited in:

This site is currently in Beta. Contact: Chris Paton

Textbook of Usability · Textbook of Digital Health

Auckland Maths and Science Tutoring

AI tools used: Claude (research, coding, text), ChatGPT (diagrams, images), Grammarly (editing).