Satellite dataset

Dataset information

The original Statlog (Landsat Satellite) dataset from UCI machine learning repository is a multi-class classification dataset. Here, the training and test data are combined. The smallest three classes, i.e. 2, 4, 5 are combined to form the outliers class, while all the other classes are combined to form an inlier class. 

Source (citation)

Liu, Fei Tony, Kai Ming Ting, and Zhi-Hua Zhou. “Isolation forest.2008 Eighth IEEE International Conference on Data Mining. IEEE, 2008.

K. M. Ting, J. T. S. Chuan, and F. T. Liu. “Mass: A New Ranking Measure for Anomaly Detection.“, IEEE Transactions on Knowledge and Data Engineering, 2009.

Downloads

File: satellite.mat

Description: X = Multi-dimensional point data, y = labels (1 = outliers, 0 = inliers)