Skip to content
Menu
  • Outlier Detection DataSets (ODDS)
  • About ODDS
  • Search
  • Outlier Detection DataSets (ODDS)
  • About ODDS
ODDS

Lymphography dataset

Dataset information

The original lymphography dataset from UCI machine learning repository is a classification dataset. It is a multi-class dataset having four classes, but two of them are quite small (2 and 4 data records). Therefore, those two small classes are merged and considered as outliers compared to other two large classes (81 and 61 data records).

Source (citation)

A. Lazarevic and V. Kumar, “Feature bagging for outlier detection.” in ACM SIGKDD, 2005, pp. 157–166.

A. Zimek, M. Gaudet, R. J. Campello, and J. Sander, “Subsampling for efficient and effective unsupervised outlier detection ensembles.” in ACM SIGKDD, 2013, pp. 428–436.

C. C. Aggarwal and S. Sathe, “Theoretical foundations and algorithms for outlier ensembles.” ACM SIGKDD Explorations Newsletter, vol. 17, no. 1, pp. 24–47, 2015.Downloads

Downloads

File: lympho.mat

Description: X = multi-dimensional point data, y = labels (1 = outliers, 0 = inliers)

 

 

 

 

 

 

Archives

Categories

  • No categories
  • Outlier Detection DataSets (ODDS)
  • About ODDS

Copyright © 2023 ODDS. All Rights Reserved.

Codilight Theme by FameThemes