ODDS

Pima Indians Diabetes dataset

Dataset Information

The original Pima Indians Diabetes dataset from the UCI Machine Learning Repository is a binary classification dataset. Its instances were selected from a larger database under several constraints; in particular, all patients are females at least 21 years old of Pima Indian heritage. The dataset is used here as is, without modification, from the UCI repository.

Source (citation)

Liu, Fei Tony, Kai Ming Ting, and Zhi-Hua Zhou. “Isolation forest.” 2008 Eighth IEEE International Conference on Data Mining. IEEE, 2008.

K. M. Ting, J. T. S. Chuan, and F. T. Liu. “Mass: A New Ranking Measure for Anomaly Detection.” IEEE Transactions on Knowledge and Data Engineering, 2009.

F. Keller, E. Müller, and K. Böhm. “HiCS: High-contrast subspaces for density-based outlier ranking.” ICDE, 2012.

Downloads

File: pima.mat

Description: X = Multi-dimensional point data, y = labels (1 = outliers, 0 = inliers)
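
As a rough illustration of the description above, the following Python sketch reads the file and separates the point data from the labels. It assumes that pima.mat is stored in a MATLAB format that scipy.io.loadmat can read and that the variables are named X and y as described; the helper names load_odds_mat and outlier_fraction are hypothetical and not part of ODDS.

```python
import os

import numpy as np
from scipy.io import loadmat


def load_odds_mat(path):
    """Load an ODDS-style .mat file and return (X, y).

    Assumes the file holds a matrix "X" (one row per point) and a
    label vector "y" (1 = outlier, 0 = inlier), as the description says.
    """
    data = loadmat(path)
    X = np.asarray(data["X"], dtype=float)
    # Labels are typically stored as a column vector; flatten to 1-D.
    y = np.asarray(data["y"]).ravel().astype(int)
    if X.shape[0] != y.shape[0]:
        raise ValueError("X and y have different numbers of rows")
    return X, y


def outlier_fraction(y):
    """Return the share of points labelled as outliers (label 1)."""
    return float(np.mean(y == 1))


if __name__ == "__main__":
    # "pima.mat" is assumed to have been downloaded into the working directory.
    if os.path.exists("pima.mat"):
        X, y = load_odds_mat("pima.mat")
        print("points:", X.shape[0], "dimensions:", X.shape[1])
        print("outlier fraction:", outlier_fraction(y))
```

If loadmat rejects the file because it was saved in a newer HDF5-based MATLAB format, a different reader would be needed; that case is not handled here.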

Copyright © 2023 ODDS. All Rights Reserved.
