Pregnancies: Number of times pregnant
Glucose: Plasma glucose concentration after 2 hours in an oral glucose tolerance test
BloodPressure: Diastolic blood pressure (mm Hg)
SkinThickness: Triceps skin fold thickness (mm)
Insulin: 2-hour serum insulin (mu U/ml)
BMI: Body mass index (weight in kg/(height in m²))
DiabetesPedigreeFunction: A function that scores the likelihood of diabetes based on family history
Age: Age in years
768 female patient records from this dataset.
65.1% non-diabetic
34.9% diabetic
Missing or Zero Values by Feature
In the Pima Indians Diabetes dataset, certain features contain implausible zero values, which are considered missing or invalid since they are not physiologically possible (e.g., zero insulin levels or skin thickness). Below is a summary of such values: