Search this site
Embedded Files
Skip to main content
Skip to navigation
GROUP 3
Home
Dataset
EDA
Preprocessing
Clustering
Classification
Testing
Conclusion
GROUP 3
Home
Dataset
EDA
Preprocessing
Clustering
Classification
Testing
Conclusion
More
Home
Dataset
EDA
Preprocessing
Clustering
Classification
Testing
Conclusion
Conclusion
Summary of Findings
Dataset: 768 female patients, 8 health features
Key predictors: Glucose, Insulin, BMI
Random Forest
selected as best model:
Accuracy: 88.67%
Recall: 0.922 (best for catching true diabetic cases)
Strong F1 score and AUC
Clinical Relevance of the Model
Identifies
critical health markers
for early intervention
Can assist doctors in
risk screening
and
prioritization
Deployable in
health apps
or
clinic decision systems
Limitations & Future Improvements
Limitations:
Small dataset
which only includes female patients
Missing
behavioral/lifestyle
data
Needs validation on diverse populations
Further enhancement :
Add
interpretable ML
(e.g., SHAP, LIME)
Explore
deep learning
for richer feature patterns
OUR PROJECT MATERIALS:
CLASSIFICATION NOTEBOOK:
[https://colab.research.google.com/drive/1XHUWHskALSlictuN7_NW50nronkajl27?usp=sharing]
CLUSTERING NOTEBOOK:
[https://colab.research.google.com/drive/1mK8jDv7Bllib64lZBgYuj-jF4icvISwH?usp=sharing]
USER TESTING WEBSITE:
[https://diabetes-prediction-exkn.onrender.com/#]
Google Sites
Report abuse
Page details
Page updated
Google Sites
Report abuse