A comprehensive data science project assessing risk factors and predicting bone fractures in women with osteoporosis using various statistical models on the glow_bonemed dataset.
Collaboration: Waleed Amer & Nolan Dulude | Data Science Masters Program
Assess risk factors and predict bone fractures in women with osteoporosis using various statistical models. Compare model performance to identify the best predictive approach for clinical decision-making.
Analysis of the glow_bonemed dataset containing clinical variables including BMI, age, prior fractures, bone medications, and demographic factors to predict first-year fracture risk.
Model | Sensitivity | Specificity | PPV | NPV | AUROC |
---|---|---|---|---|---|
CV Interaction Model | 0.8027 | 0.5120 | 0.8315 | 0.4638 | 0.7463 |
QDA | 0.9867 | 0.6800 | 0.9024 | 0.9444 | 0.8333 |
Random Forest | 1.00 | 0.96 | 0.9868 | 1.00 | 0.9800 |
The Random Forest model achieved the highest AUROC of 0.98, significantly outperforming simpler approaches. The simple logistic regression showed moderate predictive ability (AUROC 0.57), while complex models with interactions and QDA showed progressive improvement.
AUROC and sensitivity are crucial metrics in clinical contexts - AUROC provides balanced performance assessment while high sensitivity minimizes false negatives, ensuring high-risk patients are identified for preventive interventions.
With additional time, the project would benefit from advanced feature engineering, external validation on diverse populations, and exploration of deep learning approaches for enhanced predictive accuracy.
© 2024 Waleed Amer. All rights reserved.