Heart Failure Risk Prediction Model

Heart Failure Risk Prediction Model

Academic • Machine Learning/Healthcare

About The Project

Built a full ML workflow on the Heart Failure Prediction dataset (Kaggle): data cleaning (invalid RestingBP/Cholesterol removal), feature scaling, label/one-hot encoding, and PCA for dimensionality reduction/visualization. Trained and compared Random Forest, KNN, and a PyTorch NN with stratified K-fold CV and early stopping; emphasized reproducible evaluation and feature importance.


Achievements

  • Random Forest reached 89.33% accuracy; mean ROC-AUC ≈ 0.929 (stratified CV)

  • Implemented PCA to analyze class separability and guide model selection

  • Built a clean, modular pipeline for training, tuning, and validation across models

Links

sunny.sunho.park@gmail.com

unsplash.com/@reddfrancisco
unsplash.com/@reddfrancisco