MY PROJECTS

Early Sepsis Prediction in ICU

Machine Learning & Healthcare Analytics

Built predictive models using ICU patient data from the PhysioNet 2019 Challenge to detect sepsis 6 hours before onset. Trained 13 classifiers, using SMOTE and random undersampling to address class imbalance; XGBoost performed best (recall = 0.84, accuracy = 0.86).

Skills and Tools

Python, scikit-learn, XGBoost, SMOTE, Random Undersampling, Cross-Validation, SHAP, Data Analysis, Model Evaluation, Healthcare Analytics
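
The resampling and evaluation steps described above can be illustrated with a minimal standard-library sketch. It is not the project's actual pipeline: the function names and the toy data are hypothetical, and it covers only random undersampling and recall, not SMOTE or XGBoost.

```python
import random


def random_undersample(features, labels, seed=0):
    """Randomly drop majority-class rows until both classes have equal size."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    # Keep every minority row plus an equally sized random majority sample.
    kept = sorted(minority + rng.sample(majority, len(minority)))
    return [features[i] for i in kept], [labels[i] for i in kept]


def recall(y_true, y_pred):
    """Share of actual positives that the model flagged: TP / (TP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn) if (tp + fn) else 0.0


# Hypothetical toy data: 5 positive rows among 100.
toy_labels = [1] * 5 + [0] * 95
toy_features = list(range(100))
balanced_x, balanced_y = random_undersample(toy_features, toy_labels)
```

Recall is the natural headline metric for a rare-event problem like this one, because a model that never predicts the positive class can still score high accuracy on heavily imbalanced data.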

Investment Portfolio Optimization

Predictive & Prescriptive Analytics

Forecasted stock prices and optimized portfolio allocation across Amazon, AMD, Cisco, Netflix, and Apple to achieve a target return of 5% while minimizing risk. Applied descriptive analytics for historical trend analysis, predictive analytics using the AutoReg model, and prescriptive analytics for portfolio weight optimization.

Skills and Tools

Python, NumPy, pandas, scikit-learn, AutoReg (Time Series Forecasting), Portfolio Optimization, Risk Assessment, Data Visualization (Matplotlib, Seaborn), Financial Analytics
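
The prescriptive step, choosing weights that hit a target return at minimum risk, can be sketched as a small constrained optimization. This is one possible formulation and not necessarily the one used in the project: it assumes variance as the risk measure, permits negative weights (short positions), and uses made-up return and covariance figures. The function name `min_variance_weights` is hypothetical.

```python
import numpy as np


def min_variance_weights(mean_returns, cov, target_return):
    """Minimize w' C w subject to w' mu = target_return and sum(w) = 1.

    Solves the KKT linear system of the equality-constrained problem.
    Weights may be negative because no long-only constraint is imposed.
    """
    mu = np.asarray(mean_returns, dtype=float)
    sigma = np.asarray(cov, dtype=float)
    n = mu.size
    ones = np.ones(n)

    kkt = np.zeros((n + 2, n + 2))
    kkt[:n, :n] = 2.0 * sigma   # gradient of the variance term
    kkt[:n, n] = mu             # multiplier for the return constraint
    kkt[:n, n + 1] = ones       # multiplier for the budget constraint
    kkt[n, :n] = mu
    kkt[n + 1, :n] = ones

    rhs = np.zeros(n + 2)
    rhs[n] = target_return
    rhs[n + 1] = 1.0
    return np.linalg.solve(kkt, rhs)[:n]


# Hypothetical inputs for three assets (not the project's data).
mu = [0.04, 0.06, 0.08]
cov = [[0.10, 0.02, 0.01],
       [0.02, 0.08, 0.03],
       [0.01, 0.03, 0.12]]
weights = min_variance_weights(mu, cov, target_return=0.05)
```

A long-only portfolio would need an inequality-constrained solver instead of this closed-form system.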

FoodHub Order Analysis

Python

A food aggregator company stores data on the orders placed by registered customers through its online portal. This project analyzes that data to draw actionable insights for the business.

Skills and Tools

Exploratory Data Analysis (Variable Identification, Univariate Analysis, Bivariate Analysis), Python.

E-news Express Project

Business Statistics

This project used statistical analysis, A/B testing, and visualization to determine whether the new landing page of an online news portal (E-news Express) is effective at attracting new subscribers.

Skills and Tools

Hypothesis Testing, A/B Testing, Data Visualization, Statistical Inference.
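
A common way to frame this kind of A/B comparison is a one-sided two-proportion z-test on conversion rates. The sketch below assumes that framing, which may differ from the tests actually run in the project. The function name and the counts are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(conv_new, n_new, conv_old, n_old):
    """One-sided z-test of H1: conversion rate of the new page > old page.

    Returns the z statistic and the p-value under a pooled-variance null.
    """
    p_new = conv_new / n_new
    p_old = conv_old / n_old
    pooled = (conv_new + conv_old) / (n_new + n_old)
    std_err = sqrt(pooled * (1 - pooled) * (1 / n_new + 1 / n_old))
    z = (p_new - p_old) / std_err
    p_value = 1 - NormalDist().cdf(z)  # upper-tail probability
    return z, p_value


# Hypothetical counts: 60 of 200 visitors converted on the new page,
# 40 of 200 on the old page.
z_stat, p_val = two_proportion_z_test(60, 200, 40, 200)
```

A p-value below the chosen significance level (0.05 is a common choice) would support the claim that the new page converts better.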

ReCell

Supervised Learning

Analyzed a used-devices dataset and built a model to support a dynamic pricing strategy for used and refurbished devices, identifying the factors that significantly influence price.

Skills and Tools

EDA, Linear Regression, Linear Regression Assumptions, Business Insights and Recommendations.

INN Hotels

Supervised Learning - Classification

Analyzed INN Hotels booking data to find the factors that most strongly influence cancellations, and built a model that predicts in advance which bookings will be canceled.

Skills and Tools

EDA, Data Preprocessing, Logistic Regression, Multicollinearity, Finding the Optimal Threshold Using the AUC-ROC Curve, Decision Trees.
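
Picking a classification threshold from the ROC curve can be sketched as follows. The source does not say which criterion was used, so this example assumes Youden's J statistic (true positive rate minus false positive rate). The function name and the scores are hypothetical.

```python
def best_threshold(y_true, scores):
    """Return the (threshold, J) pair that maximizes J = TPR - FPR.

    A row is predicted positive when its score is >= the threshold.
    Assumes y_true contains at least one 0 and at least one 1.
    """
    positives = sum(y_true)
    negatives = len(y_true) - positives
    best_t, best_j = None, -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= t)
        fp = sum(1 for y, s in zip(y_true, scores) if y == 0 and s >= t)
        j = tp / positives - fp / negatives
        if j > best_j:  # on ties, the lowest threshold is kept
            best_t, best_j = t, j
    return best_t, best_j


# Hypothetical predicted cancellation probabilities for eight bookings.
labels = [0, 0, 0, 1, 0, 1, 1, 1]
probs = [0.10, 0.20, 0.30, 0.35, 0.40, 0.60, 0.70, 0.90]
threshold, j_stat = best_threshold(labels, probs)
```

Other criteria, such as a cost-weighted trade-off between missed cancellations and false alarms, would lead to a different threshold on the same curve.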

Easy Visa

Ensemble Techniques

Analyzed data on visa applicants, built a predictive model to facilitate the visa approval process, and recommended applicant profiles for which a visa should be certified or denied.

Skills and Tools

EDA, Data Preprocessing, Customer Profiling, Bagging Classifiers, Boosting Classifiers, Stacking Classifier, Hyperparameter Tuning.

ReneWind

Model Tuning

"ReneWind" is a company that uses machine learning to improve the machinery and processes involved in wind energy production. It has collected sensor data on generator failures in wind turbines.

Skills and Tools

Upsampling and Downsampling, Regularization, Hyperparameter Tuning.

Travel Booking System

Database Management System

Designed and implemented a relational database for a simulated travel booking platform where users can book flights, hotels, rental cars, and services. Created a normalized schema and ER diagram to ensure data integrity and support reporting and analytics.

Skills and Tools

MySQL, SQL, Relational Schema Design, ER Diagram, Data Normalization, Draw.io / Visio
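
How a normalized schema enforces data integrity can be shown with a two-table sketch. The project itself used MySQL; this example uses Python's built-in `sqlite3` module so that it runs without a database server. The table and column names are hypothetical and far smaller than a full booking schema.

```python
import sqlite3


def create_schema(conn):
    """Create a minimal users/bookings schema with integrity constraints."""
    # SQLite enforces foreign keys only when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(
        """
        CREATE TABLE users (
            user_id INTEGER PRIMARY KEY,
            email   TEXT NOT NULL UNIQUE
        );
        CREATE TABLE bookings (
            booking_id   INTEGER PRIMARY KEY,
            user_id      INTEGER NOT NULL REFERENCES users(user_id),
            booking_type TEXT NOT NULL
                CHECK (booking_type IN ('flight', 'hotel', 'car', 'service')),
            total_price  REAL NOT NULL CHECK (total_price >= 0)
        );
        """
    )


conn = sqlite3.connect(":memory:")
create_schema(conn)
conn.execute("INSERT INTO users (user_id, email) VALUES (1, 'a@example.com')")
conn.execute(
    "INSERT INTO bookings (user_id, booking_type, total_price) "
    "VALUES (1, 'flight', 250.0)"
)
```

Because user details live only in `users`, a booking cannot reference a user who does not exist; inserting one raises `sqlite3.IntegrityError`.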

CIA World Factbook Analysis

SQL & Data Analysis

Analyzed the CIA World Factbook database to extract insights about countries, populations, and global demographics using SQL queries and data visualization techniques.

Skills and Tools

SQL, Data Analysis, Data Visualization, Statistical Analysis, Demographic Research
