MY PROJECTS
Early Sepsis Prediction in ICU
Machine Learning & Healthcare Analytics
Built predictive models using ICU patient data from the PhysioNet 2019 Challenge to detect sepsis 6 hours before onset. Applied 13 classifiers with SMOTE and undersampling to address class imbalance, achieving the best performance with XGBoost (Recall = 0.84, Accuracy = 0.86).
Skills and Tools
Python, scikit-learn, XGBoost, SMOTE, Random Undersampling, Cross-Validation, SHAP, Data Analysis, Model Evaluation, Healthcare Analytics
Investment Portfolio Optimization
Predictive & Prescriptive Analytics
Forecasted stock prices and optimized portfolio allocation across Amazon, AMD, Cisco, Netflix, and Apple to achieve a target return of 5% while minimizing risk. Applied descriptive analytics for historical trend analysis, predictive analytics using the AutoReg model, and prescriptive analytics for portfolio weight optimization.
Skills and Tools
Python, NumPy, Pandas, Scikit-learn, AutoReg (Time Series Forecasting), Portfolio Optimization, Risk Assessment, Data Visualization (Matplotlib, Seaborn), Financial Analytics
FoodHub Order Analysis using Python
Python
The food aggregator company has stored the data of the different orders made by the registered customers in their online portal. They want to analyze the data to draw some actionable insights for the business.
Skills and Tools
Exploratory Data Analysis (Variable Identification, Univariate analysis, Bi-Variate analysis), Python.
E-news Express Project
Business Statistics
This project used statistical analysis, a/b testing, and visualization to decide whether the new landing page of an online news portal (E-news Express) is effective enough to gather new subscribers or not.
Skills and Tools
Hypothesis Testing, a/b testing, Data Visualization, Statistical Inference.
ReCell
Supervised Learning
Analyze the used devices dataset, build a model which will help develop a dynamic pricing strategy for used and refurbished devices, and identify factors that significantly influence the price.
Skills and Tools
EDA, Linear Regression, Linear Regression Assumptions, Business insights and Recommendations.
INN Hotels
Supervised Learning - Classification
Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance.
Skills and Tools
EDA, Data Pre-processing, Logistic regression, Multicollinearity, Finding optimal threshold using AUC-ROC curve, Decision trees.
Easy Visa
Ensemble Techniques
Analyze the data of Visa applicants, build a predictive model to facilitate the process of visa approvals, and recommend a suitable profile for the applicants for whom the visa should be certified or denied.
Skills and Tools
EDA, Data Preprocessing, Customer Profiling, Bagging Classifiers, Boosting Classifier, Stacking Classifier, Hyperparameter Tuning.
ReneWind
Model Tuning
"ReneWind" is a company working on improving the machinery/processes involved in the production of wind energy using machine learning and has collected data of generator failure of wind turbines using sensors.
Skills and Tools
Up and downsampling, Regularization, Hyperparameter tuning.
Travel Booking System
Database Management System
Designed and implemented a relational database for a simulated travel booking platform where users can book flights, hotels, rental cars, and services. Created a normalized schema and ER diagram to ensure data integrity and support reporting and analytics.
Skills and Tools
MySQL, SQL, Relational Schema Design, ER Diagram, Data Normalization, Draw.io / Visio
CIA World Factbook Analysis
SQL & Data Analysis
Analyzed the CIA World Factbook database to extract insights about countries, populations, and global demographics using SQL queries and data visualization techniques.
Skills and Tools
SQL, Data Analysis, Data Visualization, Statistical Analysis, Demographic Research