Developer - Cici Chang

Movies Recommendation System
Web App

This project implements a movie recommender system using Streamlit and content-based filtering. Users can enter a movie title and receive recommendations for similar movies. It was deployed to Heroku.

Dynamic Vaccine Allocation
for Control of Human-Transmissible Disease

During pandemics, such as COVID-19, supplies of vaccines can be insufficient for meeting all needs. Our study develops a dynamic methodology for vaccine allocation by region, age, and timeframe using a time-sensitive model. Our findings estimate that approximately 1.8 million cases and 9 thousand deaths could have been averted in the U.S. with an improved allocation. While applied to COVID-19, our approach generalizes to other human-transmissible diseases for future epidemics.

Real Time Pose Analysis
and Tracking Application

Developed "Swoleboi," a sophisticated real-time exercise pose analysis and tracking application, utilizing Python and an array of libraries including Tkinter for the GUI, OpenCV for video capture and image processing, and MediaPipe for advanced pose estimation.

Github Repository

Web App for
Predicting Customer Churn using Streamlit

The web app appears in the browser for real-time churn prediction based on adjustable customer inputs. Customer features include tenure, promotions offered, etc.

Github Repository

Detecting Machine Failure
from IoT Sensors with a SQL Pipeline

This project uses an SQL pipeline to analyze real-world IoT sensor data from industrial machines. The goal is to detect early signs of machine failure so maintenance can be scheduled proactively before a critical breakdown occurs.

Github Repository

Martian Frost Detection
Using CNN and Transfer Learning in HiRISE Images

This project aims to develop classifiers to identify frost in Martian terrain images using the HiRISE dataset. It explores the effectiveness of a custom-built CNN+MLP model and compares it with transfer learning models (EfficientNetB0, ResNet50, and VGG16).

Breast Cancer Machine Learning Methods Comparison

This project focuses on comparing different machine learning methodologies, including supervised, semi-supervised, unsupervised, and active learning techniques. The analysis is conducted using Monte-Carlo simulations on two datasets: Breast Cancer Wisconsin Diagnostic and Banknote Authentication.

Advanced Analysis of Anuran Calls: Multi-class and Multi-label Classification
with SVM and K-Means Clustering

This project focuses on the advanced analysis of the Anuran Calls (MFCCs) Dataset, utilizing both classification and clustering techniques. The primary objectives include multi-class and multi-label classification using Support Vector Machines (SVMs) and K-Means clustering. The project explores different SVM approaches, such as Gaussian kernels and L1-penalized SVMs, and addresses class imbalance.

Advanced Analytics with Tree-Based Methods:
A Case Study on APS Failure Data

This project offers a comprehensive look at tree-based machine learning techniques, particularly focusing on the APS Failure dataset from Scania Trucks. A significant highlight of this notebook is the utilization of SMOTE (Synthetic Minority Over-sampling Technique) for addressing class imbalance, a common challenge in machine learning.

Modeling of Energy Output in Combined Cycle Power Plants
Using Linear and KNN Regression

This project presents a detailed analysis of the Combined Cycle Power Plant Data Set, covering the years 2006 to 2011. The goal is to predict the net hourly electrical energy output (EP) using key ambient variables like Temperature (T), Ambient Pressure (AP), Relative Humidity (RH), and Exhaust Vacuum (V). Employing a range of statistical and machine learning techniques, the project explores linear regression, multiple regression, polynomial regression, interaction term analysis, and KNN regression.

KNN-Based Classification of
Spinal Conditions in the Vertebral Column Data Set

This project is centered around the analysis and classification of the Vertebral Column Data Set, originally compiled by Dr. Henrique da Mota. The primary focus is on binary classification of spinal conditions into Normal (NO=0) and Abnormal (AB=1), utilizing biomechanical features from the pelvis and lumbar spine. The project encompasses data pre-processing, exploratory data analysis, and classification employing the K-Nearest Neighbors (KNN) algorithm.

Analysis on HR Employee Data
with Machine Learning Models

The HR employee data contains information like evaluations, promotions, satisfaction, and whether employees have left, which allows analysis of factors related to attrition. By modeling this data using different models, HR can predict if and when employees will leave based on their characteristics and make changes to improve retention.