Hayward Data Science Projects

Machine Learning: Supervised & Unsupervised, Principal Component Analysis Segmentation, and SQL

1

Probability distribution fitting; XGBoost and backward-elimination regression for feature selection; simple and multiple linear regression for modeling; Principal Component Analysis (PCA) for dimensionality reduction, followed by K-Means clustering; 3D matplotlib plotting; T-SQL for business analysis.
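As a rough illustration of the PCA-then-K-Means step named above, here is a minimal sketch using scikit-learn. The synthetic data, the choice of three components (convenient for a 3D plot), and the choice of three clusters are all assumptions made for the example; they are not taken from the project itself.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical data: 150 rows, 6 numeric features, drawn around 3 centers.
rng = np.random.default_rng(0)
centers = rng.normal(scale=5.0, size=(3, 6))
X = np.vstack([c + rng.normal(size=(50, 6)) for c in centers])

# Standardize so no single feature dominates the principal components.
X_scaled = StandardScaler().fit_transform(X)

# Reduce to 3 components (assumed here so the result can be plotted in 3D).
pca = PCA(n_components=3)
X_reduced = pca.fit_transform(X_scaled)

# Cluster in the reduced space; k=3 is an assumption for this toy data.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_reduced)

print("explained variance ratio:", pca.explained_variance_ratio_)
print("cluster sizes:", np.bincount(labels))
```

In practice the number of components and clusters would be chosen from the data, for example with an explained-variance threshold and an elbow or silhouette check.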

Machine Learning: Time Series Forecasts & Classify Labor Force

2

For forecasting supply chain production levels: ARIMA and Facebook Prophet models. For classifying 1975 labor force participation: logistic regression, XGBoost, and a Keras neural network classifier built on Google's TensorFlow.

A/B Testing, Causal Inference, & Unsupervised Machine Learning: K-Means Clustering

3

Statistical Analysis of A/B Test, Causal Inference, and Web Recommendation. Consumer engagement segmentation via K-Means Clustering.
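One common way to analyze an A/B test statistically is a two-proportion z-test on conversion rates. The sketch below assumes that setup; the function name `two_proportion_z_test` and the counts are hypothetical, and the project itself may have used a different test.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for a difference between two conversion rates.

    conv_a, conv_b: number of conversions in groups A and B.
    n_a, n_b: number of users in groups A and B.
    Returns (z statistic, two-sided p-value).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both groups share one rate.
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value: 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 200/1000 conversions in A, 250/1000 in B.
z, p = two_proportion_z_test(200, 1000, 250, 1000)
print(f"z = {z:.3f}, p = {p:.4f}")
```

A small p-value suggests the difference in rates is unlikely under the null hypothesis, though it says nothing on its own about causal mechanisms or practical significance.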

Machine Learning: Predict Insurance Losses and Loss Amount & Classify News Headlines

4

For predicting insurance loss claim amounts: XGBoost, support vector regression (polynomial kernel), standard linear regression, and Lasso regression. For predicting whether an insurance loss occurs: logistic regression and XGBoost. For classifying news headlines: Multinomial Naive Bayes.

What Makes a Playlist Great? Variety and Intimacy

5

Data science presentation on what makes a playlist great/successful. Includes: user research survey, exploratory data analysis, and standard linear regression.

Delivery Companies & Tipping: How Can We Increase the Tip Amount?

6

Analyzed tipping rates and other metrics for a delivery company. Found that one of the wealthiest towns in America had the lowest tipping rate. Recommended an A/B test to see whether displaying a short story about the delivery person at checkout might lead to more tips.

What Kinds of News Content Are Consumed Most on Facebook? Meaningful Video

Databases & Scripting: SQL in Python + Matplotlib

Data & Politics: Statistical Inferences from Polling Data

Hayward Weightloss