Recent Posts
Mountain Bike Categorization Analysis
For this post, I worked with Mike Czerwinski to determine whether the specifications of mountain bikes (MTB) are enough to differentiate between the different types of mountain bike categories.
read more
The Federalist Papers | NLP Analysis (Part 1)
When Alexander Hamilton, John Jay, and James Madison came together in support of the ratification of the Constitution, they created what has become one of the most celebrated series of political texts in history. The Federalist Papers, a series of 85 essays, helped push New Yorkers towards ratification and laid out some of the strongest arguments in favor of a strong federal government.
In part one of our analysis, we perform an Exploratory Data Analysis (EDA) using modern Natural Language Processing (NLP) techniques to better understand these essays.
read more
Polling Places | Exploratory Data Analysis
In a democracy, the polling booth is much more than a location where an unwitting citizen fills in a bubble on a piece of paper in a makeshift booth. It is a symbol of the promise of democracy, a connector between the citizen and the government. It is the shrine that, if protected and respected, powers a democracy.
This article analyzes that symbol of American democracy by taking an analytical look at a dataset of U.
read more