Back to Home

Beer Case Study

Overview

In this data analysis project, I worked with a partner to analyze beer market data to provide strategic insights for Budweiser. The goal was to understand brewing patterns, consumer preferences, and identify market opportunities through detailed examination of ABV (Alcohol By Volume) and IBU (International Bitterness Units) metrics across different states and beer styles.

Challenge

Budweiser wanted to gain competitive insights about the craft beer market to identify:

Approach

Our analysis included:

  1. Data Cleaning and Preparation: We addressed missing values in the dataset, particularly in the ABV and IBU columns. We implemented two different imputation methods:
    • Median imputation for IBU values where ABV was present
    • Linear regression-based imputation by leveraging the correlation between ABV and IBU
  2. Geographic Analysis: We mapped the brewery distribution across states and calculated median ABV and IBU values for each state to identify regional preferences. Colorado stood out with 47 breweries, while some states like DC and West Virginia had only one.

  3. Statistical Analysis: We examined the relationship between alcohol content and bitterness through correlation tests and visualization techniques including scatter plots, histograms, and density heat maps.

  4. Machine Learning: We employed K-Nearest Neighbors (KNN) classification to distinguish between IPAs and other Ales based on their ABV and IBU characteristics, running 1,000 iterations with different random seeds to ensure robust results.

Key Findings

Regional Patterns

Notable Statistics

Market Insights

Through density analysis, we identified two underserved market segments:

  1. Beers with approximately 5% ABV and IBU under 25
  2. Beers with ABV around 7.5% and IBU slightly under 75

Our machine learning model achieved 81% accuracy in distinguishing IPAs from other Ales using only ABV and IBU values, demonstrating that these two metrics are strong predictors of beer style. The model showed 73% sensitivity (correctly identifying IPAs) and 86.5% specificity (correctly identifying other ales).

Impact

This analysis provided Budweiser with actionable insights to:

Visualization Techniques

Throughout this project, we employed various visualization methods to extract insights:

Tools Used