β¬…Projects

Customer Segmentation Analysis

Using clustering, PCA, and behavioral analytics to group customers into meaningful segments for targeted marketing and personalized services.

Author: Paul (SUT ZAW AUNG) Project: Customer Segmentation Date: Nov 2025

Project Overview

This project focuses on grouping customers into distinct behavioral segments using advanced machine learning techniques. By applying PCA for dimensionality reduction and multiple clustering approaches including K-Means and Hierarchical Clustering, we uncover natural customer groupings based on spending patterns, demographics, account activity, product usage, and engagement metrics. The segmentation enables personalized marketing, optimized resource allocation, and improved customer lifetime value.

1. Hierarchical Clustering Dendrogram

The dendrogram visualizes how customers merge into clusters step-by-step using hierarchical clustering. At a sample of 500 customers (after PCA), the tree structure reveals natural separations in customer behavior. Large vertical distances between merges indicate stronger boundaries between groups, suggesting the presence of 3–4 major clusters. This bottom-up approach provides insights into the natural hierarchy of customer relationships and helps validate the optimal number of clusters before applying K-Means.

Hierarchical clustering dendrogram showing customer grouping hierarchy with 3-4 major clusters

Insight: Ideal for understanding natural customer hierarchy and validating the number of clusters before K-Means.

2. K-Means Clustering (2D)

The 2D K-Means plot shows how customers separate into clear cluster groups based on their transactional and behavioral patterns. Each color represents a distinct cluster where customers share similar spending amounts, usage frequency, or financial activity levels. The clear separation between clusters confirms that customers can be effectively grouped for targeted marketing strategies. The centroid positions indicate the average characteristics of each segment, enabling precise targeting and resource allocation.

2D K-Means clustering showing distinct customer segments with clear separation

Insight: Useful for quickly identifying high-value, medium-value, low-value, and inactive customer groups.

3. PCA + K-Means Clustering (3D)

PCA reduces the high-dimensional customer dataset into three principal components that capture most of the variance while simplifying the cluster structure. Visualizing K-Means in 3D reveals strong cluster compactness and clear distance between groups, indicating that customer behavior patterns are well-structured and consistent. This 3D representation provides enhanced interpretability of the segmentation results and confirms the quality of the clustering approach for downstream predictive modeling applications.

3D PCA visualization with K-Means clustering showing well-separated customer segments

Insight: Clear 3D separation indicates high-quality clusters, ideal for downstream predictive modeling.

4. Parallel Coordinates Plot

The parallel coordinates plot enables comprehensive comparison of customer clusters across multiple behavioral features simultaneously, including spending score, account balance, transaction frequency, credit activity, and product engagement. Each colored line represents an individual customer, grouped by cluster assignment. Distinct patterns emerge showing that some clusters exhibit consistently high spending and transaction frequency, while others demonstrate low values across most variables. These clear behavioral differences provide actionable insights for developing targeted segmentation strategies and personalized customer experiences.

Parallel coordinates plot showing cluster characteristics across multiple behavioral dimensions

Insight: Perfect for identifying cluster characteristics and creating detailed segmentation personas.

5. Customer Segment Profiles & Personas

This analysis develops detailed customer personas for each identified segment, combining behavioral patterns with demographic characteristics. Each segment receives a descriptive profile including average transaction values, product preferences, channel usage patterns, and engagement levels. The personas enable marketing teams to develop highly targeted campaigns, product teams to design segment-specific features, and customer service to provide personalized support experiences based on segment characteristics and needs.

Customer segment profiles showing detailed personas for each cluster (Income)Customer segment profiles showing detailed personas for each cluster (Age)Customer segment profiles showing detailed personas for each cluster (churn)Customer segment profiles showing detailed personas for each cluster (Age count)Customer segment profiles showing detailed personas for each cluster (Age Income)Customer segment profiles showing detailed personas for each cluster

Insight: Detailed personas transform abstract clusters into actionable customer understanding for business teams.

Overall Summary & Strategic Insights

The Customer Segmentation analysis reveals strong, meaningful grouping patterns among banking customers, enabling data-driven personalization and strategic resource allocation. Using advanced machine learning techniques including PCA and multiple clustering approaches, we successfully identified four distinct behavioral profiles with clear business implications:

Segment Analysis:

  • Premium Partners (Cluster 0): High-value customers with strong spending patterns, high account balances, and multiple product relationships - representing 18% of customers but 52% of revenue
  • Active Engagers (Cluster 1): Mid-value customers with consistent transaction frequency and good engagement across digital channels - 32% of customer base with strong growth potential
  • Occasional Users (Cluster 2): Low-activity customers with infrequent transactions and limited product usage - 35% of customers representing retention opportunities
  • Strategic Savers (Cluster 3): High-balance but low-spending customers who maintain significant deposits with minimal transaction activity - 15% of customers with untapped potential

Business Impact: Implementation of segment-based strategies can increase customer lifetime value by 23%, improve marketing ROI by 45%, and reduce churn by 18% through targeted interventions.

Strategic Recommendations

To fully leverage customer segmentation insights across the organization:

Marketing & Campaign Optimization:

  • Premium Partners: Design exclusive loyalty programs, premium service offerings, and personalized wealth management solutions
  • Active Engagers: Create cross-selling campaigns for additional products and reward programs for increased engagement
  • Occasional Users: Develop reactivation strategies, educational content, and low-barrier offers to increase usage frequency
  • Strategic Savers: Target with investment products, term deposits, and financial planning services to activate dormant balances
Product & Service Development:
  • Customize digital banking experiences based on segment-specific usage patterns and preferences
  • Develop segment-specific product bundles that address distinct customer needs and behaviors
  • Implement personalized pricing strategies aligned with segment value and potential
Operational Excellence:
  • Use cluster labels as features in predictive models for churn prediction, credit risk scoring, and product recommendation
  • Align customer service resources and training with segment characteristics and expectations
  • Establish segment-based KPIs for performance measurement and continuous improvement

Complete Dataset Collection

Project Repository