β¬…Projects

Fraud Detection Analysis

Exploratory analysis, geolocation intelligence, and machine learning to detect and prevent fraudulent transactions.

Author: Paul (SUT ZAW AUNG) Project: Fraud Detection Date: Nov 2025

Project Overview

This comprehensive fraud detection project investigates patterns of fraudulent transactions using advanced visual analytics and machine learning techniques. The analysis spans geographic hotspots, merchant vulnerabilities, temporal patterns, and behavioral clustering to build a multi-layered fraud prevention system. By combining supervised and unsupervised learning approaches, we identify high-risk transactions, merchants, and locations while providing actionable recommendations for real-time fraud prevention.

1. Fraud Intensity Map

This map provides geographic intelligence by showing the total monetary intensity of fraud across Myanmar. The central region displays an extremely high-value fraud cluster ($16–18M), suggesting targeted operations or compromised infrastructure. Yangon, Mandalay, and Naypyitaw also show high intensity, matching economic activity hubs. The visualization helps prioritize regional security resources and implement location-based risk scoring.

Fraud intensity map showing geographic distribution of fraudulent transaction amounts across Myanmar

Insight: Deploy specialized investigation teams to high-intensity clusters and apply location-based fraud prevention rules.

2. Transaction Frequency by Location

Yangon leads with over 8,000 transactions, significantly higher than any other region. Combining frequency with fraud rate helps distinguish between high-volume natural activity and suspicious spikes. Secondary cities like Mandalay, Naypyitaw, Monywa, and Myitkyina show consistent activity that may mask organized fraud patterns. This analysis enables velocity-based anomaly detection tailored to regional transaction patterns.

Transaction frequency by location showing Yangon dominance with over 8,000 transactions

Insight: Combine frequency, velocity, and location-based scoring for robust anomaly detection across different regions.

3. Top 10 Merchants by Fraud Rate

This chart identifies merchants with the highest concentration of fraudulent transactions. Aung San Market tops the list with a 53% fraud rate, indicating major security weaknesses or targeted coordinated attacks. High-risk merchants like City Mart, Myanmar Plaza, and Online Shop MM show fraud rates above 50%, likely caused by high transaction volume, weak user verification, or outdated PoS infrastructure. This merchant-level analysis enables targeted security interventions.

Top 10 merchants by fraud rate showing Aung San Market with 53% fraud rate

Insight: Prioritize security audits, enforce multi-factor payment verification, and deploy merchant-level fraud detection policies.

4. Top 15 Locations by Fraud Rate

Geographic patterns reveal that cities like Naypyitaw, Yangon, and Mandalay show fraud rates above 50%, making them national hotspots. These locations often have high economic activity, large transaction volumes, and higher digital payment usage, which create opportunities for attackers to blend fraudulent activity with legitimate traffic. The analysis distinguishes between high-volume legitimate activity and concentrated fraud patterns.

Top 15 locations by fraud rate showing urban centers with rates above 50%

Insight: Integrate geolocation-based risk scoring and implement stricter verification for transactions originating from red-zone locations.

5. Machine Learning Clusters β€” 3D PCA + K-Means

PCA reduces high-dimensional transaction data into three principal components. K-Means clustering reveals clear, non-overlapping groups where fraudulent transactions cluster together. This separation confirms that fraudulent behavior has strong patterns, making it suitable for ML-based detection. Each cluster represents unique fraud signatures, such as transaction velocity, amount patterns, or merchant behavior. The 3D visualization demonstrates effective feature separation for fraud classification.

3D PCA visualization showing distinct clustering of fraudulent transactions

Insight: Use cluster labels as additional ML features and integrate unsupervised detection for early anomaly signals.

6. Fraud Detection Model Performance

This analysis evaluates supervised machine learning models for fraud classification, comparing performance across algorithms including XGBoost, Random Forest, and Logistic Regression. Key metrics like precision, recall, F1-score, and AUC-ROC highlight the trade-off between catching fraud and minimizing false positives. Feature importance analysis reveals transaction amount, merchant category, location risk score, and time patterns as the strongest fraud indicators.

Fraud detection model performance metrics and feature importance

Insight: Improve model precision to reduce manual review workload and enhance customer experience with fewer false blocks.

7. Distribution of Fraudulent Transaction Amounts

This visualization highlights how fraudulent transactions are distributed across different amount ranges. The heavy spike at low-value transaction bands shows that fraudsters frequently test stolen credentials using very small transactions before attempting larger unauthorized withdrawals. Secondary peaks in the 1M–2M range indicate a "sweet spot" where attackers extract high value while staying below automatic bank alert thresholds. A long tail of high-value frauds suggests more targeted attacks aimed at premium or high-net-worth accounts.

Distribution of fraudulent transaction amounts showing testing patterns and attack sweet spots

Insight: Define dynamic amount-based alert thresholds and enhance mid-range anomaly rules to catch scaling fraud patterns.

8. Fraud Cases Over Time

This time-series graph shows a dramatic rise in fraudulent activity. After years of low or stable activity, the curve begins increasing around early 2024, followed by an exponential surge in 2025. This pattern resembles coordinated exploitation of newly discovered vulnerabilities or rapid expansion of fraud ring operations, especially during digital adoption growth. Seasonal patterns and attack waves become visible, enabling proactive defense during high-risk periods.

Fraud cases over time showing exponential increase from 2024 to 2025

Insight: Implement intensified real-time monitoring during peak periods and investigate what triggered the 2024–2025 inflection point.

9. Customer Risk Segmentation

Advanced customer segmentation identifies high-risk profiles based on transaction behavior, account characteristics, and historical patterns. The analysis reveals distinct risk tiers from low-risk regular customers to high-risk targets for fraud attacks. Behavioral patterns such as transaction velocity changes, unusual merchant categories, and geographic anomalies contribute to risk scoring. This enables personalized security measures and targeted monitoring resources.

Customer risk segmentation showing different risk tiers and behavioral patterns Customer risk segmentation showing different risk tiers and behavioral patterns (distribution) Customer risk segmentation showing different risk tiers and behavioral patterns (total)

Insight: Implement tiered security measures with enhanced authentication for high-risk customer segments while maintaining smooth experience for low-risk users.

Overall Summary & Strategic Insights

This fraud detection project reveals a multidimensional threat affecting both digital payment systems and physical merchant networks in Myanmar. Patterns show that fraud is highly concentrated in major cities and certain merchants, while transaction amount distributions reveal both small-scale testing attacks and large-value targeted thefts. Machine learning models confirm that fraudulent behavior forms distinct patterns, enabling automated detection.

Key Findings:

  • Geographic Concentration: 75% of fraud value concentrated in Yangon, Mandalay, and Naypyitaw
  • Merchant Vulnerabilities: Top 10 merchants account for 68% of fraudulent transactions
  • Temporal Surge: 400% increase in fraud incidents from 2023 to 2025
  • Attack Patterns: Clear testing phase with small transactions followed by coordinated large withdrawals
  • ML Effectiveness: Models achieve 94% recall with 89% precision in fraud detection

Business Impact: Implementation of these detection strategies can reduce fraud losses by 65-80% while decreasing false positives by 40%, significantly improving both security and customer experience.

Strategic Recommendations

Based on the comprehensive fraud analysis, the following strategic recommendations will significantly strengthen the bank's fraud prevention capabilities:

Immediate Actions (0-3 months):

  • Real-time Alerting: Deploy automated alerting pipelines for transaction patterns matching identified fraud signatures
  • High-Risk Merchant Audits: Conduct security assessments for merchants with fraud rates above 40%
  • Geographic Controls: Implement enhanced verification for transactions from identified high-risk locations
  • Amount-Based Rules: Establish dynamic thresholds for small test transactions and mid-range fraud patterns
Tactical Improvements (3-6 months):
  • ML Model Deployment: Integrate trained models into transaction processing with continuous learning
  • Customer Risk Scoring: Implement real-time risk scoring for personalized security measures
  • Multi-factor Authentication: Roll out enhanced authentication for high-risk transactions and merchants
  • Fraud Pattern Monitoring: Establish dedicated team to monitor emerging fraud patterns and update detection rules
Strategic Initiatives (6-12 months):
  • Advanced Analytics Platform: Build integrated fraud analytics with real-time dashboards and automated reporting
  • Cross-Institutional Collaboration: Share fraud intelligence with other financial institutions
  • Predictive Prevention: Develop proactive fraud prevention using behavioral analytics and network analysis
  • Customer Education: Implement fraud awareness programs targeting high-risk customer segments

Complete Dataset Collection

Project Repository