Designed Analytics Blog

November 14, 2025

Kumar Singh

ai, artificial-intelligence, chatgpt, machine-learning, technology

Fraud Identification Leveraging WGAN

This is an interesting paper https://lnkd.in/gf8Q3bsV but there are certain concerns that I have.

The results here are obviously dataset-specific. The paper uses bank-transaction data (and later a credit-card fraud dataset), but there needs to be more clarity on how representative the dataset was, and other aspects like whether there is cross-validation, etc.

The other models (WGAN+XGBoost, XGBoost, Logistic Regression) serve as baseline comparators, but it’s not entirely clear what hyper-parameter tuning, feature engineering, or other preprocessing they had compared to the proposed model.

Still, this is an approach worth looking into since the authors aim to build an architecture combining privacy-preserving techniques with strong fraud-detection performance. What they are trying to achieve is:

1. A data protection framework using Federated Learning (FL) + Differential Privacy (DP) to enable collaborative model training without direct data sharing.

2. A fraud‐recognition model built using a variant of Generative Adversarial Networks (specifically a Wasserstein GAN, WGAN) to generate synthetic samples and handle class imbalance.

Three key methodologies have been leveraged here:

1. Adaptive FL-DP architecture

Clients (e.g., banks) generally hold local data; a central server orchestrates model updates without sharing raw data. The authors introduce adaptive gradient pruning and noise differentiation strategies to improve convergence under privacy constraints (i.e., fewer data leakage risks).

Privacy leakage metrics analyzed include mutual information and re-identification risk.

2. Fraud recognition model with WGAN for imbalance

Fraudulent transactions being rare create class-imbalance problems; WGAN is used to generate synthetic “fraud” examples to augment detection training.
They also integrate with a classification/regression model (e.g., XGBoost) for the downstream fraud detection task.

3. Empirical evaluation

Tests on standard datasets (e.g., NSL-KDD, UNSW-NB15, ToN-IoT) for convergence and accuracy of FL-DP architecture. Realistic bank transaction dataset scenario: measuring accuracy, false alarm/miss rates, robustness under attacks, and privacy leakage.

Overall, you can think of a plethora of ways you can leverage these three methodologies to build solutions in the banking sector.

https://lnkd.in/gf8Q3bsV

Posted by:

Kumar Singh

Kumar Singh is the founder of Designed Analytics LLC, which is focused on helping organizations explore how to leverage data and analytics to effectively compete, thrive and innovate. Kumar has over a decade of hands-on experience in supply chain and operations analytics. With over a decade of industry experience, he has worked across multiple industries, helping companies set up analytics centers of excellence. Post his industry experience, Kumar also did a stint in external consulting as a data science consultant with Boston Consulting Group (BCG). Kumar holds an MSc. in AI from Liverpool John Moores University, a Masters in Supply Chain from The Ross School of Business at The University of Michigan, Ann Arbor, an MBA in Operations Management from IIT Roorkee, India, and an undergraduate degree in Electrical engineering. Kumar is ASCM CPIM, CSCP, and CLTD certified and holds the PMP certification from PMI. He is also an AWS-certified Machine Learning Specialist and Microsoft certified in the Azure IoT developer specialty.

Fraud Identification Leveraging WGAN

Share this:

Leave a comment Cancel reply