Skip to content

This repository contains a machine learning-based system for detecting fraudulent credit card transactions using Isolation Forest and One-Class SVM algorithms. The project includes data preprocessing, exploratory data analysis (EDA), model training, evaluation, and real-time fraud prediction via a Streamlit web interface.

Notifications You must be signed in to change notification settings

prathamesh693/01_Credit-Card-Fraud-Detection-Using-Anomaly-Detection

Repository files navigation

💳 Credit Card Fraud Detection Using Anomaly Detection

⚠️ Identify fraudulent transactions in real-time using ML

This project aims to detect fraudulent transactions in credit card usage using machine learning techniques. Fraud detection is a classic example of anomaly detection and is crucial for minimizing financial losses and ensuring the security of financial systems.


📚 Table of Contents


📌 Problem Statement

Credit card fraud is a major concern in the financial industry, with billions of dollars lost annually. The key challenge in fraud detection is to identify fraudulent transactions from highly imbalanced datasets where fraud represents a tiny fraction of all records.

The goal is to build an anomaly detection system using unsupervised learning techniques that can accurately identify fraudulent transactions while minimizing false positives, suitable for real-time deployment.


🎯 Objectives

  • Analyze characteristics of fraudulent vs. legitimate transactions
  • Build models using:
    • Isolation Forest
    • One-Class SVM
  • Evaluate with precision, recall, F1-score, AUC-ROC
  • Deploy a real-time detection interface using Streamlit

⚠️ Challenges

  • Extreme class imbalance
  • Anonymized dataset features (less interpretability)
  • Need for real-time inference
  • Managing false positives vs. detection rate trade-off

🛠️ Project Lifecycle

  1. Problem Definition
    • Define use case and success criteria
  2. Data Acquisition & Understanding
    • Use public Kaggle dataset on credit card transactions
  3. Exploratory Data Analysis (EDA)
    • Analyze transaction patterns, detect outliers
  4. Modeling
    • Apply Isolation Forest and One-Class SVM
  5. Evaluation
    • Use precision, recall, F1, ROC-AUC for comparison
  6. Deployment
    • Deploy best model using a Streamlit web app
  7. Monitoring
    • Prepare retraining and drift detection pipeline

💻 Tools and Technologies


✔️ Success Criteria

  • F1-score > 0.85 on test data
  • Real-time prediction latency < 1 second
  • Streamlit interface for live testing
  • Monitoring and retraining ready for production scaling

🔗 References

The dataset is available for download from Kaggle's Credit Card Fraud Detection Dataset.


🤝 Connect With Me

LinkedIn GitHub

About

This repository contains a machine learning-based system for detecting fraudulent credit card transactions using Isolation Forest and One-Class SVM algorithms. The project includes data preprocessing, exploratory data analysis (EDA), model training, evaluation, and real-time fraud prediction via a Streamlit web interface.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published