Skip to content

Deepfake Audio Detection using MFCCs and Machine Learning Detects AI-generated speech using MFCC and spectral features from the Fake-or-Real dataset. Trained multiple ML models (SVM, Random Forest, MLP, XGBoost) with PCA and feature normalization. Evaluation includes confusion matrix and ROC curve.

Notifications You must be signed in to change notification settings

anvay936/deepfake-audio-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deepfake Audio Detection via MFCC Features and VGG16 + LSTM

This project implements the research paper:
"Deepfake Audio Detection via MFCC Features Using Machine Learning"


📌 Summary

We detect audio deepfakes using classical ML models and a hybrid deep learning model. The project covers:

  • 🧹 Data Cleaning & Preprocessing
  • 🎵 MFCC and Spectral Feature Extraction
  • ⚙️ Dimensionality Reduction via PCA
  • 🧠 ML Models: SVM, RF, MLP, Gradient Boosting
  • 🤖 Deep Learning: VGG16 + LSTM Fusion
  • 🔍 Hyperparameter tuning with RandomizedSearchCV

📂 Dataset Used

Fake-or-Real (FoR):
Includes 4 subsets:

  • for-original
  • for-2sec
  • for-norm
  • for-rerec

📈 Results & Observations

  • PCA helped reduce training time without losing much accuracy.
  • VGG16+LSTM gave best performance for robust detection.
  • Evaluation used accuracy, confusion matrix, ROC-AUC.

🔗 Notebook

Full code in this notebook: deepfake_audio_detection.ipynb

Also on Kaggle


🚀 Tech Stack

  • Python, NumPy, Pandas
  • Librosa, Matplotlib
  • Scikit-learn
  • TensorFlow / Keras

📦 How to Run

git clone https://github.com/<your-username>/deepfake-audio-detection.git
cd deepfake-audio-detection
jupyter notebook deepfake_audio_detection.ipynb

📊 Results (MLP Classifier on FoR-Rerecorded Subset)

Confusion Matrix

Confusion Matrix

ROC Curve

ROC Curve

About

Deepfake Audio Detection using MFCCs and Machine Learning Detects AI-generated speech using MFCC and spectral features from the Fake-or-Real dataset. Trained multiple ML models (SVM, Random Forest, MLP, XGBoost) with PCA and feature normalization. Evaluation includes confusion matrix and ROC curve.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published