Skip to content

A stacking ensemble model is built for the purpose of detecting fake reviews. The ensemble is a comination of LogisticRegression and MultinomialNB

Notifications You must be signed in to change notification settings

sudhee302/Fake-Review-Detection-using-Stacking-Ensemble

Repository files navigation

🧠 Fake Review Detection using Ensemble Learning

Python scikit-learn Streamlit


📌 Overview

A machine learning project designed to detect fake product reviews using an ensemble of Logistic Regression and Naïve Bayes models.
Built to improve online review reliability by identifying automatically generated or spammed reviews with high accuracy.


🚀 Features

  • 🧩 Trains and evaluates multiple classification models using scikit-learn
  • 🧠 Implements a stacked ensemble combining Logistic Regression & Naïve Bayes
  • 💬 Utilizes TF-IDF for text vectorization and feature extraction
  • 🌐 Interactive Streamlit interface for real-time single or bulk review analysis
  • 📦 Deployable using ngrok for remote demo access

🗂️ Dataset

  • Size: 40,000 reviews (20K genuine, 20K AI-generated)
  • Type: Text-based, CSV format
  • Processing: Cleaned and preprocessed using Pandas and scikit-learn utilities

🛠️ Tech Stack

Pythonscikit-learnPandasNumPyStreamlitTF-IDF


📊 Results

  • ✅ Achieved 92.16% accuracy on test data using the stacked ensemble
  • 🔍 Ensemble model outperformed individual base learners in precision and recall
  • 🧮 Reduced misclassification through text normalization and feature weighting

🧩 How to Run Locally

  1. Clone this repository
    git clone https://github.com/sudhee302/Fake-Review-Detection-using-Stacking-Ensemble
    cd Fake-Review-Detection-using-Stacking-Ensemble
    

About

A stacking ensemble model is built for the purpose of detecting fake reviews. The ensemble is a comination of LogisticRegression and MultinomialNB

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published