GitHub - mohan696matlab/Unsupervised-_Fault_Detection-Industrial_Process: This repository contains code for analyzing the TEP dataset, which is a public dataset for evaluating fault detection and diagnosis algorithms in industrial systems. The dataset includes measurements from a simulated production line, and faults are introduced at specific times during the production process.

Introduction

Welcome to our tutorial series on unsupervised machine learning techniques for fault detection using the Tennessee Eastman Process dataset. In this playlist, we will cover various unsupervised machine learning techniques for detecting anomalies and faults in the process. We will start with statistical methods and progress to more complex techniques such as PCA reconstruction, autoencoder-based reconstruction, ensemble learning, CNN, LSTM-based next-step prediction, Transformer-based anomaly detection, and XAI methods on autoencoders. By the end of this tutorial series, you will have a comprehensive understanding of how unsupervised machine learning can be used to detect faults in complex industrial processes. This repository is a part of youtube playlist.

Requirements

Python 3
pyreadr
matplotlib
seaborn
numpy
pandas
scikit-learn
tensorflow
keras

Usage/Examples

Video 1: Statistical Method for Anomaly Detection

In this video, we will provide an introduction to statistical methods for anomaly detection. We will discuss the basic concepts of statistical process control, including control charts and process capability analysis. We will also demonstrate how to detect anomalies using statistical methods.

Video 2: PCA Reconstruction-based Anomaly Detection

In this video, we will introduce the concept of Principal Component Analysis (PCA) and demonstrate how it can be used for anomaly detection. We will also discuss the limitations of PCA and how it can be improved by incorporating additional features.

I realized that I made a mistake when demonstrating how to use the Pandas iloc function. Specifically, I used the following code: df.iloc[:3], when I should have used the code df.iloc[:, 3:] instead. This error may have caused unexpected results when working with your DataFrame. To correct this mistake, please replace df.iloc[:3] with df.iloc[:, 3:] in your code. This will select all rows and all columns starting from the 4th column of your DataFrame.

Video 3: Autoencoder-based Reconstruction

In this video, we will introduce autoencoders and demonstrate how they can be used for anomaly detection. A brief introduction of autoencoder will be followed by implementation

Video 4: Ensemble Learning

In this video, we will introduce ensemble learning techniques and demonstrate how simple regression model can be used for anomaly detection. We will also discuss the advantages of ensemble learning over single models.

Video 5: LSTM-based Next Step Prediction

In this video, we will introduce convolutional neural networks (CNN) and long short-term memory (LSTM) models and demonstrate how they can be used for next-step prediction in the Tennessee Eastman Process dataset. We will also discuss how this technique can be used for anomaly detection.

Video 6: K-means distance based anomaly detection

In this video, viewers will learn about K-means distance based anomaly detection and how it can be applied to the Tennessee Eastman Process dataset. The video will explain how K-means distance is used to identify anomalies in data and discuss the advantages of using this approach over traditional machine learning models. The video will also provide a step-by-step demonstration of how to perform K-means distance based anomaly detection on the Tennessee Eastman Process dataset.

Video 7: One Class SVM and Isolation Forest

In this video, we will introduce One Class SVM and Isolation Forest and demonstrate how it can be used for anomalies detection.

Average F1 score (average='macro') obtained from various methods on this dataset with Unsupervised Learning excluding Fault Number 3 and 9

Method	F1 Score	Accuracy
Statistical Method for Anomaly Detection	0.679	0.787
PCA Reconstruction-based Anomaly Detection	0.619	0.723
Autoencoder-based Reconstruction	0.627	0.729
Ensemble Learning	0.690	0.818
CNN, LSTM-based Next Step Prediction	0.579	0.720
K means-based Anomaly Detection	0.612	0.717
One Class SVM	0.635	0.745
Isolation Forest	0.481	0.579

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
images		images
1_TEP_statistical_thresholding.ipynb		1_TEP_statistical_thresholding.ipynb
2_TEP_PCA_control_charts.ipynb		2_TEP_PCA_control_charts.ipynb
3_TEP_Autoencoder.ipynb		3_TEP_Autoencoder.ipynb
4_TEP_Ensemble_reconstruction.ipynb		4_TEP_Ensemble_reconstruction.ipynb
5_TEP_LSTM_CNN.ipynb		5_TEP_LSTM_CNN.ipynb
6_TEP_K_means.ipynb		6_TEP_K_means.ipynb
7_TEP_One_Class_SVM.ipynb		7_TEP_One_Class_SVM.ipynb
8_TEP_Isolation_Forest.ipynb		8_TEP_Isolation_Forest.ipynb
LICENSE		LICENSE
PowerPoint_for_Fault_detection_TEP.pptx		PowerPoint_for_Fault_detection_TEP.pptx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction

Requirements

Usage/Examples

Acknowledgements

About

Uh oh!

Releases

Packages

Languages

License

mohan696matlab/Unsupervised-_Fault_Detection-Industrial_Process

Folders and files

Latest commit

History

Repository files navigation

Introduction

Requirements

Usage/Examples

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages