- This notebook is based on an assignment from the Convolutional Neural Networks course by DeepLearning.AI.
- I completed the implementation as part of my own learning process.
- Modifications, comments, and explanation are my own. For educational purposes only.
- This notebook implements forward propagration, backpropragation of volume convolution: 1 layer (Input -> Conv -> Pooling -> Output)
- Using the dataset ibmdb_reviews which containts 50,000 reviews films, splitted into 25k train and 25k test, labels are 0 or 1 as negative or positive
- Techniques used: Word embedding, bidirectional LSTM, Dropout, L2 Regularization
- Steps:
- Load, process data:
1.1. Use tensorflow_datasets to load the "ibmdb_reviews"
1.2. Split into train, test
1.3. Tokenize, truncating, padding sentences - Build sequential model and train:
2.1 Embedding, Bidirectional(LSTM), Dropout, Dense(ReLU), Output(Sigmoid)
2.2.Loss: binary_crossentropy, optimizer: Adam - Evaluation uses accuracy
- I try to achieve better metrics by using algorithms that search for hyperparameters, such as Bayes Optimization, Genetic Algorithm
- This project is comparison on optimizers (Bayes Optimization and Genetic algorithm) on the Fashion MNIST dataset
- Genetic Algorithm requires a really long time for searching and a strong device to train, recommend for the Bayes
- My team and I used pre-trained ResNet50 to distinguish between real and deepfake images with a pretty number ~ 81% on test data
- Architecture: Based Model (ResNet50) -> Pooling -> Densely -> Output
- For more details, check on Project 2 - DPL.pdf (Vietnamese), slide: Deepfake detection.pdf
- Link dataset: https://drive.google.com/file/d/1HnrO5sGg8gc2VuC1dUHE_62WIbYMSXid/view?usp=sharing
- Link model.h5: https://drive.google.com/file/d/1LF-fYG8r5gXuvHmEEA2y8xLyKeE7Z5pB/view?usp=sharing
- With the same architecture (2 x CNN -> LSTM -> Output), same distribution of train - test set, this aims to compare performance model using normal data vs augmentation data
- Datasets used: Crema_D, RAVDESS, SAVEE, TESS. These 4 datasets express emotion through voice, not through meaning of words.
- For more details, check on DPL-Project-3- Report.pdf (Vietnamese), slide: Project 3 - DPL.pdf
- Link dataset: https://drive.google.com/file/d/1jS8GiWf2PMS_91gGcjCwESsA0MSuyOT9/view?usp=sharing