This repo includes implementation of various machine learning models to predict target variable (continuous/label). The series of assignments are a part of Graduate class of Applied Machine Learning (UTD BUAN 6341).
Assignment 1– Linear and Logistic Regression with Gradient Descent AlgorithmAssignment 2– Support Vector Classification with Linear, Radial and Polynomial Kernel, Decision Tree Classifier, Pruned Decision Tree Classifier, and AdaBoost Decision Tree ClassifierAssignment 3– k-Nearest Neighbors Classifier, Artificial Neural Network ClassifierAssignment 4– ANN Classifier run on features created using 1) clustering algorithms – a) k-Means Clustering and b) Expectation Maximization, 2) feature selection technique using Random Forest Feature Importance and feature transformation techniques - a) Principal Component Analysis, b) Independent Component Analysis, and c) Random Projections, and 3) k-Means Clustering features with features selected using Random Forest Feature Importance and Principal Components
- Seoul Bike Sharing Demand Dataset – The dataset contains information regarding number of bikes rented on an hourly basis for a year and includes prevailing weather conditions.
- Framingham Heart Study Dataset – The chronic heart diseases dataset contains information regarding the chance of contracting a chronic heart disease after 10 years based on demographic, behavioral, current and past medical history.