Welcome to the Feature Engineering repository! This project contains hands-on implementation of essential techniques used in preprocessing and transforming data for machine learning models. The focus is on building a strong foundation in practical feature engineering skills.
- Drop missing values
- Mean/Median/Mode Imputation
- Random Sample Imputation
- Capturing NaNs with Indicators
- End of Distribution Imputation
- Arbitrary Value Imputation
- One Hot Encoding
- Ordinal Encoding
- Count/Frequency Encoding
- Target Mean Encoding
- Logarithmic Transformation
- Box-Cox Transformation
- IQR Method
- Z-score Method
- Equal Width Binning
- Equal Frequency Binning
- Min-Max Scaling
- Standardization (Z-score Normalization)
- Robust Scaling
- Date/Time Feature Extraction
- Text Feature Extraction
- Filter Methods (Correlation)
- Wrapper Methods
- Embedded Methods
Each topic is implemented and explained step-by-step in Jupyter notebooks. You can clone the repository and run the notebooks for practice and learning:
git clone https://github.com/Abdullah-Niaz/Feature-Engineering.git
cd Feature-Engineering
Open the notebooks in JupyterLab or Google Colab to explore the techniques interactively.
Abdullah Niaz
Connect with me on GitHub