The purpose of this data is to examine the different features and to observe their relationship. The ML model is based on several features of individuals, such as age, physical/family condition, and location, against their existing medical expenses to predict future medical expenses of individuals. This helps the medical insurance company to decide and charge the premium.
Application URL: InsurancePremiumPredictor
This project aims to provide individuals with a personalised estimate of their healthcare needs, enabling them to choose a health insurance plan that aligns with those needs. By focusing on the projected costs from our study, customers can prioritise the health-related aspects of insurance policies over less relevant features.
- Python Modular Coding
- Machine Learning
- MongoDB Database
- Jupyter Notebook
- Git
- CI/CD Pipeline
- Streamlit
Create a conda environment
conda create -p venv python==3.11 -y
activate conda environment
conda activate venv
To install the requirement file
pip install -r requirements.txt
- Add files to git
git add .
orgit add <file_name>
- To check the git status
git status
- To check all versions maintained by git
git log
- To create a version/commit all changes by git
git commit -m "message"
- To send version/changes to GitHub
git push origin main
- Data ingestion is the process by which unstructured data is extracted from one or multiple sources and then prepared for training machine learning models.
- Data validation is an integral part of the ML pipeline. It is checking the quality of source data before training a new mode
- It focuses on checking that the statistics of the new data are as expected (e.g. feature distribution, number of categories, etc).
- Data transformation is the process of converting raw data into a format or structure that is more suitable for model building.
- It is an imperative step in feature engineering that facilitates discovering insights.
- Model training in machine learning is the process in which a machine learning (ML) algorithm is fed with sufficient training data to learn from.
- Model evaluation is the process of using different evaluation metrics to understand a machine learning model’s performance, as well as its strengths and weaknesses.
- Model evaluation is important to assess the efficacy of a model during initial research phases, and it also plays a role in model monitoring.
- Deployment is the method by which we integrate a machine-learning model into the production environment to make practical business decisions based on data.