Insurance Charge Predictor

About project

he purposes of this exercise to look into different features to observe their relationship, and plot a multiple linear regression based on several features of individual such as age, physical/family condition and location against their existing medical expense to be used for predicting future medical expenses of individuals that help medical insurance to make decision on charging the premium.

Technologies

This project is created with below resources:

Python: 3.7
Machine Learning
Jupyter Notebook
Docker
Git
CI/CD Pipeline
Azure

Software and account Requirement

Setup

To install requirement file

pip install -r requirements.txt

Add files to git git add . or git add <file_name>
To check the git status git status
To check all version maintained by git git log
To create version/commit all changes by git git commit -m "message"
To send version/changes to github git push -u origin main

1. Data Ingestion:

Data ingestion is the process in which unstructured data is extracted from one or multiple sources and then prepared for training machine learning models.

2. Data Validation:

Data validation is an integral part of ML pipeline. It is checking the quality of source data before training a new mode
It focuses on checking that the statistics of the new data are as expected (e.g. feature distribution, number of categories, etc).

3. Data Transformation

Data transformation is the process of converting raw data into a format or structure that would be more suitable for model building.
It is an imperative step in feature engineering that facilitates discovering insights.

4. Model Training

Model training in machine learning is the process in which a machine learning (ML) algorithm is fed with sufficient training data to learn from.

5. Model Evaluation

Model evaluation is the process of using different evaluation metrics to understand a machine learning model’s performance, as well as its strengths and weaknesses.
Model evaluation is important to assess the efficacy of a model during initial research phases, and it also plays a role in model monitoring.

6. Model Deployement

Deployment is the method by which we integrate a machine learning model into production environment to make practical business decisions based on data.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.ebextensions		.ebextensions
.github/workflows		.github/workflows
artifacts		artifacts
catboost_info		catboost_info
notebook		notebook
src		src
templates		templates
venv		venv
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
application.py		application.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Insurance Charge Predictor

Table of contents

About project

Technologies

Software and account Requirement

Setup

Project Pipeline

1. Data Ingestion:

2. Data Validation:

3. Data Transformation

4. Model Training

5. Model Evaluation

6. Model Deployement

About

Releases

Packages

Languages

ikigai-aa/insurance-charge-predictor

Folders and files

Latest commit

History

Repository files navigation

Insurance Charge Predictor

Table of contents

About project

Technologies

Software and account Requirement

Setup

Project Pipeline

1. Data Ingestion:

2. Data Validation:

3. Data Transformation

4. Model Training

5. Model Evaluation

6. Model Deployement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages