Shopping Predictor

Project Description

This project involves building a nearest-neighbor classifier to predict whether online shopping customers will complete a purchase based on their browsing behavior. The classifier uses a dataset of around 12,000 user sessions, analyzing features like pages visited, session duration, bounce rates, and more.

Features

Data Preprocessing: Load and process data from a CSV file.
Model Training: Train a k-nearest-neighbor classifier using scikit-learn.
Evaluation Metrics: Calculate sensitivity (true positive rate) and specificity (true negative rate).

Usage

Setup:
```
pip install scikit-learn
```
Run the Program:
```
python shopping.py shopping.csv
```

Functions to Implement

load_data(filename): Load and preprocess data from CSV.
train_model(evidence, labels): Train the k-nearest-neighbor model.
evaluate(labels, predictions): Calculate sensitivity and specificity.

Evaluation

Correct: Number of correct predictions.
Incorrect: Number of incorrect predictions.
True Positive Rate: Proportion of actual positives correctly identified.
True Negative Rate: Proportion of actual negatives correctly identified.

Results Explanation

Run the program:

python shopping.py shopping.csv

After running the program, the output will include:

Correct: 4078
Incorrect: 854
True Positive Rate: 40.61%
True Negative Rate: 90.24%

Interpretation of Results

Correct Predictions (4078): The number of user sessions where the model accurately predicted whether a purchase was completed.
Incorrect Predictions (854): The number of user sessions where the model's prediction was incorrect.
True Positive Rate (Sensitivity) (40.61%): This metric indicates that 40.61% of the actual purchases were correctly identified by the model. It reflects the model's ability to identify users who will make a purchase.
True Negative Rate (Specificity) (90.24%): This metric indicates that 90.24% of the non-purchases were correctly identified by the model. It reflects the model's ability to identify users who will not make a purchase.

These metrics help in understanding the performance of the classifier in distinguishing between customers who are likely to complete a purchase and those who are not. The higher the sensitivity and specificity, the better the model is at making accurate predictions.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
LICENSE.txt		LICENSE.txt
README.md		README.md
requirements.txt		requirements.txt
shopping.csv		shopping.csv
shopping.py		shopping.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Shopping Predictor

Project Description

Features

Usage

Functions to Implement

Evaluation

Results Explanation

Interpretation of Results

About

Releases

Packages

Languages

License

SavinRazvan/shopping

Folders and files

Latest commit

History

Repository files navigation

Shopping Predictor

Project Description

Features

Usage

Functions to Implement

Evaluation

Results Explanation

Interpretation of Results

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages