Skip to content

WebSite-Classifier is a machine learning project that classifies websites based on their content. The repository includes code for web scraping, data cleaning, feature extraction, and model building using scikit-learn and pandas. In addition, the project includes a Flask module for deploying the model.

License

Notifications You must be signed in to change notification settings

SrikantShubam/WebSite-Classifier

Repository files navigation

📚 WebSite-Classifier 🌐

Welcome to the WebSite-Classifier repository! This is a machine learning project that can classify websites into different categories based on their content. This repository contains the code and necessary files to build and train a website classifier model.

📋 Table of Contents

  • Installation
  • Usage
  • Dataset
  • Model Training
  • Contributing
  • License

💻 Installation

To use this project, you will need to install the necessary dependencies. You can do this using pip:

pip install -r requirements.txt

🚀 Usage

The main script for this project is classify.py. To use it, simply run the following command:

python classify.py <url>

📊 Dataset

The dataset used for training this model is not included in this repository. However, you can find the dataset here

🤖 Model Training

To train the website classifier model, you can run the train.py script. This will use the dataset to train a machine learning model and save it to disk.

🚀 Flask Deployment

This project also includes a Flask web application for deploying the website classifier model. To run the Flask app, execute the following commands:

export FLASK_APP=app.py
flask run

The app will be available at http://127.0.0.1:5000/. You can enter a website URL and click the "Classify" button to classify the website into one of several categories.

🤝 Contributing

Contributions to this project are welcome! If you would like to contribute, please create a pull request.

📄 License

This project is licensed under the MIT License. See the LICENSE file for more information.

Thank you for visiting the WebSite-Classifier repository! If you have any questions or suggestions, please feel free to open an issue.

About

WebSite-Classifier is a machine learning project that classifies websites based on their content. The repository includes code for web scraping, data cleaning, feature extraction, and model building using scikit-learn and pandas. In addition, the project includes a Flask module for deploying the model.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •