Network-Security-System

This project demonstrates the MLOps workflow for a Network Security System, implementing a continuous integration, continuouse delivery and deployment pipeline. The system ingests network security data, validates, transforms, and trains a predictive model to detect potential security threats.

Additionally, a FastAPI-based web service is integrated to enable model training and real-time inference via API endpoints.

Key Features

Automated Data Pipeline – Ingestion, Validation, and Transformation of security data.
Model Training Pipeline – Uses structured MLOps principles for scalable training.
FastAPI for Deployment – Enables RESTful API for model retraining and prediction.
MongoDB Integration – Stores ingested data for further analysis.
Structured Logging and Exception Handling – Ensures traceability and robustness.
Containerized Deployment Ready – Easily deployable using Docker/Kubernetes.

MLOps Workflow

Data Ingestion: Retrieves raw security data and stores it in MongoDB.
Data Validation: Ensures the dataset meets predefined schema requirements.
Data Transformation: Prepares features for model training.
Model Training: Trains an ML model to detect security anomalies.
Model Deployment: Serves the trained model via FastAPI.
Continuous Monitoring & Retraining: Supports periodic retraining for improved accuracy.

Below is an explanation of each column and what the values represent:

Feature Explanations

having_IP_Address:
- Whether the URL contains an IP address instead of a domain name.
- Phishing URLs often use IP addresses.
- 1: No IP address, legitimate.
  0: Neutral.
  -1: Contains an IP address, phishing.
URL_Length:
- Checks the length of the URL. Longer URLs can indicate phishing.
- 1: Short, legitimate.
  0: Medium-length.
  -1: Long, phishing.
Shortining_Service:
- Whether the URL uses a shortening service like bit.ly.
- 1: Not shortened, legitimate.
  -1: Shortened, phishing.
having_At_Symbol:
- Checks if the URL contains an "@" symbol, which can redirect users to another URL.
- 1: No "@" symbol, legitimate.
  -1: Contains "@", phishing.
double_slash_redirecting:
- Whether there is a "//" redirect after the domain in the URL.
- 1: No redirect, legitimate.
  -1: Redirect present, phishing.
Prefix_Suffix:
- Checks if the domain has a hyphen ("-"), often used in phishing URLs.
- 1: No hyphen, legitimate.
  -1: Hyphen present, phishing.
having_Sub_Domain:
- The number of subdomains in the URL. More subdomains can indicate phishing.
- 1: Few subdomains, legitimate.
  0: Moderate number of subdomains.
  -1: Many subdomains, phishing.
SSLfinal_State:
- Checks the SSL/TLS certificate status of the website.
- 1: Valid SSL, legitimate.
  0: Suspicious or self-signed SSL.
  -1: No SSL, phishing.
Domain_registeration_length:
- Length of the domain's registration period. Shorter registrations often indicate phishing.
- 1: Long registration, legitimate.
  -1: Short registration, phishing.
Favicon:
- Checks the source of the favicon (website icon). Phishing sites may use unrelated favicons.
- 1: Favicon matches domain, legitimate.
  -1: Favicon from a different domain, phishing.
port:
- Whether unusual ports (other than 80 or 443) are open.
- 1: No unusual ports, legitimate.
  -1: Unusual ports, phishing.
HTTPS_token:
- Checks if "HTTPS" is present in the domain name, which may falsely imply security.
- 1: No HTTPS token in domain, legitimate.
  -1: HTTPS token in domain, phishing.
Request_URL:
- Percentage of external objects (like images) loaded from other domains.
- 1: Low percentage, legitimate.
  0: Moderate percentage.
  -1: High percentage, phishing.
URL_of_Anchor:
- Percentage of anchor tags with empty or external links.
- 1: Low percentage, legitimate.
  0: Moderate percentage.
  -1: High percentage, phishing.
Links_in_tags:
- Percentage of links in , <script>, or tags.
- 1: Low percentage, legitimate.
  0: Moderate percentage.
  -1: High percentage, phishing.
SFH (Server Form Handler):
- Checks the action attribute of form tags to see if it’s empty or points to a different domain.
- 1: Legitimate action.
  0: Suspicious.
  -1: Phishing.
Submitting_to_email:
- Checks if forms on the website directly send data to email addresses.
- 1: No, legitimate.
  -1: Yes, phishing.
Abnormal_URL:
- Whether the URL structure matches typical patterns for the domain.
- 1: Normal structure, legitimate.
  -1: Abnormal structure, phishing.
Redirect:
- Number of redirections the URL performs.
- 1: Few or none, legitimate.
  -1: Many redirects, phishing.
on_mouseover:
- Checks for changes in status bar content when hovering over elements.
- 1: No changes, legitimate.
  -1: Changes present, phishing.
RightClick:
- Whether right-click functionality is disabled on the website.
- 1: Not disabled, legitimate.
  -1: Disabled, phishing.
popUpWidnow:
- Checks for pop-up windows triggered by the website.
- 1: No pop-ups, legitimate.
  -1: Pop-ups present, phishing.
Iframe:
- Checks for the presence of hidden iframes.
- 1: No hidden iframes, legitimate.
  -1: Hidden iframes present, phishing.
age_of_domain:
- Age of the domain in months. Older domains are usually more trustworthy.
- 1: Old, legitimate.
  -1: New, phishing.
DNSRecord:
- Checks if DNS records exist for the domain.
- 1: Records exist, legitimate.
  -1: No records, phishing.
web_traffic:
- Traffic rank of the website. Lower rank indicates more traffic.
- 1: High traffic, legitimate.
  0: Moderate traffic.
  -1: Low traffic, phishing.
Page_Rank:
- Google PageRank score of the website.
- 1: High PageRank, legitimate.
  -1: Low PageRank, phishing.
Google_Index:
- Checks if the website is indexed by Google.
- 1: Indexed, legitimate.
  -1: Not indexed, phishing.
Links_pointing_to_page:
- Number of backlinks pointing to the website.
- 1: Many links, legitimate.
  0: Moderate number of links.
  -1: Few links, phishing.
Statistical_report:
- Checks if the website is flagged by statistical or threat analysis tools.
- 1: Not flagged, legitimate.
  -1: Flagged, phishing.
Result:
- The target variable indicating whether the URL is phishing.
- 1: Legitimate.
  -1: Phishing.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.github/workflows		.github/workflows
Network_Data		Network_Data
data_schema		data_schema
final_model		final_model
networksecurity		networksecurity
prediction_output		prediction_output
templates		templates
valid_data		valid_data
workflows		workflows
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
main.py		main.py
push_data.py		push_data.py
requirements.txt		requirements.txt
setup.py		setup.py
test_mongodb.py		test_mongodb.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Network-Security-System

Key Features

MLOps Workflow

Feature Explanations

About

Uh oh!

Releases

Packages

Languages

License

Chukwuemeka-James/Network-Security-System

Folders and files

Latest commit

History

Repository files navigation

Network-Security-System

Key Features

MLOps Workflow

Feature Explanations

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages