- Dataset : Intrusion Detection Evaluation Dataset (CICIDS2017)
- Feature Selection : Based on Pearson Correlation and Chi-square distribution
- Data Pre-processing : Combine csv files, normalize features, splitting data with cross validation -stratified
Run preprocessing.py to generate the file that we will be using to train our model.