This assignment involves the following steps:
1. Download the dataset from the provided link.
2. Transform the dataset into a class-balanced dataset in which every class is equally represented.
3. Use the PyCaret library to apply the following five models to the dataset:
   - Random Forest Classifier
   - Extra Trees Classifier
   - Gradient Boosting Classifier
   - Decision Tree Classifier
   - Ada Boost Classifier
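
The balancing and model-comparison steps above can be sketched as follows. This is a minimal illustration, not the assignment's actual code: the text does not say how the classes were balanced, so random undersampling is assumed here, and the helper names (`undersample_to_balance`, `compare_five_models`), the `Class` column, and the toy rows are all hypothetical. The PyCaret calls (`setup`, `compare_models`, `pull`) and the model IDs are the library's own, but they are imported inside the function so the rest of the sketch runs without PyCaret installed.

```python
import random
from collections import Counter, defaultdict

# PyCaret's short IDs for the five classifiers listed above.
MODEL_IDS = {
    "Random Forest Classifier": "rf",
    "Extra Trees Classifier": "et",
    "Gradient Boosting Classifier": "gbc",
    "Decision Tree Classifier": "dt",
    "Ada Boost Classifier": "ada",
}


def undersample_to_balance(rows, label_key, seed=42):
    """Assumed balancing method: keep as many rows per class as the rarest class has."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row in rows:
        by_class[row[label_key]].append(row)
    n_min = min(len(group) for group in by_class.values())
    balanced = []
    for group in by_class.values():
        balanced.extend(rng.sample(group, n_min))  # draw without replacement
    rng.shuffle(balanced)
    return balanced


def compare_five_models(rows, label_key, seed=42):
    """Cross-validate the five classifiers with PyCaret and return the score grid."""
    # Lazy imports: pandas and PyCaret are only needed when this function is called.
    import pandas as pd
    from pycaret.classification import setup, compare_models, pull

    setup(data=pd.DataFrame(rows), target=label_key, session_id=seed, verbose=False)
    compare_models(include=list(MODEL_IDS.values()))
    return pull()  # metrics table produced by the most recent PyCaret call


# Hypothetical toy data: 90 rows of class 0 and 10 rows of class 1.
toy_rows = [{"x": i, "Class": 0} for i in range(90)]
toy_rows += [{"x": i, "Class": 1} for i in range(10)]
balanced_rows = undersample_to_balance(toy_rows, label_key="Class")
print(Counter(row["Class"] for row in balanced_rows))
```

With PyCaret installed, `compare_five_models(balanced_rows, "Class")` would return one row of metrics per model. Undersampling discards majority-class rows; oversampling the minority class is an equally plausible reading of the balancing step.
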
The table below shows the performance of the five models using different sampling techniques:
| Sampling Technique | Random Forest | Extra Trees | Gradient Boosting | Decision Tree | Ada Boost |
|---|---|---|---|---|---|
| Simple Random | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| Systematic | 1.000 | 1.000 | 0.997 | 0.981 | 1.000 |
| Stratified | 1.000 | 1.000 | 1.000 | 0.994 | 1.000 |
| Cluster | 0.780 | 0.841 | 0.871 | 0.738 | 0.864 |
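
The text does not describe how each sample was drawn, so the sketch below only illustrates the textbook form of the four sampling techniques named in the table, using the standard library. The function names, the `Class` column, the sample sizes, and the use of contiguous blocks as clusters are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def simple_random_sample(rows, n, rng):
    """Draw n rows uniformly at random, without replacement."""
    return rng.sample(rows, n)


def systematic_sample(rows, n, rng):
    """Take every step-th row after a random start (requires n <= len(rows))."""
    step = len(rows) // n
    start = rng.randrange(step)
    return [rows[start + i * step] for i in range(n)]


def stratified_sample(rows, n, label_key, rng):
    """Sample each class in proportion to its share of the data."""
    by_class = defaultdict(list)
    for row in rows:
        by_class[row[label_key]].append(row)
    sample = []
    for group in by_class.values():
        # Rounding means the total can differ slightly from n on uneven data.
        k = round(n * len(group) / len(rows))
        sample.extend(rng.sample(group, k))
    return sample


def cluster_sample(rows, n_clusters, n_chosen, rng):
    """Split rows into contiguous blocks and keep every row of n_chosen blocks."""
    size = len(rows) // n_clusters  # leftover rows at the end are ignored
    clusters = [rows[i * size:(i + 1) * size] for i in range(n_clusters)]
    chosen = rng.sample(clusters, n_chosen)
    return [row for cluster in chosen for row in cluster]


# Hypothetical balanced toy data: 100 rows, classes alternating 0 and 1.
rows = [{"x": i, "Class": i % 2} for i in range(100)]
rng = random.Random(0)

samples = {
    "Simple Random": simple_random_sample(rows, 20, rng),
    "Systematic": systematic_sample(rows, 20, rng),
    "Stratified": stratified_sample(rows, 20, "Class", rng),
    "Cluster": cluster_sample(rows, n_clusters=10, n_chosen=2, rng=rng),
}
for name, sample in samples.items():
    print(name, len(sample))
```

Each sample could then be passed to a model-comparison routine to fill in one row of a table like the one above.
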