This repository contains three progressively improved models submitted to Kaggle's Titanic survival prediction competition:
- Baseline Random Forest: default sklearn settings
- Tuned Random Forest: GridSearchCV optimized
- Tuned XGBoost: grid search + feature engineering
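
As a rough illustration of the tuning step, a grid search over a Random Forest might look like the sketch below. The synthetic data from `make_classification` is only a stand-in for the preprocessed Titanic features, and the parameter grid is hypothetical; the values actually searched live in the notebooks and are not listed here.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for the preprocessed Titanic features (assumption).
X, y = make_classification(n_samples=200, n_features=8, random_state=0)

# Hypothetical grid; the real search space is defined in the notebooks.
param_grid = {"n_estimators": [50, 100], "max_depth": [3, 5]}

search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    cv=5,                # 5-fold cross-validation
    scoring="accuracy",  # same metric family as the CV accuracy reported in this README
)
search.fit(X, y)

best_params = search.best_params_     # best combination found in the grid
best_cv_accuracy = search.best_score_  # mean CV accuracy of that combination
print(best_params, round(best_cv_accuracy, 3))
```

`best_score_` is the mean cross-validated accuracy of the best parameter combination, which is the kind of figure a "CV Accuracy" column would typically report.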
- `notebooks/`: All model development notebooks
- `submissions/`: CSVs ready for Kaggle submission
- `requirements.txt`: Python packages used
| Model | CV Accuracy | Kaggle Rank |
|---|---|---|
| Baseline RF | ~82.1% | #14,329 |
| Tuned RF | ~83.7% | #10,729 |
| Tuned XGBoost | ~84.9% | #9,686 |
- Title & Deck extracted from Name/Cabin
- FamilySize, IsAlone
- Pclass, Age, Sex, Fare, Embarked
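
The engineered features above could be derived roughly as follows. This is a minimal sketch, not the notebooks' exact code: the helper names are hypothetical, the `"U"` placeholder for a missing cabin is an assumption, and `FamilySize = SibSp + Parch + 1` is the common convention rather than something this README confirms.

```python
import re


def extract_title(name):
    """Return the honorific between the comma and the first period, e.g. 'Mr'."""
    match = re.search(r",\s*([^.]+)\.", name)
    return match.group(1).strip() if match else "Unknown"


def extract_deck(cabin):
    """Return the first letter of the cabin code, or 'U' when the cabin is missing."""
    # Missing cabins appear as None or NaN (a float), so anything non-string is unknown.
    if not isinstance(cabin, str) or not cabin.strip():
        return "U"
    return cabin.strip()[0]


def family_features(sibsp, parch):
    """Return (FamilySize, IsAlone), counting the passenger themselves in the family."""
    family_size = sibsp + parch + 1
    is_alone = int(family_size == 1)
    return family_size, is_alone
```

With a pandas DataFrame `df` holding the Kaggle columns, these helpers could be applied as `df["Title"] = df["Name"].apply(extract_title)` and `df["Deck"] = df["Cabin"].apply(extract_deck)`.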
- Ensemble Voting Classifier
- LightGBM trials
- Additional feature engineering
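
For the planned voting ensemble, one possible starting point is scikit-learn's `VotingClassifier`. The sketch below is an assumption about how it might be set up: the data is synthetic, and `LogisticRegression` stands in for a second model so the example runs without XGBoost or LightGBM installed.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Synthetic stand-in for the preprocessed Titanic features (assumption).
X, y = make_classification(n_samples=200, n_features=8, random_state=0)

ensemble = VotingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
        # Stand-in for a second tuned model such as XGBoost (assumption).
        ("lr", LogisticRegression(max_iter=1000)),
    ],
    voting="soft",  # average predicted probabilities instead of majority vote
)

# Evaluate the ensemble with 5-fold cross-validated accuracy.
scores = cross_val_score(ensemble, X, y, cv=5, scoring="accuracy")
mean_cv_accuracy = scores.mean()
print(round(mean_cv_accuracy, 3))
```

Soft voting requires every member to expose `predict_proba`; with members that only produce class labels, `voting="hard"` would be the option to use.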