This repository serves as a centralized storage location for resources related to the Python sprint , which includes webinar notebooks, various exercises and supplementary materials. Its purpose is to provide learners with easy access to all relevant content needed to enhance their learning experience throughout their time at ExploreAI Academy.
Projects are categorized under machine learning and consequently adhere to a Data Science process guide, which encompasses the following steps:
- Data collection
- Data cleaning and preparation
- Exploratory Data Analysis
- Feature Engineering
- Model building, Evaluation and Deployment.
The datasets used in this repo can be found here.
To carry out all the objectives for this repo, the following necessary dependencies were loaded:
Pandas 2.2.2
andNumpy 1.26
Matplotlib 3.8.4
It's highly recommended to use a virtual environment for your projects, there are many ways to do this; we've outlined one such method below. Make sure to regularly update this section. This way, anyone who clones your repository will know exactly what steps to follow to prepare the necessary environment. The instructions provided here should enable a person to clone your repo and quickly get started.
# make sure your pip in your base Python installation is up-to-date
python3 -m pip install -U pip
# install the virtualenv package
python3 -m pip install virtualenv
# create a virtual environment in this directory called '.venv'
python3 -m venv .venv
# you should now see the folder `.venv` in your repo
# activate the virtual environment
source .venv/bin/activate
# install the requirements for this project
pip install -r requirements.txt
Name | |
---|---|
Marc Marais | mmarias@sandtech.com |
James Beta | jbeta@sandtech.com |
Oladare Adekunle | oadekunle@sandtech.com |
Ereshia Gabier | egabier@sandtech.com |