-
Notifications
You must be signed in to change notification settings - Fork 81
Home
Hi!
My name is Mikiko Bazeley and this is my repo wiki for the Springboard Data Science Track.
From Oct 2018 to April 2019 I completed a number of projects, including two capstones, as part of the DS track.
All of the documentation, code, and notes can be found here, as well as links to other resources I found helpful for successfully completing the program.
For questions or comments, please feel free to reach out on LinkedIn.
Regards, Mikiko
For a comprehensive list of the projects and corresponding skills needed, please see the list below.
Topics covered:
- Python
- Matplotlib, Seaborn—visualization tools in Python
- Writing clear, elegant, readable code in Python using the PEP8 standard
Topics covered:
- Deep dive into Pandas for data wrangling
- Data in files: Work with a variety of file formats from plain text (.txt) to more structured and nested formats files like csv and JSON
- Data in databases: Get an overview of relational and NoSQL databases and practice data querying with SQL
- APIs: Collect data from the internet using Application Programming Interfaces (APIs)
Topics covered:
- Theory of inferential statistics
- Statistical significance
- Parameter estimation
- Hypothesis testing
- Correlation and regression
- Exploratory data analysis
- A/B testing
Topics covered:
- Scikit-learn
- Supervised and unsupervised learning
- Top machine learning techniques:
- Linear and logistic regression
- naive bayes
- support vector machines
- decision trees
- clustering
- Ensemble learning with random forests and gradient boosting
- Best practices
- Evaluating and tuning machine learning systems
Topics covered:
- Anatomy of a tech company
- Job search strategies that top candidates use
- How to build your network and effectively use it to land interviews
- Create a high-quality resume, LinkedIn profile and cover letter
- Interview coaching and practice, including mock interviews for both technical and non-technical topics
- Negotiation success tips
- Practice interview questions for each technical topic
- Algorithms and data structures to ace your coding interviews
The capstone project is a key part of our curriculum that every student must complete. The projects are designed to provide you with the experience of working in a realistic data science scenario. Working with your mentor, you'll pick a data set and a problem of interest. From start to finish, your project will be targeted to a specific client (real or imaginary). Using the data science techniques, you've learned, you'll not only come up with a reasonable solution to the problem, but learn to present it to them as a compelling story.
You will work on two capstone projects that involve the following:
-
- Formulating a problem based on exploratory data analysis;
-
- Building a model and transforming data so that it can be input to an algorithm;
-
- Iteratively evaluating performance, and adapting model/data input to figure out if more data or a different algorithm is needed to best solve the problem. If you choose one of our specialization tracks, your second capstone project will be related directly to the specialization of your choice.