A curated list of links related to data analysis in python
- URLs to Selected Libraries
- Visualization Libraries
- Data Sources
- Money
- Python versus XXX
- Data Science and ML Blogs
- Learning and Educational
- Tutorial Notebooks From Github
- numpy - NumPy is the fundamental package for scientific computing with Python.
- scipy - SciPy (pronounced “Sigh Pie”) is a Python-based ecosystem of open-source software for mathematics, science, and engineering.
- pandas - pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
- statistics - Builtin Python library for all basic statistical calculations
- sympy - SymPy is a Python library for symbolic mathematics.
- pymc3 - Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano
- ARCH - ARCH models in Python
- statsmodels - Python module that allows users to explore data, estimate statistical models, and perform statistical tests.
- Matplotlib - The infamous port of the Matlab visualization libs
- Seaborn - A more modern interface and look over Matplotlib
- Plotly - Often thought of as just an online service but also works through Jupyter
- Voila - Turn notebooks into web-apps
- Awesome Public Datasets - truly awesome
- Google Dataset Search
- 2016 Data Scientist Salary Survey - from O'Reilly
- R vs Python: Shrin Glander - Side by Side comparison
- R vs Python for Data Science - from kdnuggets.com
- Python vs Matlab - from a python developer's point of view
- Analytics Vidhya - _
- Essentials of ML Algorithms - Python and R
- Tutorial to Learn Data Science with Python - xpost with Educational
- Data Science Central - perform technical analysis of financial market data
- ML, Data Science, AI, Deep Learnings, and Stats - post by Vincent Granville
- 9 Python Analytics Libraries - post by Vincent Granville
- Python packages for Data Science - extended list
- 10 things about Juypter Notebooks - presentation and code
- Mode Analytics Blog
- Practical Business Python
- Towards Data Science - General repository of short to medium length articles covering a broad set of topics
- Data Skeptic Podcast - Great podcast with long form discussion and short form intro episodes
- Software Engineering Daily Podcast - Covers many areas of Software, much of it focused on ML related technologies
- Data School - Videos and Blogs for data wrangling and EDA.
- Python Crash Course - Links to a range of python cheatsheets.
- How to learn Python - Commentary and Links.
- Markdown Cheatsheet - Format your code better. Another cheatersheet here
- Data Cleaning -Clean data using pandas and Jupyter notebook by Karrie Kehoe. Additional material here and here
- Explanatory Data Analysis -EDA 101
- Past PyData Conferences - up to 2016 as compiled by Rohit Sivaprasad
- Pandas Exercises for Data Analysis
- Pandas for Excel Super Users
- Data Science Terms Explained - from kdnuggets.com
- Book: Quant Equity Investing - by Frank Fabozzi et.al. via James Carl
- Book: The Field Guide to Data Science - by Booz Allen
- Course: Google's Python Class
- Course: Bloombergs's ML Course - Series of youtube videos & related slide decks
- Course: Foundations of Data Science - offered by UC Berkeley (free and open source)
- Course: Andrew Ng's Machine Learning - The original course, taking you from Linear regression through to an intro to Deep Nets
- Course: Andrew Ng's Deep Learning Specialization - 5 courses covering Deep Neural Nets
- Vander Plas: Python Data Science Handbook - Youtube video link:
- Fonnesback: Statistical analysis in python - Youtube video link
- Caudron: Statistical analysis in python - Youtube video link
- Practical Business Python
- Data Schools's top 25 pandas tricks - Youtube video link