Computer Science, Machine Learning & Natural Language Processing

Under (permanent) construction

Historical papers

On Computable Numbers with an Application to the Entscheidungsproblem - By A.M. Turing, 1936
Computing Machinery and Intelligence By A.M. Turing, 1950.
A Mathematical Theory of Communication By C. Shannon, 1948
The perceptron: A probabilistic model for information storage and organization in the brain by F. Rosenblatt.
Learning representations by back-propagating errors By D. Rumelhart, G. Hinton and R. Williams.

Books

Probability Theory: The Logic of Science By E.T. Jaynes
Speech and Language Processing By J.H. Martin and D. Jurafsky (3rd edition draft)
Reinforcement Learning: an Introduction - By R. Sutton and A. Barto
Machine Learning - By T. Mitchell

General Surveys

Turing Machines - A 1984 Scientific American article by John E. Hopcroft about Turing machines, A.M. Turing, and the history of computability and computational complexity.
Deep Learning - A review of Deep Learning for Nature. By LeCun, Bengio & Hinton

Foundations

Machine Learning is fun! - A really nice machine learning intro, a topic that actually needs an intro. By Adam Geitgey.
Intuition for Simulated Annealing - Shake!. By Robb Seaton.
Everything You Wanted to Know about the Kernel Trick (But Were Too Afraid to Ask). By Eric Kim.
Principal Component Analysis (PCA) vs Ordinary Least Squares (OLS): A Visual Explanation - By J.D. Long
Markov Chains - A visual explanation. By Lewis Lehe.
A Beginner’s Guide to Eigenvectors, PCA, Covariance and Entropy - by Skymind. The most intuitive introduction to Eigenvectors and Eigenvalues I've found so far.
Visual Information Theory - by C. Olah. Entropy, Cross-entropy, and KL-divergence visually explained...
The Matrix Calculus You Need For Deep Learning - by Terrence Parr and Jeremy Howard.
Seeing Theory By Daniel Kunin. A visual introduction to Probability and Statistics

Causality

The Book of Why - by J. Pearl and D. Mackenzie
Causal Inference in Statistics - A Primer - by J. Pearl (webpage and references)
Fairness and machine learning - Chapter 4: Causality by S. Barocas et al.
Causality: Models Reasoning and Inference by J. Pearl
Causality for Machine Learning by B. Schölkopf
ML beyond Curve Fitting: An Intro to Causal Inference and do-Calculus by F. Huszár
Introduction to Causal Inference course by B. Neal

Deep Learning

Deep Learning, NLP, and Representations - By C. Olah
Neural Networks and Deep Learning - By Michael Nielsen. A great online book on neural networks.
Calculus on computational graphs: backpropagation - by C. Olah. Backpropagation explained as calculus on computational graphs
Understanding LSTM Networks - by C.Olah
The Unreasonable Effectiveness of Recurrent Neural Networks - by A. Karpathy. An introduction to RNNs and character-level language models.
Understanding Convolutions - by C.Olah (2014)
Conv Nets: A Modular Perspective - by C.Olah (2014) - How convolutional neural networks work.
Attention is All you Need: Before you Read Transformer - Video tutorial by @NamVo about the Transformer architecture presented in the paper Attention is All You Need

Unsupervised Learning

A tutorial on PCA - Lindsay Smith - 2002 - Very clear, step by step, introduction to Principal Component Analysis

Supervised Learning

Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning - Sebastian Raschka - A great overview of supervised learning methodology

Programming Machine Learning

Introduction to NumPy - By Sebastian Raschka (Appendix F)
An introduction to NumPy and SciPy - By M. Scott Shell
Implementing a Principal Component Analysis (PCA) - by Sebastian Raschka. Using Python and NumPy.

Visualization

Visual Vocabulary (.png) - By ft.com - How to visualize your data, depending on what you want to emphasize.
Visualizing the uncertainty in data - By Nathan Yau
Fundamentals of Data Visualization - By Claus Wilke - "The book is meant as a guide to making visualizations that accurately reflect the data, tell a story, and look professional."
How to make beautiful data visualizations in Python with matplotlib - By Randal Olson

Applications

Movie Recommendations with k-Nearest Neighbors and Cosine Similarity - By Nicole White.
Sentiment Analysis on Movie Reviews - By Rafael Carrascosa. Sentiment Analysis using Random Forests.

Yet assorted

Logs, Tails, Long Tails - By Ryan Moulton. Why log probabilities are useful. Why long tails matter.
Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman - By Rasmus Bååth.

Advanced

Deep Reinforcement Learning Doesn't Work Yet - By Alex Irpan.
The Bitter Lesson by Richard Sutton. Reflections on The Bitter Lesson by Michael Nielsen.
On the Bias-Variance Tradeoff: Textbooks Need an Update - By Brady Neal

Reviews

NLP Year in Review 2019 - By Elvis. Very comprehensive.

My tutorials / Guías (English/Spanish)

Yet Another Python Encoding Tutorial (Python 2)
Matrices for Data Scientists
Natural Language Parsing with Python
Ciencia de Datos: lo mínimo que hay que saber
- 4.1 NumPy
- 4.2 Pandas
- 4.3 Matplotlib y Seaborn

Presentations (in Spanish)

Seminario Ciencia de Datos - Slides for an 8-hour seminar on Data Science. Facultad de Ciencias Económicas - Universidad de la República - Uruguay
Veinte Años de Aprendizaje Automático - Talk at the GX27 Meeting - Uruguay - 2017
Machine Learning, Python y el Titanic - Talk at Tech Meetup Uruguay - 2014 - Slides
Aprendizaje automático en el mundo real - Talk at the GX28 Meeting - Uruguay 2018
Olas, inviernos, ciencia y tecnología: Lo que aprendí del Procesamiento de Lenguaje Natural - Talk at the GX29 Meeting - Uruguay - 2019
Computabilidad y Máquinas de Turing - Talk about computability for a Cognitive Sciences course.

Amusements / Entretenimiento

Figuritas
Mentiras, malditas mentiras, y encuestas
Mi "predicción" para las elecciones 2014 en Uruguay
Sobreajuste - Una increíblemente precisa predicción de casos de COVID-19

