Instructor: Sergio A. Mora Pardo
- email: sergioa.mora@javeriana.edu.co
- github: sergiomora03
Knowledge of the challenges and solutions present in specific situations of organizations that require advanced and special handling of information, such as text mining, process mining, data flow mining (stream data mining) and social network analysis. This module on Natural Language Processing will explain how to build systems that learn and adapt using real-world applications. Some of the topics to be covered include text preprocessing, text representation, modeling of common NLP problems such as sentiment analysis, similarity, recurrent models, word embeddings, introduction to lenguage generative models. The course will be project-oriented, with emphasis placed on writing software implementations of learning algorithms applied to real-world problems, in particular, language processing, sentiment detection, among others.
- Python version >= 3.7;
- Numpy, the core numerical extensions for linear algebra and multidimensional arrays;
- Scipy, additional libraries for scientific programming;
- Matplotlib, excellent plotting and graphing libraries;
- IPython, with the additional libraries required for the notebook interface.
- Pandas, Python version of R dataframe
- Seaborn, used mainly for plot styling
- scikit-learn, Machine learning library!
A good, easy to install option that supports Mac, Windows, and Linux, and that has all of these packages (and much more) is the Anaconda.
GIT!! Unfortunatelly out of the scope of this class, but please take a look at these tutorials
- 50% Project
- 40% Exercises
- 10% Class participation
Session | Activity | Deadline | Comments |
---|---|---|---|
Deep Learning |
|
|
Expo March 22th |
NLP |
|
|
Expo April 12th |
Graph Learning |
|
|
|
Final grade |
|
|
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
March 1st | Machine Learning Operations (MLOps) | ||
March 1st | ML monitoring & Data Drift | ||
March 1st | Machine Learning as a Service (AIaaS) |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
March 8th | First steps in deep learning | ||
March 15th | Deep Computer Vision | ||
March 22th | Computer Vision Project | Exercises Deadline | P1 - Frailejon Detection (a.k.a "Big Monks Detection") |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
March 22th | Introduction to NLP |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
March 22th | Space Vector Models | ||
March 29th | Distributed Representations |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
April 5th | Deep Learning in NLP (RNN, LSTM, GRU) | ||
April 12th | NLP Project | P1 - Movie Genre Prediction | |
April 12th | Attention, Tranformers and BERT | ||
April 19th | Holy Week | Holy Week | Holy Week |
April 25th | Exercises Deadline |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
April 26th | Intro to Graphs | ||
April 26th | Graphs Metrics | E10 - Twitter Analysis |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
May 3th | Graph Representation |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
May 10th | Graph Neural Network | ||
May 17th | Graph Machine Learning Task [Optional] | ||
May 24th | Geometric Deep Learning Project | Exercises Deadline | P3 - Graph Machine Learning / P3 - Graph Machine Learning [old < 2022] |
Date | Session | Notebooks/Presentations | Exercises |
---|---|---|---|
May 31th | Final Grades |
Module | Topic | Material |
---|---|---|
NLP | Word Embedding Projector | Tensorflow Embeddings Projector |
NLP | Time Series with LSTM | ARIMA-SARIMA-Prophet-LSTM |
NLP | Stanford | Natural Language Processing with Deep Learning |
GML | Stanford | CS224W: Machine Learning with Graphs |
Module | Topic | Material |
---|---|---|
NLP | Polarity | Sentiment Analysis - Polarity |
NLP | Image & Text | Image Captions |
ML | Hyperparameter Tuning [WIP] | |
NLP | Neural Style Transfer | Style Transfer |