Skip to content

Latest commit

 

History

History
45 lines (28 loc) · 1.17 KB

README.md

File metadata and controls

45 lines (28 loc) · 1.17 KB

75.06/95.58 Organización de Datos

Análisis exploratorio: Real or Not? NLP with Disaster Tweets

El objetivo del segundo TP es el uso de Machine Learning en la competencia: Real or Not? NLP with Disaster Tweets. Su set de datos se encuentra en: https://www.kaggle.com/c/nlp-getting-started.

Autores:

  • Guglielmone, Lionel (padrón: 96963)
  • Bauni, Chiara (padrón: 102981)
  • Leloutre, Daniela (padrón: 96783)
  • Cai, Ana Maria (padrón: 102150)

Informe: Para editar: https://www.overleaf.com/5739662662nkpmknshprkr Para ver: https://www.overleaf.com/read/mfvcrrwmmtrh

Colab - Entrenamiento RoBERTa: https://colab.research.google.com/drive/1Hr1FO_cXRDgJi4T0QjjthTcVmRa3MVY5?usp=sharing

Modelo entrenado RoBERTa: https://drive.google.com/file/d/1mSkecdje5RCH5wn9Bwx1WlTtiizgn3Qi/view?usp=sharing

CÓMO LEER NOTEBOOKS

FEATURE GENERATION:

Features

FEATURE SELECTION:

Feature selection - decision trees

Feature selection - lightgbm

Feature selection - random forest

ALGORITMOS:

Algoritmos_w_Pipelines

https://colab.research.google.com/drive/1Hr1FO_cXRDgJi4T0QjjthTcVmRa3MVY5?usp=sharing (Entrenamiento Roberta)

RobertaSubmit (Predicción Roberta)