CNN-text-from-scratch

This is the keras implementation of the CNN model given in the paper 'Text Understanding from Scratch' (https://arxiv.org/pdf/1502.01710.pdf). Toy datasets are used currently, but the repo will be updated soon to use the original datasets used by the authors of the paper.

Implementation:

Python3, keras API with tensorflow as the backend

Run using:

python3 -W ignore main.py dataset_train.tsv dataset_test.tsv

Files:

main.py - Main file, contains code for model implementation, training and testing
data_handling.py - Auxillary file for dataset handling
dataset_train.tsv - TSV file with toy training data. The character sequences have been generated randomly and don't have any meaning.
dataset_test.tsv - TSV file with toy test data. The character sequences have been generated randomly and don't have any meaning.

In both the data files, the format used for each sample is: Character_sequence \TAB Class_label

Notes:

All output to console
The program generates a file called 'model_structure_new.png' (I've already placed it in this folder), in which it prints the entire structure of the ConvNet
The program generates a .hdf5 file in which it saves the best model during training

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CNN-text-from-scratch

Implementation:

Run using:

Files:

Notes:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
data_handling.py		data_handling.py
dataset_test.tsv		dataset_test.tsv
dataset_train.tsv		dataset_train.tsv
main.py		main.py
model_structure_new.png		model_structure_new.png

akhisud/CNN-text-from-scratch

Folders and files

Latest commit

History

Repository files navigation

CNN-text-from-scratch

Implementation:

Run using:

Files:

Notes:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages