Deep Comedy

Introduction

This project was developed for the Deep Learning course held by professor Andrea Asperti at University of Bologna.

Our goal was to build a neural model capable of producing the correct syllabification of Italian poetry, and using this same model to produce poetry in hendecasyllables in the style of Dante.

We only used the Divine Comedy as training set. For the syllabified version we rely on the outputs of this project from professor Asperti.

Results

We provide three notebooks which explain our model and the results we obtained:

Initial syllabification experiments --> contains our first experiments with syllabification using the transformer architecture
Char2Char generation and syllabification --> uses a transformer architecture for both syllabification and text generation; as the name suggests, both encoder and decoder work at character-level.
Word2Char generation --> we tried to improve the semantics of generated text using a word-level encoder, however the results are only slightly better

You can download the models we trained from this link.

The deepcomedy folder contains the custom libraries we use in the notebooks.

The nlgpoetry folder contains an alternative syllabification algorithm from Neural Poetry. We used this for comparison.

In outputs we provide a syllabification of the "Orlando Furioso" by Ludovico Ariosto, obtained using the Char2Char model.

For a deep dive check the docs folder, which contains a report of our experiments and discoveries.

TODOs:

Using a syllable-based decoder for generation
Longer training
Bigger models :)

Running the code

With poetry

We manage dependencies using poetry.

You can create a virtual environment and install all dependencies using the following poetry command:

poetry install

Then activate the environment with:

poetry shell

Then you should be able to run the notebooks.

Without poetry

We also provide the freezed dependencies in the requirements.txt file. We highly recommend installing them in a virtual environment:

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Credits

Some of the code is adapted from these sources:

Transformer model for language understanding from Tensorflow
https://github.com/AlessandroLiscio/DeepComedy for some metrics
Neural Poetry for some metrics and the alternative syllabification

We used Overleaf as the LaTeX editor for the report.

Name		Name	Last commit message	Last commit date
Latest commit History 193 Commits
data		data
deepcomedy		deepcomedy
docs		docs
experiments		experiments
nlgpoetry		nlgpoetry
outputs		outputs
references		references
.gitignore		.gitignore
Char2Char generation and syllabification.ipynb		Char2Char generation and syllabification.ipynb
Initial syllabification experiments.ipynb		Initial syllabification experiments.ipynb
LICENSE.txt		LICENSE.txt
README.md		README.md
Word2Char generation.ipynb		Word2Char generation.ipynb
poetry.lock		poetry.lock
preprocess.sh		preprocess.sh
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Comedy

Introduction

Results

Running the code

With poetry

Without poetry

Credits

About

Contributors 2

Languages

License

alessandropacielli/deepcomedy

Folders and files

Latest commit

History

Repository files navigation

Deep Comedy

Introduction

Results

Running the code

With poetry

Without poetry

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages