Skip to content

This repository contains EmoITA, the first Italian text corpus manually annotated with emotion dimensions according to the Valence-Arousal-Dominance (VAD) model. It has been obtained by translating and re-annotating the EmoBank corpus (Buechel and Hahn, 2017). You can also find files from the EmotivITA shared task at EVALITA 2023.

Notifications You must be signed in to change notification settings

GiovanniGafa/EmoITA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

EmoITA

Overview

This repository contains EmoITA, the first text corpus manually annotated with emotion dimensions according to the Valence-Arousal-Dominance (VAD) model. In order to produce it, 16 students from the Master’s Degree in Foreign Languages at the University of Catania were asked to translate to Italian the EmoBank corpus (Buechel and Hahn, 2017). All of them were Italian native speakers and were specializing in English. The same group of subjects also labeled each Italian sentence according to the emotion evoked in an average reader. We took care never to ask a participant to annotate a sentence he had translated.

You can find the corpus inside the corpus subfolder.

The dataset was used for the shared task EmotivITA (https://sites.google.com/view/emotivita) at the EVALITA 2023 evaluation campaign (https://www.evalita.it/campaigns/evalita-2023/). Files from the shared task are available inside the EmotivITA subfolder.

Attribution of Raw Data

As already remarked, the sentences in EmoITA were translated from those in the EmoBank corpus. Emobank was gathered from MASC, the Manually Annotated SubCorpus of the ANC (Ide et al., 2010) and the SemEval 2007 Task 14 (Strapparava & Mihalcea, 2007).

License

This work is licensed under CC-BY-SA 4.0: https://creativecommons.org/licenses/by-sa/4.0/

Citation

Please cite the following paper if you use EmoITA:

Giovanni Gafà, Francesco Cutugno, and Marco Venuti. 2023. EmotivITA at EVALITA2023: Overview of the Dimensional and Multidimensional Emotion Analysis Task. In Mirko Lai, Stefano Menini, Marco Polignano, Valentina Russo, Rachele Sprugnoli, and Giulia Venturi, editors. Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023). Parma, Italy. CEUR.org.

Contact

Please contact me for any additional information: giovanni.gafa@gmail.com

References

  • Sven Buechel and Udo Hahn. 2017. EmoBank: Studying the Impact of Annotation Perspective and Representation Format on Dimensional Emotion Analysis. In EACL 2017 - Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Valencia, Spain, April 3-7, 2017. Volume 2, Short Papers, pages 578-585. Available: http://aclweb.org/anthology/E17-2092
  • Nancy C. Ide, Collin F. Baker, Christiane Fellbaum, and Rebecca J. Passonneau. 2010. The Manually Annotated Sub-Corpus: A community resource for and by the people. In ACL 2010 — Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala, Sweden, 11-16 July 2010, volume 2: Short Papers, pages 68–73.
  • Carlo Strapparava and Rada Mihalcea. 2007. SemEval-2007 Task 14: Affective text. In SemEval 2007 — Proceedings of the 4th International Workshop on Semantic Evaluations @ ACL 2007. Prague, Czech Republic, June 23-24, 2007, pages 70–74.

About

This repository contains EmoITA, the first Italian text corpus manually annotated with emotion dimensions according to the Valence-Arousal-Dominance (VAD) model. It has been obtained by translating and re-annotating the EmoBank corpus (Buechel and Hahn, 2017). You can also find files from the EmotivITA shared task at EVALITA 2023.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages