This repository contains EmoITA, the first Italian text corpus manually annotated with emotion dimensions according to the Valence-Arousal-Dominance (VAD) model. To produce it, 16 students from the Master’s Degree in Foreign Languages at the University of Catania were asked to translate the EmoBank corpus (Buechel and Hahn, 2017) into Italian. All of them were native Italian speakers specializing in English. The same group also labeled each Italian sentence according to the emotion evoked in an average reader. We took care never to ask a participant to annotate a sentence they had translated themselves.
You can find the corpus inside the corpus subfolder.
The dataset was used for the shared task EmotivITA (https://sites.google.com/view/emotivita) at the EVALITA 2023 evaluation campaign (https://www.evalita.it/campaigns/evalita-2023/). Files from the shared task are available inside the EmotivITA subfolder.
As already noted, the sentences in EmoITA were translated from those in the EmoBank corpus. EmoBank was in turn gathered from MASC, the Manually Annotated Sub-Corpus of the ANC (Ide et al., 2010), and from SemEval-2007 Task 14 (Strapparava and Mihalcea, 2007).
This work is licensed under CC-BY-SA 4.0: https://creativecommons.org/licenses/by-sa/4.0/
Please cite the following paper if you use EmoITA:
Giovanni Gafà, Francesco Cutugno, and Marco Venuti. 2023. EmotivITA at EVALITA2023: Overview of the Dimensional and Multidimensional Emotion Analysis Task. In Mirko Lai, Stefano Menini, Marco Polignano, Valentina Russo, Rachele Sprugnoli, and Giulia Venturi, editors. Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023). Parma, Italy. CEUR.org.
Please contact me for any additional information: giovanni.gafa@gmail.com
- Sven Buechel and Udo Hahn. 2017. EmoBank: Studying the Impact of Annotation Perspective and Representation Format on Dimensional Emotion Analysis. In EACL 2017 - Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics. Valencia, Spain, April 3-7, 2017. Volume 2, Short Papers, pages 578-585. Available: http://aclweb.org/anthology/E17-2092
- Nancy C. Ide, Collin F. Baker, Christiane Fellbaum, and Rebecca J. Passonneau. 2010. The Manually Annotated Sub-Corpus: A community resource for and by the people. In ACL 2010 - Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala, Sweden, July 11-16, 2010. Volume 2, Short Papers, pages 68-73.
- Carlo Strapparava and Rada Mihalcea. 2007. SemEval-2007 Task 14: Affective text. In SemEval 2007 - Proceedings of the 4th International Workshop on Semantic Evaluations @ ACL 2007. Prague, Czech Republic, June 23-24, 2007, pages 70-74.