Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis
The code was refactored to integrate all datasets; please contact me if you find any bugs. Thanks.
KuDA is evaluated on four MSA datasets and uses a BERT model in the corresponding language: Chinese (CH-SIMS, CH-SIMSv2) and English (CMU-MOSI, CMU-MOSEI).
- CH-SIMS / CMU-MOSI / CMU-MOSEI can be downloaded from MMSA.
- CH-SIMSv2 can be downloaded from ch-sims-v2 (Supervised).
- CH-SIMS / CH-SIMSv2: bert-base-chinese.
- CMU-MOSI / CMU-MOSEI: bert-base-uncased.
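As a quick check that the language models are in place, here is a minimal sketch of loading the matching BERT backbone with Hugging Face `transformers`. The local `pretrainedModel/BERT/...` paths are an assumption based on the file structure shown further below; point them at wherever you actually stored the weights.

```python
from transformers import BertModel, BertTokenizer

# Assumed local paths -- adjust to where you placed the BERT weights,
# or pass the hub names ("bert-base-chinese" / "bert-base-uncased") directly.
BERT_PATH = {
    "sims":   "pretrainedModel/BERT/bert-base-chinese",
    "simsv2": "pretrainedModel/BERT/bert-base-chinese",
    "mosi":   "pretrainedModel/BERT/bert-base-uncased",
    "mosei":  "pretrainedModel/BERT/bert-base-uncased",
}

def load_bert(dataset: str):
    """Return the tokenizer and encoder matching the dataset's language."""
    path = BERT_PATH[dataset.lower()]
    return BertTokenizer.from_pretrained(path), BertModel.from_pretrained(path)
```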
The results in the paper were obtained with Python 3.8 and PyTorch 1.9.0 on a single NVIDIA RTX 3090. Note that different hardware and software environments can cause the results to fluctuate.
Note: the parameters of the two stages need to be modified for different datasets, because the sequence lengths and feature dimensions differ (see the sketch below).
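For reference, a minimal sketch of the kind of dataset-dependent settings involved. The values are the feature dimensions of the commonly used MMSA processed features and are an assumption here; verify them (and the sequence lengths) against your own data files before training.

```python
# Assumed (text, audio, video) feature dimensions of the MMSA processed features.
# Double-check against your .pkl files; sequence lengths differ per dataset too.
FEATURE_DIMS = {
    "mosi":   (768, 5, 20),
    "mosei":  (768, 74, 35),
    "sims":   (768, 33, 709),
    "simsv2": (768, 25, 177),
}
```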
There are two ways to obtain the knowledge-injection weights:
- Download the translated text file from this link (required for MOSI and MOSEI, not required for CH-SIMS and CH-SIMSv2), and execute the following command to pretrain each modality:

  ```
  python pretrain.py
  ```
- Alternatively, the weights we have previously trained can be downloaded from this link (a sketch of loading them is shown below).

  Once the knowledge-injection weights are ready (by either route), start training:

  ```
  python train.py
  ```
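  If you use the downloaded checkpoints, here is a minimal sketch of loading them before fine-tuning. The file names and state-dict layout are assumptions; match them to the files you actually downloaded and to how the model is built in `models`.

  ```python
  import torch

  # Assumed checkpoint names -- adjust to the files you downloaded.
  CKPT = {
      "text":  "pretrainedModel/KnowledgeInjectPretraining/text_adapter.pth",
      "audio": "pretrainedModel/KnowledgeInjectPretraining/audio_adapter.pth",
      "video": "pretrainedModel/KnowledgeInjectPretraining/video_adapter.pth",
  }

  def load_adapter_weights(model, modality, device="cpu"):
      """Load one modality's pretrained knowledge-injection weights into `model`."""
      state = torch.load(CKPT[modality], map_location=device)
      # strict=False tolerates heads that only exist in the pretraining stage.
      missing, unexpected = model.load_state_dict(state, strict=False)
      return missing, unexpected
  ```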
- In `Encoder_KIAdapter.py`, you need to modify the source code of `torch.nn.TransformerEncoder` so that it returns the intermediate hidden states. The code can be modified as follows:

  ```python
  class TransformerEncoder(Module):
      r"""TransformerEncoder is a stack of N encoder layers."""
      __constants__ = ['norm']

      def __init__(self, encoder_layer, num_layers, norm=None):
          super(TransformerEncoder, self).__init__()
          self.layers = _get_clones(encoder_layer, num_layers)
          self.num_layers = num_layers
          self.norm = norm

      def forward(self, src: Tensor, mask: Optional[Tensor] = None,
                  src_key_padding_mask: Optional[Tensor] = None) -> Tensor:
          r"""Pass the input through the encoder layers in turn."""
          output = src
          # Collect the input plus the output of every layer.
          hidden_state_list = []
          hidden_state_list.append(output)
          for mod in self.layers:
              output = mod(output, src_mask=mask,
                           src_key_padding_mask=src_key_padding_mask)
              hidden_state_list.append(output)
          if self.norm is not None:
              output = self.norm(output)
          # Now returns the final output together with all intermediate states.
          return output, hidden_state_list
  ```
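  A quick sanity check that the patched encoder behaves as expected. The shapes and layer counts are arbitrary example values, and this assumes the `nn.TransformerEncoder` you instantiate is the modified one.

  ```python
  import torch
  import torch.nn as nn

  # With the modification above, every call site must unpack the returned tuple.
  layer = nn.TransformerEncoderLayer(d_model=128, nhead=4)
  encoder = nn.TransformerEncoder(layer, num_layers=3)

  src = torch.randn(20, 8, 128)         # (seq_len, batch, d_model)
  output, hidden_states = encoder(src)  # input state + one state per layer

  assert len(hidden_states) == encoder.num_layers + 1
  assert output.shape == src.shape
  ```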
- After completing the preparation of data and models, the file structure is as follows:

  ```
  ├─core
  ├─data
  │  ├─CH-SIMS
  │  ├─CH-SIMSv2
  │  ├─MOSI
  │  └─MOSEI
  ├─log
  ├─models
  ├─pretrainedModel
  │  ├─BERT
  │  └─KnowledgeInjectPretraining
  ├─opts.py
  ├─pretrain.py
  ├─train.py
  ```
- We gratefully acknowledge the open-source projects used in this work 🎉🎉🎉, including MMSA, ALMT, TMBL, TETFN, CENet, CubeMLP, Self-MM, MMIM, BBFN, MISA, MulT, LMF, TFN, etc. 😄
Paper publication address:
Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis
Please cite our paper if you find it valuable for your research (humbly asking for a citation T^T):
@inproceedings{feng2024knowledge,
title={Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis},
author={Feng, Xinyu and Lin, Yuming and He, Lihua and Li, You and Chang, Liang and Zhou, Ya},
booktitle={Findings of the Association for Computational Linguistics: EMNLP 2024},
pages={14755--14766},
year={2024}
}