Large language model-based explainable recommendation (LLM-based ER) systems show promise in generating human-like explanations for recommendations. However, they face challenges in modeling user-item collaborative preferences, personalizing explanations, and handling sparse user-item interactions. To address these issues, we propose GaVaMoE, a novel Gaussian-Variational Gated Mixture of Experts framework for explainable recommendation. GaVaMoE introduces two key components: (1) a rating reconstruction module that employs a Variational Autoencoder (VAE) with a Gaussian Mixture Model (GMM) to capture complex user-item collaborative preferences, serving as a pre-trained multi-gating mechanism; and (2) a set of expert models coupled with the multi-gating mechanism for generating highly personalized explanations. The VAE component models latent factors in user-item interactions, while the GMM clusters users with similar behaviors. Each cluster corresponds to a gate in the multi-gating mechanism, routing user-item pairs to appropriate expert models. This architecture enables GaVaMoE to generate tailored explanations for specific user types and preferences, mitigating data sparsity by leveraging user similarities. Extensive experiments on three real-world datasets demonstrate that GaVaMoE significantly outperforms existing methods in explanation quality, personalization, and consistency. Notably, GaVaMoE exhibits robust performance in scenarios with sparse user-item interactions, maintaining high-quality explanations even for users with limited historical data.
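To make the gating mechanism concrete, here is a minimal PyTorch sketch of the core idea: GMM responsibilities computed over a VAE latent act as soft gates that mix per-cluster experts. This is an illustration, not the authors' implementation; all class, parameter, and dimension names below are assumptions.

```python
import math
import torch
import torch.nn as nn

class GMMGatedExperts(nn.Module):
    """Sketch: GMM responsibilities over VAE latents serve as expert gates."""

    def __init__(self, latent_dim: int, num_clusters: int, hidden_dim: int):
        super().__init__()
        # GMM parameters; in GaVaMoE these would come from VAE/GMM pretraining.
        self.means = nn.Parameter(torch.randn(num_clusters, latent_dim))
        self.log_vars = nn.Parameter(torch.zeros(num_clusters, latent_dim))
        self.log_pi = nn.Parameter(torch.zeros(num_clusters))
        # One expert per user cluster (stand-ins for the explanation experts).
        self.experts = nn.ModuleList(
            nn.Sequential(
                nn.Linear(latent_dim, hidden_dim),
                nn.ReLU(),
                nn.Linear(hidden_dim, latent_dim),
            )
            for _ in range(num_clusters)
        )

    def responsibilities(self, z: torch.Tensor) -> torch.Tensor:
        # log N(z | mu_k, diag(sigma_k^2)) for each Gaussian component k
        diff = z.unsqueeze(1) - self.means                       # (B, K, D)
        log_prob = -0.5 * (
            diff.pow(2) / self.log_vars.exp()
            + self.log_vars
            + math.log(2 * math.pi)
        ).sum(-1)                                                # (B, K)
        # Posterior p(k | z): soft assignment of each user-item pair to a cluster.
        return torch.softmax(self.log_pi + log_prob, dim=-1)

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        gates = self.responsibilities(z)                         # (B, K)
        expert_out = torch.stack([e(z) for e in self.experts], dim=1)  # (B, K, D)
        return (gates.unsqueeze(-1) * expert_out).sum(dim=1)     # gated mixture

# Example: route a batch of 8 latent vectors through 4 cluster experts.
model = GMMGatedExperts(latent_dim=64, num_clusters=4, hidden_dim=128)
mixed = model(torch.randn(8, 64))  # -> shape (8, 64)
```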
`GaVaMoE` --- repository root
- `model` --- model structure & configs
  - `config_llama3.py` --- LLaMA config
  - `config_moe.py` --- MoE config
  - `moe_layer_llama.py` --- MoE layer structure
  - `vae_cluster.py` --- VAE & GMM structure and pretraining
  - `vamoe.py` --- GaVaMoE structure
- `pepler_utils` --- evaluation metrics
  - `bleu.py` --- BLEU score
  - `rouge.py` --- ROUGE score
  - `utils.py` --- metric helper functions
- `utils` --- project utilities
  - `dataset_rs.py` --- dataset
  - `lr_utils.py` --- learning-rate schedules
  - `pepler_dataloader.py` --- dataloader
  - `prompt_process.py` --- prompt integration
  - `utils.py` --- visualization & prompt processing
- `ds_config.json` --- DeepSpeed config
- `train.py` --- training entry point for GaVaMoE
- `inference.py` --- generates explanation text
- `readme.md`
- `requirements.txt`
Datasets to Download
- TripAdvisor Hong Kong
- Amazon Movies & TV
- Yelp 2019
```bash
pip install -r requirements.txt
```
We performed our experiments on 8×A6000 (48 GB) GPUs.
- Replace `dataset` with the dataset name, e.g. `TripAdvisor`, `Amazon`, or `Yelp`.
- Create a `dataset` folder in the repository root, then set `data_path` and `index_dir` accordingly (see the example below the tree).
- `GaVaMoE`
  - `dataset`
    - `Amazon`
      - `1`
      - `2`
      - `item.json`
      - `reviews.pickle`
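For example, the two paths might then look like this (illustrative values, assuming the numbered folders hold the index splits; the actual argument names are defined in `train.py`):

```python
# Hypothetical values -- adapt to the argument names used in train.py.
data_path = './dataset/Amazon/reviews.pickle'  # raw reviews for the chosen dataset
index_dir = './dataset/Amazon/1/'              # one of the index split folders
```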
- Replace `pretrain_model_path` and `pretrain_weight_save`. You can create a new folder named `output` in the repository root, then set `output_dir` accordingly, e.g. `./output/dataset/`, where `dataset` is the dataset name you chose.
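Concretely, the resulting settings might look like this (names and paths are illustrative):

```python
# Hypothetical values -- match the actual arguments in train.py.
pretrain_model_path = './llama3'              # base LLM weights
pretrain_weight_save = './output/Amazon/vae'  # where pretrained VAE/GMM weights go
output_dir = './output/Amazon/'               # training outputs for this run
```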
- If you want to use DeepSpeed, add the config path to the train function in `train.py`.
- Modify `moe_layer_llama.py` so that `num_cluster` equals the `num_cluster` argument (see the snippet below).
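For instance, if you train with 4 clusters, the value inside `moe_layer_llama.py` should match (the exact variable location is illustrative):

```python
# Inside moe_layer_llama.py -- hypothetical snippet; keep this in sync with
# the num_cluster argument passed to train.py.
num_cluster = 4
```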
Use the script below to train GaVaMoE:
```bash
export CUDA_VISIBLE_DEVICES=0
python train.py
```
Alternatively, launch with DeepSpeed:
```bash
deepspeed --num_gpus 1 train.py
```
Create a new folder called `excel` in the repository root, and make sure `data_path` points to the test split stored in Hugging Face `datasets` format. Then modify the configuration in `inference.py`, like this:
```python
vae_model_path = './output/Yelp_Cluster4/Yelp_Cluster4_cluster_4_epoch_20.pth'
llm_model_path = './output/Yelp_Cluster4/explain'
data_path = './dataset/cache/test_dataset_Yelp'
excel_path = './excel/yelp/llama_yelp_cluster4.xlsx'
txt_path = './excel/yelp/llama_yelp_cluster4.txt'
num_cluster = 4
```
`num_cluster` must equal the `num_cluster` used in the training config.
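As a quick sanity check, you can verify that the cached test split loads as a Hugging Face dataset (a hedged sketch; the cache path is the one from the config above and is assumed to have been written with `Dataset.save_to_disk`):

```python
from datasets import load_from_disk

# Assumes the test split was saved earlier via Dataset.save_to_disk(data_path).
test_dataset = load_from_disk('./dataset/cache/test_dataset_Yelp')
print(test_dataset)  # prints the schema and number of rows
```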
```bash
python inference.py
```