Read the paper here.
To reproduce the experiments, complete the following steps:
Create a virtual environment and install the required dependencies:
pip install -r requirements.txt
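For example, the whole setup with Python's built-in venv module could look as follows (the environment name .venv is just a placeholder):
# Create and activate a virtual environment, then install the dependencies
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt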
Run the following command to complete data pre-processing and bring all datasets to the BEIR format:
python3 data_preprocessing.py
For our experiments, we use the following four publicly available IR benchmarks: (i) HotpotQA and (ii) Natural Questions from the BEIR collection, as well as the (iii) Political Science and (iv) Computer Science collections from the Multi-Domain Benchmark. We bring the last two collections to the BEIR format for our experiments.
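For reference, a dataset in BEIR format consists of a corpus.jsonl file, a queries.jsonl file, and a qrels folder with TSV relevance judgements. The output path used by data_preprocessing.py is an assumption here, but the resulting layout can be inspected roughly as follows:
# Inspect a BEIR-formatted dataset (the datasets/hotpotqa path is illustrative;
# adjust it to wherever data_preprocessing.py writes its output)
head -n 1 datasets/hotpotqa/corpus.jsonl    # one document per line: {"_id": ..., "title": ..., "text": ...}
head -n 1 datasets/hotpotqa/queries.jsonl   # one query per line: {"_id": ..., "text": ...}
head -n 3 datasets/hotpotqa/qrels/test.tsv  # tab-separated: query-id, corpus-id, score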
Update the parameters in pipeline.sh to the desired values and execute the script.
For example, the following command will train sbmoe_ALL on TinyBERT using the HotpotQA dataset by employing 6 experts:
python3 1_train_new_moe.py model=tinybert dataset=hotpotqa testing=hotpotqa model.adapters.use_adapters=True model.adapters.num_experts=6 model.init.specialized_mode=sbmoe_all
Note: When use_adapters=True, only two modes are allowed: (i) 'sbmoe_all' and (ii) 'sbmoe_top1'.
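A minimal sketch of how pipeline.sh might sweep over parameter settings and call the training script is given below; the loop structure and the expert counts other than 6 are assumptions for illustration, not the actual contents of the script:
# Hypothetical pipeline.sh sketch: sweep over the number of experts and invoke
# the training script with the same overrides as in the example above.
# Only num_experts=6 is documented; 2 and 4 are illustrative values.
for EXPERTS in 2 4 6; do
  python3 1_train_new_moe.py model=tinybert dataset=hotpotqa testing=hotpotqa \
    model.adapters.use_adapters=True model.adapters.num_experts=$EXPERTS \
    model.init.specialized_mode=sbmoe_all
done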
3D t-SNE representations of the same query and its top 1000 documents, on the left embedded by the original DRM and on the right by our model, for all seven benchmarks.