This repository contains the resources used for the RecSys'2024 submission "Towards Better STEM Recommendations: A Gold-Standard Dataset with Math content".
To install the dependencies needed to run the scripts and reproduce the results, run the following command (creating a virtual environment first is recommended):
pip install -r requirements.txt
The documents are obtained via the zbMATH Open API: https://api.zbmath.org/v1/
The document IDs can be obtained from the Zenodo record: https://zenodo.org/records/5062959
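As a rough illustration of how a single document might be requested from the API, here is a minimal sketch. The `document/<id>` path appended to the base URL is an assumption and should be checked against the zbMATH Open API documentation. The helper names `build_document_url` and `fetch_document` are hypothetical and are not part of this repository.

```python
import json
import urllib.request

# Base URL as given in this README.
API_BASE = "https://api.zbmath.org/v1/"


def build_document_url(document_id):
    """Build the request URL for one document.

    Assumption: documents are served under 'document/<id>'.
    """
    return f"{API_BASE}document/{document_id}"


def fetch_document(document_id, timeout=30):
    """Download and parse the JSON record of one document (needs network access)."""
    url = build_document_url(document_id)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_document` with an ID taken from the Zenodo record would return the parsed JSON response. The layout of that response is not described here and should be inspected before relying on any field.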
To obtain the metadata of all documents (title, abstract, keywords, MSCs, citations, author names, etc.), run:
python src/hybrid/feature_simil/getDataset.py
To get the evaluation results of the baseline, follow the steps described in:
reproducing_results/Baseline/
To get the evaluation results of the Initial Ranker, follow the steps described in:
reproducing_results/InitialRanker/
To get the evaluation results of the Re-ranker, follow the steps described in:
reproducing_results/Re-ranker/
The annotations from the four annotators are available in the folder:
src/hybrid/originalAnno/
To calculate the Kappa scores and the evaluation scores (Precision, Recall, F1, MRR, nDCG), run:
python src/hybrid/userAnnoEval.py
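For readers unfamiliar with these measures, the following is a minimal sketch of how they are commonly defined. It is not the code in userAnnoEval.py. It assumes binary relevance, uses pairwise Cohen's kappa (the script may use a different agreement statistic for four annotators), and all function names and document IDs are hypothetical.

```python
import math
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each annotator's label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1.0 - expected)


def precision_recall_f1(ranked, relevant, k):
    """Precision, recall and F1 over the top-k recommendations."""
    hits = sum(1 for doc in ranked[:k] if doc in relevant)
    precision = hits / k if k else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant item; MRR is the mean over all queries."""
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0


def ndcg(ranked, relevant, k):
    """nDCG@k with binary gains and a log2 position discount."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc in enumerate(ranked[:k], start=1)
        if doc in relevant
    )
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg else 0.0


# Made-up example: one seed document, four recommendations, two relevant ones.
ranked = ["d3", "d1", "d7", "d2"]
relevant = {"d1", "d2"}
print(precision_recall_f1(ranked, relevant, k=4))
print(reciprocal_rank(ranked, relevant))
print(ndcg(ranked, relevant, k=4))
print(cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0]))
```

Scores reported for a whole dataset are typically the averages of these per-query values over all seed documents.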