We proposed a method to create high level content descriptors (known as information nuggets) using social tags, providing datasets with annotations to evaluate content diversity in recommender systems. (See N.Torres 2018 37th International Conference of the Chilean Computer Science Society (SCCC)). We release the datasets and their annotations, encouraging the use of these datasets in future studies on diversity analysis.
All the diversity measures were developed in RecommenderLab, a R package library devoted to recommender systems.
The process to evaluate a recommender (e.g., UBCF) in a dataset (e.g., MovieLens-100K) is shown below:
Load base package and dependences.
library('recommenderlab')
source('AAA.R')
source('calcPredictionAccuracy.R')
source('alpha_Measures.R')
source('BinomDiv.R')
source('evaluate.R')
Load ML100K dataset.
data('MovieLense')
Create an evaluationScheme
object from MovieLense
data set using a 5-fold cross-validation.
e <- evaluationScheme( MovieLense, method = "cross-validation", train = 0.8, k = 5,
given = 15, goodRating = 4)
Evaluate the recommender model given an evaluation scheme. Two approaches for diversity analysis are available:
subtype = "a-nDCG"
for diversity measures based on α-nDCG (Clarke et al. 2008) and extension (Torres and Mendoza 2019).subtype = "BinomDiv"
for Binomial Diversity (Vargas et al. 2014).
> r <- evaluate(e, method = "UBCF", nMatrix = "../nuggets/Nuggets_ML100K.dat", type = "topNList",
subtype = "a-nDCG", n = 10, param = list(method = "cosine", nn = 50))
Results for each fold:
UBCF run fold/sample [model time/prediction time]
α-DCG | α-nDCG | αβ-nDCG | αγ-nDCG | αβγ-nDCG | TotDiv | αβγ-TotDiv | [mt/tt] | |
---|---|---|---|---|---|---|---|---|
1 | 61.3707 | 0.8601 | 0.6124 | 0.7453 | 0.4627 | 0.1125 | 0.0521 | [0.006sec/1.22sec] |
2 | 62.0903 | 0.8657 | 0.6160 | 0.7464 | 0.4636 | 0.1359 | 0.0630 | [0.006sec/1.16sec] |
3 | 62.0703 | 0.8782 | 0.6142 | 0.7430 | 0.4599 | 0.1379 | 0.0634 | [0.006sec/1.33sec] |
4 | 61.5409 | 0.8674 | 0.6155 | 0.7381 | 0.4582 | 0.1407 | 0.0645 | [0.006sec/1.29sec] |
5 | 60.8042 | 0.8620 | 0.6151 | 0.7488 | 0.4646 | 0.1173 | 0.0545 | [0.006sec/1.15sec] |
Overall performance:
> avg(r)
α-DCG | α-nDCG | αβ-nDCG | αγ-nDCG | αβγ-nDCG | TotDiv | αβγ-TotDiv | |
---|---|---|---|---|---|---|---|
10 | 61.5753 | 0.8667 | 0.6146 | 0.7443 | 0.4618 | 0.1289 | 0.0595 |
> r <- evaluate(e, method = "UBCF", nMatrix = "../nuggets/Nuggets_ML100K.dat", type = "topNList",
subtype = "BinomDiv", n = 10, param = list(method = "cosine", nn = 50))
Results for each fold:
UBCF run fold/sample [model time/prediction time]
Coverage | NonRed | BinomDiv | [mt/tt] | |
---|---|---|---|---|
1 | 0.8173 | 0.2107 | 0.1774 | [0.007sec/1.21sec] |
2 | 0.8236 | 0.2305 | 0.1948 | [0.007sec/1.09sec] |
3 | 0.8258 | 0.2340 | 0.1980 | [0.007sec/1.20sec] |
4 | 0.8230 | 0.2242 | 0.1888 | [0.007sec/1.18sec] |
5 | 0.8321 | 0.2310 | 0.1972 | [0.007sec/1.27sec] |
Overall performance:
> avg(r)
Coverage | NonRed | BinomDiv | |
---|---|---|---|
10 | 0.8244 | 0.2260 | 0.1912 |
- RecommenderLab (Reference Manual)
- GroupLens datasets
- Charles L. A. Clarke, Maheedhar Kolla, Gordon V. Cormack, Olga Vechtomova, Azin Ashkan, Stefan Büttcher, Ian MacKinnon. 2008. Novelty and Diversity in Information Retrieval Evaluation pdf
- Saúl Vargas, Linas Baltrunas, Alexandros Karatzoglou, Pablo Castells. 2014. Coverage, Redundancy and Size-Awareness in Genre Diversity for Recommender Systems pdf
- Marcelo Mendoza & Nicolas Torres. 2019. Evaluating content novelty in recommender systems in Journal of Intelligent Information Systems.