Multimedia project

This project transforms Social media prediction challenge to become recommendation tasks.

As dataset does not provide ratings of each user per post we decided to target this as an content based task focusing on item to item predictions.

We distinguish 3 approaches (all in module recommendation_system/content_based):

File name	Recommender System description
`content_based_rs.py`	Similarity based approach calculating cosine similarity between users history posts and potential recommended posts.
`date_based_cb_rs.py`	Similarity based approach similar to the one above with added weighting by user's historical post date.
`classify_rs.py`	Machine learning approach using Random Forest classifier in order to assign recommendation for a user.

Evaluation

In order to evaluate our results we decided to calculate following metrics:

Metric	Explanation
Precision@5	Precision for first 5 retrieved posts.
Precision@10	Precision for first 10 retrieved posts.
Precision@50	Precision for first 50 retrieved posts.
Recall@5	Recall for first 5 retrieved posts.
Recall@10	Recall for first 10 retrieved posts.
Recall@50	Recall for first 50 retrieved posts.
Mean Average Precision	Mean value of average precision. More information here.
Mean Reciprocal Rank	Mean value of position of first relevant post retrieved over all queries. More precise information here.

Obtain data

We used data from Social Media Prediction Challenge. It should be stored in directory data/train_all_json/.

Cofiguration

In order to install all required dependencies and reformat data for recommendation task run following command.

./configure.sh

Organization of data for training

{"user_id1": {"train_set": dataframe with post ids as index and the features as columns,
              "test_set": same as train_set
             },
 "user_id2": {...},
 ...
}

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
data		data
feature_extraction/image		feature_extraction/image
recommendation_system		recommendation_system
.gitignore		.gitignore
README.md		README.md
compute_feats.py		compute_feats.py
configure.sh		configure.sh
create_datasets.py		create_datasets.py
dataset_statistics.py		dataset_statistics.py
final_statistics.py		final_statistics.py
preprocess_data.py		preprocess_data.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimedia project

Evaluation

Obtain data

Cofiguration

Organization of data for training

About

Releases

Packages

Languages

rpytel1/multimedia-project

Folders and files

Latest commit

History

Repository files navigation

Multimedia project

Evaluation

Obtain data

Cofiguration

Organization of data for training

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages