Melon Playlist Continuation

This is an extra solution to the Melon Playlist Continuation Challenge by the *** Team.
It was inspired by two papers: "A hybrid two-stage recommender system for automatic playlist continuation", which won 3rd place in the RecSys Challenge '18, and "Relational Learning via Collective Matrix Factorization".

Dataset

As stated in the Challenge README, the dataset in data.tar.gz contains 150K playlists that have been created by Melon users.
To untar the dataset:

tar -xvzf data.tar.gz

data/train.json contains complete playlists, whereas data/val.json and data/test.json are intended for submission and therefore include only some of each playlist's songs and tags.
In this repository, data/val.json and data/test.json are used only as additional information.

Solution

Phase 1: Extract candidates using CMF Recommendation (song+tag matrix)
Phase 2: Re-rank candidates using Learning-To-Rank Boosting
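
One way to picture the song+tag matrix of Phase 1 is a binary playlist-by-item matrix whose columns cover both songs and tags. The sketch below builds such a matrix in sparse row form. The function name `build_song_tag_matrix` and the `songs`/`tags` field names are assumptions made for illustration and are not taken from src; the factorization step itself is omitted.

```python
def build_song_tag_matrix(playlists):
    """Map each playlist to the sorted column indices of its songs and tags.

    Songs and tags share one column space, so a single row describes
    everything known about a playlist. (Hypothetical helper, not from src.)
    """
    column_index = {}  # ("song", id) or ("tag", name) -> column number
    rows = []
    for playlist in playlists:
        keys = [("song", s) for s in playlist["songs"]]
        keys += [("tag", t) for t in playlist["tags"]]
        row = set()
        for key in keys:
            if key not in column_index:
                column_index[key] = len(column_index)
            row.add(column_index[key])
        rows.append(sorted(row))
    return rows, column_index


# Toy data; the field names are assumed.
playlists = [
    {"songs": [101, 102], "tags": ["rock"]},
    {"songs": [102, 103], "tags": ["rock", "drive"]},
]
rows, column_index = build_song_tag_matrix(playlists)
print(rows)
```

Each row lists the non-zero columns of one playlist, which is the form most sparse matrix libraries accept as input.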

Preprocessing - Data Partitioning

For local evaluation, we create a new evaluation dataset. part2 and part3 serve as the training and validation datasets for boosting, respectively.
These are divided into question (_q) and answer (_a) parts.
In Phase 1, we train on part1+part2_q+part3_q+evaluation_q and optionally include val.json+test.json as additional information.
In Phase 2, we use part2_q and part3_q as inputs and use part2_a and part3_a as labels, respectively.
Please refer to "A hybrid two-stage recommender system for automatic playlist continuation" for details of the partitioning.
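
The question/answer split can be sketched as follows. This is a minimal illustration only: the `hide_ratio` parameter, the random holdout, and the `id`/`songs`/`tags` field names are assumptions, and the actual partitioning in preprocess.py follows the paper and may differ.

```python
import random


def split_question_answer(playlist, hide_ratio=0.5, seed=0):
    """Split one playlist into a visible question part and a hidden answer part.

    hide_ratio is a hypothetical parameter: the fraction of songs and tags
    moved to the answer side.
    """
    rng = random.Random(seed)
    question = {"id": playlist["id"]}
    answer = {"id": playlist["id"]}
    for field in ("songs", "tags"):
        items = list(playlist[field])
        rng.shuffle(items)
        n_hidden = int(len(items) * hide_ratio)
        answer[field] = items[:n_hidden]    # used as labels
        question[field] = items[n_hidden:]  # used as model input
    return question, answer


playlist = {"id": 1, "songs": [10, 11, 12, 13], "tags": ["rock", "drive"]}
question, answer = split_question_answer(playlist)
```

Every song and tag lands in exactly one of the two parts, so the answer part can be used as labels without leaking into the input.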

Usage

Preprocessing

python3 preprocess.py run ./data/train.json

After running the above, the preprocessed directory is as follows.

preprocessed
├── inputs
│   ├── part1.json
│   ├── part2_q.json
│   ├── part3_q.json
│   └── evaluation_q.json
└── labels
    ├── part2_a.json
    ├── part3_a.json
    └── evaluation_a.json

Training and Prediction

python3 run.py --dir ./preprocessed --additional ./data/val.json ./data/test.json

The --additional flag is optional. To train without the extra files:

python3 run.py --dir ./preprocessed

Evaluation

python3 evaluate.py --result ./result.json --answer ./preprocessed/labels/evaluation_a.json
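
For reference, nDCG with binary relevance can be sketched as below. This is a generic formulation, not a copy of evaluate.py, which may differ in details such as list truncation. The function name `ndcg` is an assumption, and the recommended list is assumed to contain no duplicates.

```python
import math


def ndcg(ground_truth, recommended):
    """Binary-relevance nDCG of a ranked list against a set of true items."""
    relevant = set(ground_truth)
    if not relevant:
        return 0.0
    # A hit at 0-based rank r contributes 1 / log2(r + 2).
    dcg = sum(
        1.0 / math.log2(rank + 2)
        for rank, item in enumerate(recommended)
        if item in relevant
    )
    # Ideal ranking: all relevant items occupy the top positions.
    idcg = sum(1.0 / math.log2(rank + 2) for rank in range(len(relevant)))
    return dcg / idcg


perfect = ndcg([1, 2, 3], [1, 2, 3])
late_hit = ndcg(["a"], ["x", "a"])
no_hit = ndcg([1, 2], [3, 4])
```

The base of the logarithm does not affect the result, because the same constant factor appears in both the DCG and the ideal DCG.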

Score

Music nDCG: 0.250488
Tag nDCG: 0.413651
Final Score: 0.274963

Final Score = Music nDCG * 0.85 + Tag nDCG * 0.15
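
As a quick check of the formula, the snippet below recomputes the final score from the two nDCG values reported above. The constant and function names are chosen for illustration. The result agrees with the reported final score up to rounding in the last digit, presumably because the printed nDCG values are themselves rounded.

```python
MUSIC_WEIGHT = 0.85
TAG_WEIGHT = 0.15


def final_score(music_ndcg, tag_ndcg):
    """Weighted sum of the music and tag nDCG scores."""
    return music_ndcg * MUSIC_WEIGHT + tag_ndcg * TAG_WEIGHT


score = final_score(0.250488, 0.413651)
print(f"{score:.6f}")
```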

Running Environment

We tested this implementation using Python 3.6.9 with an Intel Core i7-9700 CPU and 32GB RAM.
