Million Song Dataset Recommender System

Built a recommender system using PySpark’s ALS (alternating least squares) method to learn latent factor representations for users and items. The final model produces the top 500 songs for each user and is evaluated on mean average precision. Compared the results against a single-machine implementation using lightfm.
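As an illustration of the evaluation metric, here is a minimal pure-Python sketch of mean average precision over top-k recommendation lists. It is not the project's code: the function names, the dictionary inputs, and the choice to normalize each user's score by their number of relevant songs are assumptions made for this example, and library implementations may normalize differently.

```python
from typing import Dict, Hashable, Sequence, Set


def average_precision(recommended: Sequence[Hashable],
                      relevant: Set[Hashable],
                      k: int = 500) -> float:
    """Average precision of one ranked list, truncated at k.

    Assumes `recommended` has no duplicate items. The score is normalized
    by the number of relevant items (an assumption for this sketch).
    """
    if not relevant:
        return 0.0
    hits = 0
    score = 0.0
    for rank, item in enumerate(recommended[:k], start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only where a hit occurs.
            score += hits / rank
    return score / len(relevant)


def mean_average_precision(recs: Dict[Hashable, Sequence[Hashable]],
                           truth: Dict[Hashable, Set[Hashable]],
                           k: int = 500) -> float:
    """Mean of per-user average precision over users with relevant items.

    Users with no recommendations score 0 rather than being skipped.
    """
    users = [u for u in truth if truth[u]]
    if not users:
        return 0.0
    total = sum(average_precision(recs.get(u, []), truth[u], k) for u in users)
    return total / len(users)


if __name__ == "__main__":
    # Hypothetical toy data: two users, a handful of song ids.
    toy_recs = {"u1": ["a", "b", "c"], "u2": ["x", "y"]}
    toy_truth = {"u1": {"a", "c"}, "u2": {"z"}}
    print(mean_average_precision(toy_recs, toy_truth, k=500))
```

With k set to 500, each user's list is cut to the same length as the model's top-500 output, and hits near the top of the list contribute more to the score than hits further down.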

Click Here for Report PDF

Click Here for GitHub Repository