TY - GEN
T1 - Playlist Classifier based on Song Features
AU - Agarwal, Arunima
AU - Poornalatha, G.
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Playlist curation is a tedious task and with the rise of various online music streaming platforms, there has been a significant increase in the demand for automatic playlist curation. Existing playlist classifiers consider either user taste or genre-based song information, which excludes an individual song's audio features. This paper attempts to use each song's audio features to classify it into a playlist. The playlists chosen are both genre-based playlists (rock, pop etc.) and user defined abstract playlists (workout, summer etc.). We compare various data sources for extracting the audio features for each song in a handpicked subset of Spotify's million playlist dataset. These features are analyzed for their relevance to playlist classification and then pre-processed, i.e. cleaned, formatted, and organized, thereby making them ready-to-go for Machine Learning models. Various machine learning models like SVM, Decision Tree, Random Forest, XGBoost and KNN are implemented and compared. A simple dense neural network with 3 hidden layers is also implemented and used for comparison. The implementation shows that trees based algorithms are most suitable for playlist classification problem. The highest median accuracy obtained was through implementing XGBoost algorithm and it was noted to be 93.9% with a standard deviation of 0.82%. Decision Tree and Random Forest algorithms were also observed to have similar accuracies.
AB - Playlist curation is a tedious task and with the rise of various online music streaming platforms, there has been a significant increase in the demand for automatic playlist curation. Existing playlist classifiers consider either user taste or genre-based song information, which excludes an individual song's audio features. This paper attempts to use each song's audio features to classify it into a playlist. The playlists chosen are both genre-based playlists (rock, pop etc.) and user defined abstract playlists (workout, summer etc.). We compare various data sources for extracting the audio features for each song in a handpicked subset of Spotify's million playlist dataset. These features are analyzed for their relevance to playlist classification and then pre-processed, i.e. cleaned, formatted, and organized, thereby making them ready-to-go for Machine Learning models. Various machine learning models like SVM, Decision Tree, Random Forest, XGBoost and KNN are implemented and compared. A simple dense neural network with 3 hidden layers is also implemented and used for comparison. The implementation shows that trees based algorithms are most suitable for playlist classification problem. The highest median accuracy obtained was through implementing XGBoost algorithm and it was noted to be 93.9% with a standard deviation of 0.82%. Decision Tree and Random Forest algorithms were also observed to have similar accuracies.
UR - http://www.scopus.com/inward/record.url?scp=85145441287&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85145441287&partnerID=8YFLogxK
U2 - 10.1109/GCAT55367.2022.9972120
DO - 10.1109/GCAT55367.2022.9972120
M3 - Conference contribution
AN - SCOPUS:85145441287
T3 - 2022 IEEE 3rd Global Conference for Advancement in Technology, GCAT 2022
BT - 2022 IEEE 3rd Global Conference for Advancement in Technology, GCAT 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE Global Conference for Advancement in Technology, GCAT 2022
Y2 - 7 October 2022 through 9 October 2022
ER -