TY - GEN
T1 - Comparative Analysis of Various SMS Spam Detection Methods using Machine Learning
AU - Ahluwalia, Kartik
AU - Gururaj, H. L.
AU - Rashmi, R.
AU - Lin, Hong
N1 - Publisher Copyright:
© WCSE 2023.All rights reserved.
PY - 2023
Y1 - 2023
N2 - The term SMS (Short Message Service) refers to a popular text messaging service that is commonly used in telephone, internet, and mobile device systems. This service relies on standardized communication protocols that enable short text messages to be exchanged between mobile devices. The increase in SMS spam messages can be attributed to the higher limit of free SMS allowed by Internet Service Providers (ISPs) SMS spam detection relies heavily on the presence of known words, phrases, abbreviations, and idioms commonly used in spam messages. Studies have developed various datasets to train and test SMS spam detection models and have used different classification techniques to improve the accuracy and efficiency of these models. In the present study, various classification techniques for SMS spam detection have been explored such as Naive Bayes, Support Vector Machines (SVM), Decision Trees, Random Forest, and Neural Networks. These techniques use different approaches to identify patterns and features in the messages that distinguish spam from legitimate messages. Among the various algorithms Naïve Bayes Classifier achieved a highest accuracy of 98.44% and Matthew Correlation Coefficients value of 0.93 for the dataset.
AB - The term SMS (Short Message Service) refers to a popular text messaging service that is commonly used in telephone, internet, and mobile device systems. This service relies on standardized communication protocols that enable short text messages to be exchanged between mobile devices. The increase in SMS spam messages can be attributed to the higher limit of free SMS allowed by Internet Service Providers (ISPs) SMS spam detection relies heavily on the presence of known words, phrases, abbreviations, and idioms commonly used in spam messages. Studies have developed various datasets to train and test SMS spam detection models and have used different classification techniques to improve the accuracy and efficiency of these models. In the present study, various classification techniques for SMS spam detection have been explored such as Naive Bayes, Support Vector Machines (SVM), Decision Trees, Random Forest, and Neural Networks. These techniques use different approaches to identify patterns and features in the messages that distinguish spam from legitimate messages. Among the various algorithms Naïve Bayes Classifier achieved a highest accuracy of 98.44% and Matthew Correlation Coefficients value of 0.93 for the dataset.
UR - http://www.scopus.com/inward/record.url?scp=85176429508&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85176429508&partnerID=8YFLogxK
U2 - 10.18178/wcse.2023.06.021
DO - 10.18178/wcse.2023.06.021
M3 - Conference contribution
AN - SCOPUS:85176429508
T3 - 13th International Workshop on Computer Science and Engineering, WCSE 2023
SP - 146
EP - 155
BT - 13th International Workshop on Computer Science and Engineering, WCSE 2023
PB - International Workshop on Computer Science and Engineering (WCSE)
T2 - 2023 13th International Workshop on Computer Science and Engineering, WCSE 2023
Y2 - 16 June 2023 through 18 June 2023
ER -