Persian SMS Spam Detection using Machine Learning and Deep Learning Techniques

Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing...

Full description

Saved in:

Bibliographic Details
Published in	International journal of Web research Vol. 5; no. 1; pp. 56 - 65
Main Authors	Roya Khorashadizade, Somayyeh Jafarali Jassbi, Alireza Yari
Format	Journal Article
Language	English
Published	University of science and culture 01.01.2022
Subjects	convolutional neural network lstm sms spam spam detection support vector machine
Online Access	Get full text
ISSN	2645-4343
DOI	10.22133/ijwr.2022.348824.1126

Cover

Loading…

Abstract	Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing and defrauding recipients are but a few of their downsides. Consequently, the automated identification of suspicious and spam messages is undoubtedly vitally important. Additionally, text messages which are smartly composed might be difficult to recognize. However, the present methodologies in this subject are hindered by the absence of adequate Persian datasets. A huge body of research and experiments has revealed that techniques based on deep and combined learning are superior at identifying unpleasant text messages. This work sought to develop an effective strategy for identifying SMS spam through utilizing combining machine learning classification algorithms together with deep learning models. After applying preprocessing on our gathered dataset, the suggested technique applies two convolutional neural network layers, the first of which being an LSTM layer, and the second one which is a fully connected layer to extract the data characteristics, thereby implementing the suggested deep learning approach. As part of the Machine Learning methodologies, the vector support machine makes use of the data and features at hand to determine the ultimate classification. Results indicate that the suggested model is implemented more effectively than the existing techniques, and an accuracy of 97.7% was achieved as a result.
AbstractList	Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing and defrauding recipients are but a few of their downsides. Consequently, the automated identification of suspicious and spam messages is undoubtedly vitally important. Additionally, text messages which are smartly composed might be difficult to recognize. However, the present methodologies in this subject are hindered by the absence of adequate Persian datasets. A huge body of research and experiments has revealed that techniques based on deep and combined learning are superior at identifying unpleasant text messages. This work sought to develop an effective strategy for identifying SMS spam through utilizing combining machine learning classification algorithms together with deep learning models. After applying preprocessing on our gathered dataset, the suggested technique applies two convolutional neural network layers, the first of which being an LSTM layer, and the second one which is a fully connected layer to extract the data characteristics, thereby implementing the suggested deep learning approach. As part of the Machine Learning methodologies, the vector support machine makes use of the data and features at hand to determine the ultimate classification. Results indicate that the suggested model is implemented more effectively than the existing techniques, and an accuracy of 97.7% was achieved as a result.
Author	Roya Khorashadizade Somayyeh Jafarali Jassbi Alireza Yari
Author_xml	– sequence: 1 fullname: Roya Khorashadizade organization: Department of Information Technology Science and Research Branch, Islamic Azad University Tehran, Iran – sequence: 2 fullname: Somayyeh Jafarali Jassbi organization: Department of Computer Engineering, Science and Research Branch, Islamic Azad University Tehran, Iran – sequence: 3 fullname: Alireza Yari organization: Iran telecom IT Research faculty, ICT research institute, Tehran, Iran research center
BookMark	eNqtzNtKAzEUheEgClbtK0heoGNm78zBaw8otCJM78NusqfN0CZjMkV8ew8IvoBXCz4W_4U4DTGwENelKgBKxBs_vKcCFECBum1BF2UJ9YmYQa2rhUaN52Ke86CUglvVompm4uWVU_YUZLfqZDfSQd7zxHbyMchj9mErV2R3PrBcMqXwDRTc14nHP1mz3QX_duR8Jc562mee_-6leH58WN89LVykwYzJHyh9mEje_EBMW0Np8nbPptoo6q21hOi0qnCDPTvXQlPWSLax-J-tT5QAYYQ
ContentType	Journal Article
DBID	DOA
DOI	10.22133/ijwr.2022.348824.1126
DatabaseName	DOAJ Directory of Open Access Journals
DatabaseTitleList
Database_xml	– sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISSN	2645-4343
EndPage	65
ExternalDocumentID	oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c
GroupedDBID	ALMA_UNASSIGNED_HOLDINGS GROUPED_DOAJ
ID	FETCH-doaj_primary_oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c3
IEDL.DBID	DOA
IngestDate	Wed Aug 27 01:04:28 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	1
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-doaj_primary_oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c3
OpenAccessLink	https://doaj.org/article/5b0afccca33d4053b3fedd827163ac7c
ParticipantIDs	doaj_primary_oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c
PublicationCentury	2000
PublicationDate	2022-01-01
PublicationDateYYYYMMDD	2022-01-01
PublicationDate_xml	– month: 01 year: 2022 text: 2022-01-01 day: 01
PublicationDecade	2020
PublicationTitle	International journal of Web research
PublicationYear	2022
Publisher	University of science and culture
Publisher_xml	– name: University of science and culture
SSID	ssj0002908307
Score	4.3380427
Snippet	Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience...
SourceID	doaj
SourceType	Open Website
StartPage	56
SubjectTerms	convolutional neural network lstm sms spam spam detection support vector machine
Title	Persian SMS Spam Detection using Machine Learning and Deep Learning Techniques
URI	https://doaj.org/article/5b0afccca33d4053b3fedd827163ac7c
Volume	5
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA6yJz2IT3yTg9e6ZZI-9uhrWYXupSvsraR5LArWohX_vjNJxXryoNcQkjCTzCP55gtj53magdaJi1QsXSQBdFTb2EROAiZjkFtQVOBczNPZg7xfJsvBV1-ECQv0wEFw46SOldM4jxAGgwtRC2eNyQHjfKF0psn6os8bJFNkg2GCoUWchZJgAEzExo9PH8T_CXAhcNOC9MUzP5j6vUuZbrHNPhbkl2EN22zNNjtsY8AQuMvmhFBHDfKyKHnZqmd-YzuPnmo4QdZXvPBoSMt7otQVV43BTrb9bll88bS-7bG76e3iehbRcqo2ME1UxP3sG1AiVS-R6jeJiH02al4ae8C4QmdNT18TU2dU4qqkwwNtUhVnKkUrfciu_j7f0X8McszWSTfhcuOEjbrXd3uK7r6rz7xmPwGJlq9D
linkProvider	Directory of Open Access Journals
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Persian+SMS+Spam+Detection+using+Machine+Learning+and+Deep+Learning+Techniques&rft.jtitle=International+journal+of+Web+research&rft.au=Roya+Khorashadizade&rft.au=Somayyeh+Jafarali+Jassbi&rft.au=Alireza+Yari&rft.date=2022-01-01&rft.pub=University+of+science+and+culture&rft.eissn=2645-4343&rft.volume=5&rft.issue=1&rft.spage=56&rft.epage=65&rft_id=info:doi/10.22133%2Fijwr.2022.348824.1126&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c