Persian SMS Spam Detection using Machine Learning and Deep Learning Techniques

Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of Web research Vol. 5; no. 1; pp. 56 - 65
Main Authors Roya Khorashadizade, Somayyeh Jafarali Jassbi, Alireza Yari
Format Journal Article
LanguageEnglish
Published University of science and culture 01.01.2022
Subjects
Online AccessGet full text
ISSN2645-4343
DOI10.22133/ijwr.2022.348824.1126

Cover

Loading…
Abstract Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing and defrauding recipients are but a few of their downsides. Consequently, the automated identification of suspicious and spam messages is undoubtedly vitally important. Additionally, text messages which are smartly composed might be difficult to recognize. However, the present methodologies in this subject are hindered by the absence of adequate Persian datasets. A huge body of research and experiments has revealed that techniques based on deep and combined learning are superior at identifying unpleasant text messages. This work sought to develop an effective strategy for identifying SMS spam through utilizing combining machine learning classification algorithms together with deep learning models. After applying preprocessing on our gathered dataset, the suggested technique applies two convolutional neural network layers, the first of which being an LSTM layer, and the second one which is a fully connected layer to extract the data characteristics, thereby implementing the suggested deep learning approach. As part of the Machine Learning methodologies, the vector support machine makes use of the data and features at hand to determine the ultimate classification. Results indicate that the suggested model is implemented more effectively than the existing techniques, and an accuracy of 97.7% was achieved as a result.
AbstractList Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing and defrauding recipients are but a few of their downsides. Consequently, the automated identification of suspicious and spam messages is undoubtedly vitally important. Additionally, text messages which are smartly composed might be difficult to recognize. However, the present methodologies in this subject are hindered by the absence of adequate Persian datasets. A huge body of research and experiments has revealed that techniques based on deep and combined learning are superior at identifying unpleasant text messages. This work sought to develop an effective strategy for identifying SMS spam through utilizing combining machine learning classification algorithms together with deep learning models. After applying preprocessing on our gathered dataset, the suggested technique applies two convolutional neural network layers, the first of which being an LSTM layer, and the second one which is a fully connected layer to extract the data characteristics, thereby implementing the suggested deep learning approach. As part of the Machine Learning methodologies, the vector support machine makes use of the data and features at hand to determine the ultimate classification. Results indicate that the suggested model is implemented more effectively than the existing techniques, and an accuracy of 97.7% was achieved as a result.
Author Roya Khorashadizade
Somayyeh Jafarali Jassbi
Alireza Yari
Author_xml – sequence: 1
  fullname: Roya Khorashadizade
  organization: Department of Information Technology Science and Research Branch, Islamic Azad University Tehran, Iran
– sequence: 2
  fullname: Somayyeh Jafarali Jassbi
  organization: Department of Computer Engineering, Science and Research Branch, Islamic Azad University Tehran, Iran
– sequence: 3
  fullname: Alireza Yari
  organization: Iran telecom IT Research faculty, ICT research institute, Tehran, Iran research center
BookMark eNqtzNtKAzEUheEgClbtK0heoGNm78zBaw8otCJM78NusqfN0CZjMkV8ew8IvoBXCz4W_4U4DTGwENelKgBKxBs_vKcCFECBum1BF2UJ9YmYQa2rhUaN52Ke86CUglvVompm4uWVU_YUZLfqZDfSQd7zxHbyMchj9mErV2R3PrBcMqXwDRTc14nHP1mz3QX_duR8Jc562mee_-6leH58WN89LVykwYzJHyh9mEje_EBMW0Np8nbPptoo6q21hOi0qnCDPTvXQlPWSLax-J-tT5QAYYQ
ContentType Journal Article
DBID DOA
DOI 10.22133/ijwr.2022.348824.1126
DatabaseName DOAJ Directory of Open Access Journals
DatabaseTitleList
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 2645-4343
EndPage 65
ExternalDocumentID oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c
GroupedDBID ALMA_UNASSIGNED_HOLDINGS
GROUPED_DOAJ
ID FETCH-doaj_primary_oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c3
IEDL.DBID DOA
IngestDate Wed Aug 27 01:04:28 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
LinkModel DirectLink
MergedId FETCHMERGED-doaj_primary_oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c3
OpenAccessLink https://doaj.org/article/5b0afccca33d4053b3fedd827163ac7c
ParticipantIDs doaj_primary_oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c
PublicationCentury 2000
PublicationDate 2022-01-01
PublicationDateYYYYMMDD 2022-01-01
PublicationDate_xml – month: 01
  year: 2022
  text: 2022-01-01
  day: 01
PublicationDecade 2020
PublicationTitle International journal of Web research
PublicationYear 2022
Publisher University of science and culture
Publisher_xml – name: University of science and culture
SSID ssj0002908307
Score 4.3380427
Snippet Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience...
SourceID doaj
SourceType Open Website
StartPage 56
SubjectTerms convolutional neural network
lstm
sms spam
spam detection
support vector machine
Title Persian SMS Spam Detection using Machine Learning and Deep Learning Techniques
URI https://doaj.org/article/5b0afccca33d4053b3fedd827163ac7c
Volume 5
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA6yJz2IT3yTg9e6ZZI-9uhrWYXupSvsraR5LArWohX_vjNJxXryoNcQkjCTzCP55gtj53magdaJi1QsXSQBdFTb2EROAiZjkFtQVOBczNPZg7xfJsvBV1-ECQv0wEFw46SOldM4jxAGgwtRC2eNyQHjfKF0psn6os8bJFNkg2GCoUWchZJgAEzExo9PH8T_CXAhcNOC9MUzP5j6vUuZbrHNPhbkl2EN22zNNjtsY8AQuMvmhFBHDfKyKHnZqmd-YzuPnmo4QdZXvPBoSMt7otQVV43BTrb9bll88bS-7bG76e3iehbRcqo2ME1UxP3sG1AiVS-R6jeJiH02al4ae8C4QmdNT18TU2dU4qqkwwNtUhVnKkUrfciu_j7f0X8McszWSTfhcuOEjbrXd3uK7r6rz7xmPwGJlq9D
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Persian+SMS+Spam+Detection+using+Machine+Learning+and+Deep+Learning+Techniques&rft.jtitle=International+journal+of+Web+research&rft.au=Roya+Khorashadizade&rft.au=Somayyeh+Jafarali+Jassbi&rft.au=Alireza+Yari&rft.date=2022-01-01&rft.pub=University+of+science+and+culture&rft.eissn=2645-4343&rft.volume=5&rft.issue=1&rft.spage=56&rft.epage=65&rft_id=info:doi/10.22133%2Fijwr.2022.348824.1126&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_5b0afccca33d4053b3fedd827163ac7c