A Survey of Existing E-mail Spam Filtering Methods Considering Machine Learning Techniques

α
Jinat Ara
Jinat Ara
σ
Hanif Bhuiyan
Hanif Bhuiyan
ρ
Akm Ashiquzzaman
Akm Ashiquzzaman
Ѡ
Tamanna Islam Juthi
Tamanna Islam Juthi
¥
Suzit Biswas
Suzit Biswas
α Southeast University Southeast University

Send Message

To: Author

A Survey of Existing E-mail Spam Filtering Methods Considering Machine Learning Techniques

Article Fingerprint

ReserarchID

CSTSDEA36WV

A Survey of Existing E-mail Spam Filtering Methods Considering Machine Learning Techniques Banner

AI TAKEAWAY

Connecting with the Eternal Ground
  • English
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese (Simplified)
  • Chinese (Traditional)
  • Corsican
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Frisian
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish (Kurmanji)
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Myanmar (Burmese)
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scots Gaelic
  • Serbian
  • Sesotho
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

Abstract

E-mail is one of the most secure medium for online communication and transferring data or messages through the web. An overgrowing increase in popularity, the number of unsolicited data has also increased rapidly. To filtering data, different approaches exist which automatically detect and remove these untenable messages. There are several numbers of email spam filtering technique such as Knowledge-based technique, Clustering techniques, Learning-based technique, Heuristic processes and so on. This paper illustrates a survey of different existing email spam filtering system regarding Machine Learning Technique (MLT) such as Naive Bayes, SVM, K-Nearest Neighbor, Bayes Additive Regression, KNN Tree, and rules. However, here we present the classification, evaluation and comparison of different email spam filtering system and summarize the overall scenario regarding accuracy rate of different existing approaches.

References

46 Cites in Article
  1. W Awad,S Elseuofi (2011). Machine Learning methods for E-mail Classification.
  2. O Saad,A Darwish,R Faraj (2012). PERFORMANCE OF MACHINE LEARNING TECHNIQUES FOR EMAIL SPAM FILTERING.
  3. Yingying Chen,Sourabh Jain,Vijay Adhikari,Zhi-Li Zhang,Kuai Xu (2011). A first look at inter-data center traffic characteristics via Yahoo! datasets.
  4. Kari Barlow,Jenny Lane (2007). Like technology from an advanced alien culture.
  5. Danyel Fisher,A Brush,Eric Gleave,Marc Smith (2006). Revisiting Whittaker & Sidner's "email overload" ten years later.
  6. Enrico Blanzieri,Anton Bryl (2008). A survey of learning-based techniques of email spam filtering.
  7. P Cunningham,N Nowlan,S Delany,M Haahr (2003). A case-based approach to spam filtering that can track concept drift.
  8. H Drucker,Donghui Wu,V Vapnik (1999). Support vector machines for spam categorization.
  9. M Sahami,S Dumais,D Heckerman,E Horvitz (1998). A Bayesian approach to filtering junk e-mail.
  10. Qiang Wang,Yi Guan,Xiaolong Wang (2006). SVM-Based Spam Filter with Active and Online Learning.
  11. S Mohammed,O Mohammed,J Fiaidhi,S Fong,T Kim (2013). Classifying Unsolicited Bulk Email (UBE) using Python Machine Learning Techniques.
  12. A Harisinghaney,A Dixit,S Gupta,A Arora (2014). Text and image based spam email classification using KNN, Naïve Bayes and Reverse DBSCAN algorithm.
  13. M Scholar (2010). Supervised learning approach for spam classification analysis using data mining tools.
  14. V Christina,S Karpagavalli,G Suganya (2010). A Study on Email Spam Filtering Techniques.
  15. V Metsis,I Androutsopoulos,G Paliouras (2006). Spam filtering with naive bayes-which naive bayes?.
  16. Ion Androutsopoulos,John Koutsias,Konstantinos Chandrinos,Constantine Spyropoulos (2000). An experimental comparison of naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages.
  17. J Hovold (2005). Naive Bayes Spam Filtering Using Word-Position-Based Attributes.
  18. José Hidalgo (2002). Evaluating cost-sensitive Unsolicited Bulk Email categorization.
  19. I Androutsopoulos,G Paliouras,V Karkaletsis,G Sakkis,C Spyropoulos,P Stamatopoulos (2000). Learning to filter spam e-mail: A comparison of a naive bayesian and a memory-based approach.
  20. Tom Fawcett (2003). "In vivo" spam filtering.
  21. C Wu,K Cheng,Q Zhu,Y Wu (2005). Using visual features for anti-spam filtering.
  22. G Cormack,J Gómez Hidalgo,E Sánz (2007). Spam filtering for short messages.
  23. T Subramaniam,H Jalab,A Taqa (2010). Overview of textual anti-spam filtering techniques.
  24. S Sharma,A Arora (2013). Adaptive approach for spam detection.
  25. M Banday,T Jan (2009). Effectiveness and limitations of statistical spam filters.
  26. P Chhabra,R Wadhvani,S Shukla (2010). Spam filtering using support vector machine.
  27. K Tretyakov (2004). Machine learning techniques in spam filtering.
  28. T Shahi,A Yadav (2013). Mobile SMS spam filtering for Nepali text using naïve bayesian and support vector machine.
  29. H Drucker,Donghui Wu,V Vapnik (1999). Support vector machines for spam categorization.
  30. G Suganya,M Arulprakash,V Dhayalan,V Sridevi,M Kavitha,C Srinivasan (2025). Sustainable Solid Waste Segregation Through IoT and Linear Regression.
  31. U Rahane,A Lande,O Bavikar,S Chavan,K Shedge International Journal of Engineering Sciences & Research Technology Advanced Filtering System to Protect OSN user Wall From Unwanted Messages.
  32. J Moody,C Darken (1989). Fast learning in networks of locally-tuned processing units.
  33. Megha Rathi,Vikas Pareek (2013). Spam Mail Detection through Data Mining – A Comparative Performance Analysis.
  34. P Graham (2002). A plan for spam.
  35. N Kang,C Domeniconi,D Barbará (2005). Categorization and keyword identification of unlabeled documents.
  36. V Singh,S Bhardwaj (2018). Spam Mail Detection Using Classification Techniques and Global Training Set.
  37. M Shafi'i Muhammad Abdulhamid,O Osho,I Ismaila,J Alhassan (2018). Comparative Analysis of Classification Algorithms for Email Spam Detection.
  38. U Sah,N Parmar (2017). An approach for Malicious Spam Detection in Email with comparison of different classifiers.
  39. T Verma (2017). E-Mail Spam Detection and Classification Using SVM and Feature Extraction.
  40. Nurul Rusland,Norfaradilla Wahid,Shahreen Kasim,Hanayanti Hafit (2017). Analysis of Naïve Bayes Algorithm for Email Spam Filtering across Multiple Datasets.
  41. A Yüksel,S Cankaya,İ Üncü (2017). Design of a Machine Learning Based Predictive Analytics System for Spam Problem.
  42. Neelam Choudhary,Ankit Jain (2017). Towards Filtering of SMS Spam Messages Using Machine Learning Based Technique.
  43. Dave Debarr,Harry Wechsler (2009). Spam detection using Random Boost.
  44. Thiago Guzella,Walmir Caminhas (2009). A review of machine learning approaches to Spam filtering.
  45. D Mavroeidis,K Chaidos,S Pirillos,D Christopoulos,M Vazirgiannis (2006). Using tri-training and support vector machines for addressing the ECML/PKDD 2006 discovery challenge.
  46. Bryan Klimt,Yiming Yang (2004). The Enron Corpus: A New Dataset for Email Classification Research.

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

How to Cite This Article

Jinat Ara. 2018. \u201cA Survey of Existing E-mail Spam Filtering Methods Considering Machine Learning Techniques\u201d. Global Journal of Computer Science and Technology - C: Software & Data Engineering GJCST-C Volume 18 (GJCST Volume 18 Issue C2): .

Download Citation

Issue Cover
GJCST Volume 18 Issue C2
Pg. 21- 29
Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Keywords
Classification
GJCST-C Classification: H.1.2
Version of record

v1.2

Issue date

May 25, 2018

Language
en
Experiance in AR

Explore published articles in an immersive Augmented Reality environment. Our platform converts research papers into interactive 3D books, allowing readers to view and interact with content using AR and VR compatible devices.

Read in 3D

Your published article is automatically converted into a realistic 3D book. Flip through pages and read research papers in a more engaging and interactive format.

Article Matrices
Total Views: 5927
Total Downloads: 1451
2026 Trends
Related Research

Published Article

E-mail is one of the most secure medium for online communication and transferring data or messages through the web. An overgrowing increase in popularity, the number of unsolicited data has also increased rapidly. To filtering data, different approaches exist which automatically detect and remove these untenable messages. There are several numbers of email spam filtering technique such as Knowledge-based technique, Clustering techniques, Learning-based technique, Heuristic processes and so on. This paper illustrates a survey of different existing email spam filtering system regarding Machine Learning Technique (MLT) such as Naive Bayes, SVM, K-Nearest Neighbor, Bayes Additive Regression, KNN Tree, and rules. However, here we present the classification, evaluation and comparison of different email spam filtering system and summarize the overall scenario regarding accuracy rate of different existing approaches.

Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

A Survey of Existing E-mail Spam Filtering Methods Considering Machine Learning Techniques

Hanif Bhuiyan
Hanif Bhuiyan
Akm Ashiquzzaman
Akm Ashiquzzaman
Tamanna Islam Juthi
Tamanna Islam Juthi
Suzit Biswas
Suzit Biswas
Jinat Ara
Jinat Ara Southeast University

Research Journals