Spam-Destroyer-NLP

Spam SMS/E-Mail Detection using Natural Language Processing

About the project / Summary :

This project uses NLP (Natural Language Processing) techniques in ML (Machine Learning) to classify an SMS or e-mail as spam or not spam. Text preprocessing covers tokenization, lemmatization, stop word removal, and punctuation removal, using NLTK and regex. Multinomial Naive Bayes and Logistic Regression models achieve an overall F1 score of 0.99. Handcrafted features such as number of digits, e-mail length, and number of punctuation marks were also engineered, and they further helped the predictions. Word clouds were generated for the different prediction classes.
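As a rough illustration of the handcrafted features mentioned above, the sketch below computes message length, digit count, and punctuation count for a single message. The function name `handcrafted_features` and the feature keys are hypothetical and are not taken from the notebook; the actual feature set used in the project may differ.

```python
import string


def handcrafted_features(message: str) -> dict:
    """Return simple count-based features for one message (illustrative only)."""
    return {
        # Total number of characters, including spaces
        "length": len(message),
        # Number of digit characters, e.g. in phone numbers or prize amounts
        "num_digits": sum(ch.isdigit() for ch in message),
        # Number of ASCII punctuation characters such as '!' or '.'
        "num_punctuation": sum(ch in string.punctuation for ch in message),
    }


if __name__ == "__main__":
    # Made-up example message, not taken from the dataset
    print(handcrafted_features("WIN a FREE prize!!! Call 08001234567 now."))
```

Features like these can be appended as extra numeric columns next to the text-derived features before training a classifier.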

Field : NLP (Natural Language Processing)
Tools : NLTK, regex, scikit-learn, Python
Concepts : tokenization, lemmatization, stop words, logistic regression, naive Bayes
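To make the preprocessing concepts concrete, here is a minimal sketch of lowercasing, regex-based punctuation removal, whitespace tokenization, and stop word removal using only the standard library. It is an assumption-laden stand-in, not the notebook's code: the `clean_text` name and the tiny `STOP_WORDS` set are hypothetical, the project itself uses NLTK (whose English stop word list is much longer), and lemmatization is omitted here because it needs NLTK's WordNet data.

```python
import re
from typing import List

# Tiny illustrative stop word list (assumption); NLTK's English list is much longer.
STOP_WORDS = {"a", "an", "the", "is", "to", "you", "and", "of", "for", "in"}


def clean_text(message: str) -> List[str]:
    """Lowercase, strip non-letters, tokenize on whitespace, drop stop words."""
    lowered = message.lower()
    # Replace every character that is not a lowercase letter or whitespace
    # with a space; this removes punctuation and also digits.
    letters_only = re.sub(r"[^a-z\s]", " ", lowered)
    # Split on runs of whitespace to get tokens
    tokens = letters_only.split()
    # Keep only tokens that are not in the stop word list
    return [token for token in tokens if token not in STOP_WORDS]


if __name__ == "__main__":
    # Made-up example message, not taken from the dataset
    print(clean_text("You have WON a prize, call now!"))
```

Because this cleaning step discards digits and punctuation, count-based features that rely on them would need to be computed on the raw message before cleaning.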

Results :

Other Visualizations :

Dataset :

https://www.kaggle.com/datasets/bagavathypriya/spam-ham-dataset (originally taken from the UCI Machine Learning Repository). Note that although the dataset is labeled as SMS, its messages closely resemble e-mail spam, and hence it can also be used to train a model to detect spam e-mails :)
