Remove Punctuation for Spam Filtering in Python| Tutorial: 02. How to Remove Punctuation for Spam Filtering in Python. Welcome to our second tutorial of series for SPAM Detection Tutorials for NLP.
In this tutorial we will discuss that how to How to Remove Punctuation for Spam Filtering in Python?. If you have not seen our previous Tutorial No: 1. We will suggest you please go through it. Click here for tutorial No: 1.
In this series of tutorials you will learn that How to import Dataset in python For Spam Detection Procedure as well as the other aspects in spam detection.
After these tutorials we would be able to create a model that would be enough intelligent to filter spams in SMS and Emails.
# Importing Mandatory Libraries
>>>import matplotlib.pyplot as plt|
>>>import numpy as np
>>>import pandas as pd
>>>import matplotlib.pyplot as plt
# Importing smsspam Dataset
>>> pd.set_option(‘display.max_colwidth’, 100)
>>> smsspam = pd.read_csv(‘SMSSpamCollection’, sep=“\t”, header=None)
>>> smsspam.columns = [‘label’, ‘msg’]
Defining remove_punctuation Method
txt_nopunct = “”.join([c for c in txt if c not in string.punctuation])
Adding New Column to output Table
>>> smsspam[‘msg_clean’] = smsspam[‘msg’].apply(lambda x: remove_punctuation(x))
Download SMS Spam Filtering Dataset