# Text Processing
### Author: Ehsan Gharib-Nezhad


<!-- Let's review some of the pre-processing steps for text data:

- Remove special characters
- Tokenizing
- Lemmatizing/Stemming
- Stop word removal

`CountVectorizer` actually can do a lot of this for us! It is important to keep these steps in mind in case you want to change the default methods used for each of these. -->

In [44]:
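# Load Libraries
from myfunctions import *  # project helper module (not shown here)
# Explicit imports for the names used below, in case myfunctions does not re-export them
import re
import numpy as np
import pandas as pd
from bs4 import BeautifulSoup  # used for removing HTML
from nltk.stem.porter import PorterStemmer
from nltk.tokenize import sent_tokenize, word_tokenize, RegexpTokenizer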


In [2]:
# Load datasets
df = pd.read_csv('../datasets/preprocessed_df_PandemicPreps_reddit_LAST.csv',index_col=0)

In [3]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33


### Data shape

In [4]:
df.shape

(2362, 9)

### Drop rows with selftext equal to '[removed]'

In [5]:
# percentage of rows whose selftext is "[removed]"
removed_pct = np.round(len(df[df['selftext']=='[removed]'])*100/len(df), 2)
print(f"percentage of rows with '[removed]' word: {removed_pct}%")

percentage of rows with '[removed]' word: 0.0%


In [6]:
# remove all rows with selftext = "[removed]"
df.drop(index=df[df['selftext']=='[removed]'].index, inplace=True)

In [7]:
df.reset_index(inplace=True, drop=True)

In [8]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33


### Drop rows with NaN in the selftext

In [9]:
# null percentage
df.isnull().sum()*100/len(df)

title           0.0
selftext        0.0
subreddit       0.0
created_utc     0.0
author          0.0
num_comments    0.0
score           0.0
is_self         0.0
timestamp       0.0
dtype: float64

In [10]:
#drop all rows with nulls
df.dropna(inplace=True)

In [11]:
# resetting the index
df.reset_index(inplace=True, drop = True)

In [12]:
# check for any remaining nulls
df.isna().sum()

title           0
selftext        0
subreddit       0
created_utc     0
author          0
num_comments    0
score           0
is_self         0
timestamp       0
dtype: int64

In [13]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33


In [14]:
df.shape

(2362, 9)

In [15]:
df['timestamp']

0       2020-02-29 16:08:58
1       2020-02-29 16:11:10
2       2020-02-29 16:28:15
3       2020-02-29 16:38:09
4       2020-02-29 17:19:33
               ...         
2357    2020-02-29 14:58:17
2358    2020-02-29 15:46:42
2359    2020-02-29 15:55:08
2360    2020-02-29 15:56:39
2361    2020-02-29 15:57:46
Name: timestamp, Length: 2362, dtype: object

### Lower Casing

In [16]:
df['post']  = df['selftext'].str.lower()

In [17]:
df['post']

0       anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't!  i took my under 10 aged child to walmart today and picked up some more to add, and it is a night and day difference from 3 days ago.  toilet paper gone.  paper towels gone.  lysol, down to 4 bottles.  hand sanitizer and alcohol, gone.  bleach was available but only a few rem...
1                                                                                                                                                                                                                                                            amazon is running low on cat and dog food, my normals are sold out. i feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.
2       because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pande

### Remove URLs / Website addresses

In [18]:
# Function for removing URLs
def remove_urls(text):
    url_pattern = re.compile(r'https?://\S+|www\.\S+')
    return url_pattern.sub(r'', text)

In [19]:
df['post'] = df['post'].map( remove_urls )
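
The pattern above stops at the first whitespace character, so for a Reddit markdown link of the form `[label](url)` it removes everything from `http` onward but leaves the opening `[` behind (row 2 of the cleaned output shows this). The following is a minimal sketch of one way to handle such links, assuming it is acceptable to keep the link label and drop the target. `strip_markdown_links`, `MD_LINK`, and `URL` are hypothetical names that are not part of the original notebook.

```python
import re

# Hypothetical patterns for this sketch only
MD_LINK = re.compile(r'\[([^\]]*)\]\((?:https?://|www\.)[^)\s]*\)')
URL = re.compile(r'https?://\S+|www\.\S+')

def strip_markdown_links(text):
    """Replace [label](url) with label, then remove any remaining bare URLs."""
    text = MD_LINK.sub(r'\1', text)   # keep the label, drop the link target
    return URL.sub('', text)          # the label may itself be a URL, so strip again

sample = ("see [http://example.com/a](http://example.com/a) "
          "and [this guide](https://example.com/b) now")
print(strip_markdown_links(sample))
```

URLs that themselves contain parentheses are not handled by this sketch; the target is assumed to end at the first `)`.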

### Removing special characters

In [20]:
df['post'] = (df['post']
              .replace(r'http\S+', '', regex=True)
              .replace(r'www\S+', '', regex=True)
              .replace(r'\n\n\S+', '', regex=True)  # also drops the first token after a blank line
              .replace(r'\n', '', regex=True)
              .replace(r'\*', '', regex=True))

In [21]:
df['post']

0       anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't!  i took my under 10 aged child to walmart today and picked up some more to add, and it is a night and day difference from 3 days ago.  toilet paper gone.  paper towels gone.  lysol, down to 4 bottles.  hand sanitizer and alcohol, gone.  bleach was available but only a few rem...
1                                                                                                                                                                                                                                                            amazon is running low on cat and dog food, my normals are sold out. i feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.
2                            because you just can't find it in stores any more... [  5 c 91% isopropyl alcohol 2 c aloe vera gel another receipt with essential oils [   1 tablespoon ru

### Find/Count emoji

In [22]:
import demoji

In [23]:
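def find_emoji(dataframe, print_option=False):
    """Count the entries of a text Series that contain at least one emoji."""
    # demoji.findall returns a dict of {emoji: description}; an empty dict means no emoji
    has_emoji = dataframe.map(lambda text: bool(demoji.findall(text)))
    if print_option:
        print(dataframe[has_emoji])
    return has_emoji.sum()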

In [24]:
find_emoji(df['post'])

54

### Remove emoji

In [25]:
def remove_emoji(dataframe):
    return dataframe.map(demoji.replace)

In [26]:
df['post'] = remove_emoji(df['post'])

### Convert emoji to text
All emojis are removed for the first part of the project, which is distinguishing two subreddits.
However, emojis are converted to text for sentiment analysis.

In [27]:
import emoji
def convert_emoji_to_text(text):
    return emoji.demojize(text)

In [28]:
# df['selftext'].iloc[0:10].map(convert_emoji_to_text)

### Removal of HTML tags

In [30]:
def remove_html(text):
    return BeautifulSoup(text, "lxml").text

In [31]:
df['post'] = df['post'].map(remove_html)

In [32]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58,"anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! i took my under 10 aged child to walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. toilet paper gone. paper towels gone. lysol, down to 4 bottles. hand sanitizer and alcohol, gone. bleach was available but only a few rem..."
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10,"amazon is running low on cat and dog food, my normals are sold out. i feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter."
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15,because you just can't find it in stores any more... [ 5 c 91% isopropyl alcohol 2 c aloe vera gel another receipt with essential oils [ 1 tablespoon rubbing alcohol or 2 tablespoons vodka 10 drops tea tree essential oil 10 drops lavender essential oil 1/4 cup aloe vera gel 1/2 teaspoon vitamin e oil (optional) a small bottle if i could just find some iso in the stores lol
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09,what the hell is going on. anyone around this area notice anything?
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33,"i see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. have a good balance of foods and learn the nutrients balance. vitamin supplements are very important because fresh produce can..."


### Replace all non-letters with space

In [33]:
def replace_all_non_letters_with_space(text):
    return re.sub("[^a-zA-Z]",  # Search for all non-letters
                  " ",          # Replace all non-letters with spaces
                  str(text))

In [34]:
df['post'] = df['post'].map(replace_all_non_letters_with_space)
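
Replacing every non-letter with a space splits contractions (in the output, "haven't" becomes "haven t") and leaves runs of consecutive spaces. The following is a small sketch of a variant that drops apostrophes before the substitution and collapses whitespace afterwards. `letters_only` is a hypothetical helper, not something the notebook defines.

```python
import re

def letters_only(text):
    """Keep only ASCII letters, joining contractions and collapsing whitespace."""
    text = re.sub("['\u2019]", "", str(text))    # drop straight and curly apostrophes
    text = re.sub(r"[^a-zA-Z]", " ", text)       # every other non-letter becomes a space
    return re.sub(r"\s+", " ", text).strip()     # collapse runs of spaces

print(letters_only("buy now if you haven't! lysol, down to 4 bottles."))
```

Joined forms such as `havent` may no longer match entries in a stop-word list, so whether this variant is preferable depends on the steps that follow.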

In [35]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58,anyone in this group should know to buy now if you haven t but for the sake of reinforcement buy now if you haven t i took my under aged child to walmart today and picked up some more to add and it is a night and day difference from days ago toilet paper gone paper towels gone lysol down to bottles hand sanitizer and alcohol gone bleach was available but only a few rem...
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10,amazon is running low on cat and dog food my normals are sold out i feel bad for the delivery guy dropping my shit off almost lbs of food and liter
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15,because you just can t find it in stores any more c isopropyl alcohol c aloe vera gel another receipt with essential oils tablespoon rubbing alcohol or tablespoons vodka drops tea tree essential oil drops lavender essential oil cup aloe vera gel teaspoon vitamin e oil optional a small bottle if i could just find some iso in the stores lol
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09,what the hell is going on anyone around this area notice anything
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33,i see a lot of people on here buying around the same types of stuff canned beans rice etc just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example you can t live off of only eating meat products have a good balance of foods and learn the nutrients balance vitamin supplements are very important because fresh produce can...


### Remove Stop Words

In [36]:
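def remove_stop_words(tokens):
    # expects an iterable of tokens; build the stop-word set once instead of once per token
    stop_set = set(stopwords.words('english'))
    return [token for token in tokens if token not in stop_set]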

In [37]:
# Import the stop-word list from the nltk library
from nltk.corpus import stopwords
STOPWORDS = set(stopwords.words('english'))

# Function to remove the stop words
# (named remove_stopwords so that it does not shadow the imported `stopwords` corpus)
def remove_stopwords(text):
    return " ".join([word for word in str(text).split() if word not in STOPWORDS])

# Remove the stop words from the 'post' column
df["post"] = df["post"].apply(remove_stopwords)
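
The filtering step can be tried in isolation, without downloading the NLTK corpus. The sketch below uses a small hand-written stop set purely for illustration; `SAMPLE_STOPWORDS` and `drop_stopwords` are hypothetical names, and the real notebook relies on NLTK's full English list.

```python
# Illustrative subset only; the notebook uses NLTK's full English stop-word list.
SAMPLE_STOPWORDS = {"is", "on", "and", "my", "are", "out", "i", "for", "the", "of"}

def drop_stopwords(text, stop_set=SAMPLE_STOPWORDS):
    """Return text with every whitespace-separated token found in stop_set removed."""
    return " ".join(word for word in str(text).split() if word not in stop_set)

print(drop_stopwords("amazon is running low on cat and dog food"))
```

Passing the stop set as a parameter makes it straightforward to extend the list with domain-specific words, and membership tests against a `set` stay fast as the list grows.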

In [38]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58,anyone group know buy sake reinforcement buy took aged child walmart today picked add night day difference days ago toilet paper gone paper towels gone lysol bottles hand sanitizer alcohol gone bleach available remained cold meds low even dish gloves ransacked hot dogs gone flour low go beans rice aisle child said coronavirus yes lady checking front joke bottles shampoo conditioner spouse thin...
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10,amazon running low cat dog food normals sold feel bad delivery guy dropping shit almost lbs food liter
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15,find stores c isopropyl alcohol c aloe vera gel another receipt essential oils tablespoon rubbing alcohol tablespoons vodka drops tea tree essential oil drops lavender essential oil cup aloe vera gel teaspoon vitamin e oil optional small bottle could find iso stores lol
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09,hell going anyone around area notice anything
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33,see lot people buying around types stuff canned beans rice etc remember know cook item buy combination foods right nutrients example live eating meat products good balance foods learn nutrients balance vitamin supplements important fresh produce go bad stored remember buy purpose good luck prepping guys


### Spelling Correction

In [41]:
def compare(corrected_text, original_text):
    """Compare the first corrected post with the first original post, word by word."""
    corrected_words = list(corrected_text)[0].split(' ')
    original_words = list(original_text)[0].split(' ')

    good = 0
    bad = 0
    # zip stops at the shorter list, so a length mismatch cannot raise an IndexError
    for corrected, original in zip(corrected_words, original_words):
        if corrected != original:
            bad += 1
            print(corrected, original)
        else:
            good += 1
    total = good + bad
    pct = np.round(bad * 100 / total, 1) if total else 0.0
    print(f'Number of unchanged words = {good},'
          f'\nNumber of corrected words = {bad},'
          f'\nCorrection percentage = {pct}%')
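
A position-by-position comparison assumes that both texts contain the same number of words. As a hedged alternative, the standard-library `difflib` module can align two word lists even when their lengths differ. `changed_words` is a hypothetical helper, and the sample strings are a made-up pair rather than actual spell-checker output.

```python
from difflib import SequenceMatcher

def changed_words(original, corrected):
    """Return (original_words, corrected_words) pairs for every differing stretch."""
    a, b = original.split(), corrected.split()
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    return [(a[i1:i2], b[j1:j2])
            for tag, i1, i2, j1, j2 in matcher.get_opcodes()
            if tag != "equal"]

# Made-up example pair, not real output from the spelling correction step
print(changed_words("food and liter for the cat", "food and litter for the cat"))
```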


In [42]:
from textblob import TextBlob

def correct_spell(original_text_df):
    # TextBlob.correct() returns a corrected copy of the text; it can be slow on large corpora
    return original_text_df.apply(lambda x: str(TextBlob(x).correct()))
    

In [43]:
# df['post'] = correct_spell(original_text_df=df['post'])

In [44]:
df.head()

Unnamed: 0,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post
0,It's going fast in TN,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",PandemicPreps,1583021338,BeautifulBalance1,18,1,True,2020-02-29 16:08:58,anyone group know buy sake reinforcement buy took aged child walmart today picked add night day difference days ago toilet paper gone paper towels gone lysol bottles hand sanitizer alcohol gone bleach available remained cold meds low even dish gloves ransacked hot dogs gone flour low go beans rice aisle child said coronavirus yes lady checking front joke bottles shampoo conditioner spouse thin...
1,Don’t forget about your pets!,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",PandemicPreps,1583021470,JKMSDE,12,1,True,2020-02-29 16:11:10,amazon running low cat dog food normals sold feel bad delivery guy dropping shit almost lbs food liter
2,DIY Hand Sanitizer,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,PandemicPreps,1583022495,AccidentalDragon,44,1,True,2020-02-29 16:28:15,find stores c isopropyl alcohol c aloe vera gel another receipt essential oils tablespoon rubbing alcohol tablespoons vodka drops tea tree essential oil drops lavender essential oil cup aloe vera gel teaspoon vitamin e oil optional small bottle could find iso stores lol
3,Anyone notice anything in the Denver / Boulder area? I see no concern and it concerns me!,What the hell is going on. Anyone around this area notice anything?,PandemicPreps,1583023089,Wizard_Knife_Fight,12,1,True,2020-02-29 16:38:09,hell going anyone around area notice anything
4,"PSA: don’t only buy a bunch of random foods, know how to cook it and get the proper nutrition.","I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",PandemicPreps,1583025573,cutting-alumination,56,1,True,2020-02-29 17:19:33,see lot people buying around types stuff canned beans rice etc remember know cook item buy combination foods right nutrients example live eating meat products good balance foods learn nutrients balance vitamin supplements important fresh produce go bad stored remember buy purpose good luck prepping guys


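### Stemming
Stemming reduces each word to a base form by stripping suffixes according to fixed rules. It tends to be cruder than lemmatization: the result is not always a dictionary word (for example, "anyone" becomes "anyon" in the output below).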
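
To make the idea of rule-based suffix stripping concrete, here is a toy sketch. It is not the Porter algorithm used in this notebook, which applies a much larger, ordered set of rules; `SUFFIXES` and `toy_stem` are hypothetical names chosen for illustration.

```python
SUFFIXES = ("ing", "ed", "es", "s")  # hypothetical and far smaller than Porter's rule set

def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word

print([toy_stem(w) for w in ["towels", "picked", "running", "rice"]])
```

The rules never consult a dictionary, which is why a stem such as `runn` can come out of this sketch. A lemmatizer would instead look the word up and return an actual base form.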

In [45]:
Pstemmizer = PorterStemmer()

In [46]:
def make_token(post):
    tokenizer = RegexpTokenizer(r'\w+')  # keep runs of word characters, dropping punctuation
    post_tokens = tokenizer.tokenize(post)
    stems = [Pstemmizer.stem(token) for token in post_tokens]
    return ' '.join(stems)
    

In [47]:
df['token'] = list(map(make_token,df['post']))

In [48]:
df[['selftext','post','token']]

Unnamed: 0,selftext,post,token
0,"Anyone in this group should know to buy now if you haven't, but for the sake of reinforcement - buy now if you haven't! I took my under 10 aged child to Walmart today and picked up some more to add, and it is a night and day difference from 3 days ago. Toilet paper gone. Paper towels gone. Lysol, down to 4 bottles. Hand sanitizer and alcohol, gone. Bleach was available but only a few rem...",anyone group know buy sake reinforcement buy took aged child walmart today picked add night day difference days ago toilet paper gone paper towels gone lysol bottles hand sanitizer alcohol gone bleach available remained cold meds low even dish gloves ransacked hot dogs gone flour low go beans rice aisle child said coronavirus yes lady checking front joke bottles shampoo conditioner spouse thin...,anyon group know buy sake reinforc buy took age child walmart today pick add night day differ day ago toilet paper gone paper towel gone lysol bottl hand sanit alcohol gone bleach avail remain cold med low even dish glove ransack hot dog gone flour low go bean rice aisl child said coronaviru ye ladi check front joke bottl shampoo condition spous think overplay taken video could see saw pleas g...
1,"Amazon is running low on cat and dog food, my normals are sold out. I feel bad for the delivery guy dropping my shit off, almost 300lbs of food and liter.",amazon running low cat dog food normals sold feel bad delivery guy dropping shit almost lbs food liter,amazon run low cat dog food normal sold feel bad deliveri guy drop shit almost lb food liter
2,Because you just can't find it in stores any more...\n\n [http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/](http://www.utahpreppers.com/2009/04/pandemic-preparedness-diy-sanitization/) \n\n* 5 c 91% isopropyl alcohol\n* 2 c aloe vera gel\n\nHere's another receipt with essential oils\n\n [https://www.asiaone.com/lifestyle/make-your-own-diy-hand-sanitizer](https://www....,find stores c isopropyl alcohol c aloe vera gel another receipt essential oils tablespoon rubbing alcohol tablespoons vodka drops tea tree essential oil drops lavender essential oil cup aloe vera gel teaspoon vitamin e oil optional small bottle could find iso stores lol,find store c isopropyl alcohol c alo vera gel anoth receipt essenti oil tablespoon rub alcohol tablespoon vodka drop tea tree essenti oil drop lavend essenti oil cup alo vera gel teaspoon vitamin e oil option small bottl could find iso store lol
3,What the hell is going on. Anyone around this area notice anything?,hell going anyone around area notice anything,hell go anyon around area notic anyth
4,"I see a lot of people on here buying around the same types of stuff (canned beans, rice, etc). Just remember to know how to cook each item you buy and which combination of foods will have the right nutrients for example: you can’t live off of only eating meat products. Have a good balance of foods and learn the nutrients balance. Vitamin supplements are very important because fresh produce can...",see lot people buying around types stuff canned beans rice etc remember know cook item buy combination foods right nutrients example live eating meat products good balance foods learn nutrients balance vitamin supplements important fresh produce go bad stored remember buy purpose good luck prepping guys,see lot peopl buy around type stuff can bean rice etc rememb know cook item buy combin food right nutrient exampl live eat meat product good balanc food learn nutrient balanc vitamin supplement import fresh produc go bad store rememb buy purpos good luck prep guy
...,...,...,...
2357,So right now I have about 50 days of food prepped for 2 people (50 each) at 2000 calories. \n\nI’m not sure if I should get more. I’ve got all my other supplies with just a few things here and there but nothing critical I need anymore.\n\nI’m just not sure if 50 days per person for me and my partner is enough or if I should get more. \n\nHow many days of food are you prepping for each person?,right days food prepped people calories sure get got supplies things nothing critical need anymore sure days per person partner enough get many days food prepping person,right day food prep peopl calori sure get got suppli thing noth critic need anymor sure day per person partner enough get mani day food prep person
2358,"\nEven if you avoid non essential travel, I'm sure some people might still need to fly long distances for various obligations.\n\nAre there guides or tips on what exactly one should do while going through security check, same sitting in the plane?\n\nFor instance, how to sanitize your plane seat?\n\nAny advice would be appreciated!!",even avoid non essential travel sure people might still need fly long distances various obligations guides tips exactly one going security check sitting plane instance sanitize plane seat advice would appreciated,even avoid non essenti travel sure peopl might still need fli long distanc variou oblig guid tip exactli one go secur check sit plane instanc sanit plane seat advic would appreci
2359,Why are many people expecting power outages and/or water contamination/loss of water during COVID-19 pandemic? \n\nIt’s an honest question and I’d like to understand people’s reasoning as maybe I’m overlooking something important. I keep hearing people say they are ready for the coming power outages or seeing it online.\n\nI’ve been following news and individual reports from hard hit areas and...,many people expecting power outages water contamination loss water covid pandemic honest question like understand people reasoning maybe overlooking something important keep hearing people say ready coming power outages seeing online following news individual reports hard hit areas quarantined areas appear shut power plants water treatment plants appear disruptions sars even ebola hit areas es...,mani peopl expect power outag water contamin loss water covid pandem honest question like understand peopl reason mayb overlook someth import keep hear peopl say readi come power outag see onlin follow news individu report hard hit area quarantin area appear shut power plant water treatment plant appear disrupt sar even ebola hit area essenti personnel critic servic alway work genuin wonder co...
2360,"My husband’s been an ER RN for 20 years. He’s also been a Type 1 diabetic since he was 6. Over the course of our 26-year marriage the subject of a pandemic has come up several times. We created a master list of our needs and over the years we’ve purchased the big items. Last month we refreshed our supply on old items, but we’re still banging our heads against the same and most important subjec...",husband er rn years also type diabetic since course year marriage subject pandemic come several times created master list needs years purchased big items last month refreshed supply old items still banging heads important subject solar mini fridge insulin lose power feel dumb overwhelmed try understand need house root cellar giant generator apartment dwellers second floor yard ability mount so...,husband er rn year also type diabet sinc cours year marriag subject pandem come sever time creat master list need year purchas big item last month refresh suppli old item still bang head import subject solar mini fridg insulin lose power feel dumb overwhelm tri understand need hous root cellar giant gener apart dweller second floor yard abil mount solar panel hope get advic someon tackl issu s...


## Save text-processed doc

In [54]:
# check nulls
df.isnull().sum()

title           0
selftext        0
subreddit       0
created_utc     0
author          0
num_comments    0
score           0
is_self         0
timestamp       0
post            0
token           0
dtype: int64

In [55]:
# Save processed data to be used for distinguishing with P
df.to_csv('../datasets/text_processed_PandemicPreps.csv')

# Restrict the time range to March 2020 through March 2021

In [56]:
df[ df['timestamp'] < '2021-03-09 16:20:54' ]['timestamp']

0       2020-02-29 16:08:58
1       2020-02-29 16:11:10
2       2020-02-29 16:28:15
3       2020-02-29 16:38:09
4       2020-02-29 17:19:33
               ...         
2357    2020-02-29 14:58:17
2358    2020-02-29 15:46:42
2359    2020-02-29 15:55:08
2360    2020-02-29 15:56:39
2361    2020-02-29 15:57:46
Name: timestamp, Length: 2260, dtype: object
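
The filter above compares timestamps as plain strings (the column has dtype `object`). That is safe only because the `YYYY-MM-DD HH:MM:SS` layout sorts lexicographically in chronological order. The sketch below is a minimal check of that assumption using the standard library; the sample timestamps are copied from posts shown in this notebook, and the names `FMT`, `stamps`, `by_string`, and `by_datetime` are illustrative only.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S"          # layout used by the `timestamp` column
cutoff = "2021-03-09 16:20:54"     # same cutoff as the filter above

# Sample timestamps copied from posts in this notebook.
stamps = ["2020-02-29 16:08:58", "2021-09-09 15:44:06", "2020-12-31 16:01:40"]

# String comparison, as done in the DataFrame filter.
by_string = [s for s in stamps if s < cutoff]

# The same filter after parsing each value into a real datetime.
cutoff_dt = datetime.strptime(cutoff, FMT)
by_datetime = [s for s in stamps if datetime.strptime(s, FMT) < cutoff_dt]

print(by_string == by_datetime)
```

If the column ever held a different layout (for example `MM/DD/YYYY`), the string comparison would no longer be reliable and parsing the values first would be the safer choice.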

In [57]:
df.shape

(2362, 11)

## Save text-processed doc

Check for nulls one last time.

In [54]:
df.isnull().sum()

title           0
selftext        0
subreddit       0
created_utc     0
author          0
num_comments    0
score           0
is_self         0
timestamp       0
post            0
token           0
dtype: int64

In [58]:
df.to_csv('../datasets/text_processed_PandemicPreps_Mar2020_Mar2021.csv')

### Sentence Tokenization

In [142]:
df['post'].map(sent_tokenize)

0    [i tested positive today but i’ve had symptoms for about four days (the worst day being day 2)., is anyone else experiencing ridiculous brain fog?, ever since this afternoon i have felt like i took a huge bong rip., i get confused easily and even lightheaded sometimes., at this point it’s my only symptom other than a stuffy nose., i honestly took me a while to type this because of the fog.]
1            [this morning i received a positive test result., i also have high anxiety and i’m really scared., i’m not sure when my symptoms started - i thought i had a sinus infection about a week ago, but i completely lost my taste and smell on monday., mild congestion, and that’s about it., does anyone have any idea where i could be in the timeline of things?, could i just have a mild case?]
Name: selftext, dtype: object

### Word Tokenization

In [200]:
word_token = df['selftext'].iloc[0:2].map(word_tokenize)

In [201]:
word_token

0    [i, tested, positive, today, but, i, ’, ve, had, symptoms, for, about, four, days, (, the, worst, day, being, day, 2, ), ., is, anyone, else, experiencing, ridiculous, brain, fog, ?, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, ., i, get, confused, easily, and, even, lightheaded, sometimes, ., at, this, point, it, ’, s, my, only, symptom, other, than, a, stuf...
1    [this, morning, i, received, a, positive, test, result, ., i, also, have, high, anxiety, and, i, ’, m, really, scared, ., i, ’, m, not, sure, when, my, symptoms, started, -, i, thought, i, had, a, sinus, infection, about, a, week, ago, ,, but, i, completely, lost, my, taste, and, smell, on, monday, ., mild, congestion, ,, and, that, ’, s, about, it, ., does, anyone, have, any, idea, where, i, ...
Name: selftext, dtype: object

### Tokenizing text

In [35]:
# Instantiate the RegexpTokenizer.
tokenizer = RegexpTokenizer(r'\w+')  # keep runs of word characters, which drops punctuation
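
To see what the `\w+` pattern keeps, the minimal sketch below uses the standard library's `re.findall` as a stand-in for `RegexpTokenizer` (an assumption made so the snippet runs without NLTK; the sample string is adapted from the first post). Because the typographic apostrophe is not a word character, a contraction such as `i’ve` is split into `i` and `ve`, which is why those fragments show up in the `token` column.

```python
import re

# Stand-in for RegexpTokenizer(r'\w+'): keep every run of word characters.
text = "i’ve had symptoms (day 2)."
tokens = re.findall(r"\w+", text)

print(tokens)
```

Parentheses, the full stop, and the apostrophe all disappear, while the digit `2` is kept because digits count as word characters.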

In [38]:
# post_tokens = df['post'].iloc[0:2].map(tokenizer.tokenize)

In [39]:
# spam_tokens = tokenizer.tokenize(df['selftext'][0:10])

In [40]:
post_tokens = [tokenizer.tokenize(text) for text in df['post']]

In [41]:
# spam_tokens

In [42]:
def tokenizing_from_DataFrame(text):
    # `text` is a single post (one string taken from a DataFrame column).
    return tokenizer.tokenize(text)

In [43]:
df['token'] = list(map(tokenizing_from_DataFrame,df['post']))

In [44]:
df

Unnamed: 0,index,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post,token
0,0,35/M DAY 4,I tested positive today but I’ve had symptoms for about four days (the worst day being day 2). Is anyone else experiencing ridiculous brain fog? Ever since this afternoon I have felt like I took a huge bong rip. I get confused easily and even lightheaded sometimes. At this point it’s my only symptom other than a stuffy nose. I honestly took me a while to type this because of the fog.,COVID19positive,1609459300,Humenoid,6,1,True,2020-12-31 16:01:40,i tested positive today but i’ve had symptoms for about four days (the worst day being day 2). is anyone else experiencing ridiculous brain fog? ever since this afternoon i have felt like i took a huge bong rip. i get confused easily and even lightheaded sometimes. at this point it’s my only symptom other than a stuffy nose. i honestly took me a while to type this because of the fog.,"[i, tested, positive, today, but, i, ve, had, symptoms, for, about, four, days, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, t..."
1,1,Tested positive after being extremely careful and no idea where I got it.,"This morning I received a positive test result. I also have high anxiety and I’m really scared. I’m not sure when my symptoms started - I thought I had a sinus infection about a week ago, but I completely lost my taste and smell on Monday. Mild congestion, and that’s about it. Does anyone have any idea where I could be in the timeline of things? Could I just have a mild case?",COVID19positive,1609459651,maddog1606,9,1,True,2020-12-31 16:07:31,"this morning i received a positive test result. i also have high anxiety and i’m really scared. i’m not sure when my symptoms started - i thought i had a sinus infection about a week ago, but i completely lost my taste and smell on monday. mild congestion, and that’s about it. does anyone have any idea where i could be in the timeline of things? could i just have a mild case?","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptoms, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, does, anyone, have, any, idea, where, i, could, be, in, the, timeline, ..."
2,2,Happy NYE! Several family members tested positive today.,"After begging over and over for our families to stay away from each other during Christmas/ Thanksgiving, countless people on my side, and my husbands side tested positive today. Of course, everyone got together on Christmas and thanksgiving. My Mother in law, father in law, brother in law, plus everyone else at their Christmas party tested positive. My MIL sounds AWFUL, dyspnea, shortness of ...",COVID19positive,1609460017,lemonprim3,2,1,True,2020-12-31 16:13:37,"after begging over and over for our families to stay away from each other during christmas/ thanksgiving, countless people on my side, and my husbands side tested positive today. of course, everyone got together on christmas and thanksgiving. my mother in law, father in law, brother in law, plus everyone else at their christmas party tested positive. my mil sounds awful, dyspnea, shortness of ...","[after, begging, over, and, over, for, our, families, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husbands, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, posit..."
3,3,Don’t trust rapid negative results,"Hi all,\n\nHappy New Year’s Eve!\n\nSo I think I caught the virus from indoor small family get together on Christmas Eve. Most of my family members got tested a few days before meeting up so it’s strange that I picked it up there but I been pretty good at staying home since Covid hit. My job has us working remotely since March so I only leave the house for errands etc.\n\nAnyhow on Monday (12...",COVID19positive,1609460574,TurnipSuccessful2188,7,1,True,2020-12-31 16:22:54,"hi all, new year’s eve! i think i caught the virus from indoor small family get together on christmas eve. most of my family members got tested a few days before meeting up so it’s strange that i picked it up there but i been pretty good at staying home since covid hit. my job has us working remotely since march so i only leave the house for errands etc. on monday (12/28) i woke up with a cou...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, members, got, tested, a, few, days, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, has, us, working, remotely, since, march, so, i, only, leave, th..."
4,4,What if I caught Covid while waiting to get the vaccine?,"I got my first dose of the Moderna Vaccine, but I had to wait in a room with about 15 people. The room was big. I had to get close to a couple of people. I got the vaccine on Tuesday night. Today I have a dry scratchy throat. I am very scared. How does that work? Will the vaccine help if I caught it exactly before the injection? I was wearing 5 masks and goggles and gloves.",COVID19positive,1609460716,AloneHeat,11,1,True,2020-12-31 16:25:16,"i got my first dose of the moderna vaccine, but i had to wait in a room with about 15 people. the room was big. i had to get close to a couple of people. i got the vaccine on tuesday night. today i have a dry scratchy throat. i am very scared. how does that work? will the vaccine help if i caught it exactly before the injection? i was wearing 5 masks and goggles and gloves.","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, was, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, does, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, w..."
...,...,...,...,...,...,...,...,...,...,...,...,...
16796,19481,"Just tested positive for COVID fully vaccinated, when my isolation is over am I able to see my boyfriend who tested positive a few days later than me and will still be in isolation?",I am not sure how this all works. I tested positive Sunday for COVID despite being fully vaccinated. My boyfriend started showing symptoms later and tested positive Tuesday. Am I able to see him when my isolation is over and he is still in isolation or should I wait until he is out?,COVID19positive,1631227446,bunnygirl1716,7,1,True,2021-09-09 15:44:06,i am not sure how this all works. i tested positive sunday for covid despite being fully vaccinated. my boyfriend started showing symptoms later and tested positive tuesday. am i able to see him when my isolation is over and he is still in isolation or should i wait until he is out?,"[i, am, not, sure, how, this, all, works, i, tested, positive, sunday, for, covid, despite, being, fully, vaccinated, my, boyfriend, started, showing, symptoms, later, and, tested, positive, tuesday, am, i, able, to, see, him, when, my, isolation, is, over, and, he, is, still, in, isolation, or, should, i, wait, until, he, is, out]"
16797,19482,"Has anyone tried taking cough syrup/Buckleys with Covid, and their body just throwing it back up?","Back in 2019, December 27 to be exact, I was the sickest I had ever been in my entire life. My chest hurt, it was hard to breathe, I coughed so much I popped blood vessels in my eyes, and I had no sense of smell. Completely bed ridden for 2 days, needing help to walk to the bathroom, sleeping 20h of the day.\n\nI was 25.\n\nThere's a part of me that thinks it was Covid, but seeing how it was i...",COVID19positive,1631227752,PsydemonCat,5,1,True,2021-09-09 15:49:12,"back in 2019, december 27 to be exact, i was the sickest i had ever been in my entire life. my chest hurt, it was hard to breathe, i coughed so much i popped blood vessels in my eyes, and i had no sense of smell. completely bed ridden for 2 days, needing help to walk to the bathroom, sleeping 20h of the day. was 25. a part of me that thinks it was covid, but seeing how it was in december, the ...","[back, in, 2019, december, 27, to, be, exact, i, was, the, sickest, i, had, ever, been, in, my, entire, life, my, chest, hurt, it, was, hard, to, breathe, i, coughed, so, much, i, popped, blood, vessels, in, my, eyes, and, i, had, no, sense, of, smell, completely, bed, ridden, for, 2, days, needing, help, to, walk, to, the, bathroom, sleeping, 20h, of, the, day, was, 25, a, part, of, me, that,..."
16798,19483,Alcohol after covid,"Just tried having a drink for the first time after recovering from covid about a month ago. Had food with the alcohol, but it seems to be hitting harder than normal. Curious what others have experienced.",COVID19positive,1631229654,waster02,12,1,True,2021-09-09 16:20:54,"just tried having a drink for the first time after recovering from covid about a month ago. had food with the alcohol, but it seems to be hitting harder than normal. curious what others have experienced.","[just, tried, having, a, drink, for, the, first, time, after, recovering, from, covid, about, a, month, ago, had, food, with, the, alcohol, but, it, seems, to, be, hitting, harder, than, normal, curious, what, others, have, experienced]"
16799,19484,Covid,Can you get reinfected again with covid after recently recovering from covid?My sister tested negative for covid like in the first or second week of August after dealing with it and her friend was tested positive today. She’s been spending time with her at the gym and at work since they both work together. My mom is still recovering from pneumonia but we were all tested negative a couple of we...,COVID19positive,1631230273,Huge_Commercial_9976,7,1,True,2021-09-09 16:31:13,can you get reinfected again with covid after recently recovering from covid?my sister tested negative for covid like in the first or second week of august after dealing with it and her friend was tested positive today. she’s been spending time with her at the gym and at work since they both work together. my mom is still recovering from pneumonia but we were all tested negative a couple of we...,"[can, you, get, reinfected, again, with, covid, after, recently, recovering, from, covid, my, sister, tested, negative, for, covid, like, in, the, first, or, second, week, of, august, after, dealing, with, it, and, her, friend, was, tested, positive, today, she, s, been, spending, time, with, her, at, the, gym, and, at, work, since, they, both, work, together, my, mom, is, still, recovering, f..."


### Lemmatizing
When we "lemmatize" data, we take words and attempt to return their lemma, or the base/dictionary form of a word.

In [45]:
# Import and instantiate the lemmatizer.
from nltk.stem import WordNetLemmatizer
lemmatizer = WordNetLemmatizer()

In [46]:
def lemitizing_from_Tokenlist(text):
    return [lemmatizer.lemmatize(i) for i in text]

In [47]:
df['token_lem'] = list(map(lemitizing_from_Tokenlist,df['token']))

In [48]:
df

Unnamed: 0,index,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post,token,token_lem
0,0,35/M DAY 4,I tested positive today but I’ve had symptoms for about four days (the worst day being day 2). Is anyone else experiencing ridiculous brain fog? Ever since this afternoon I have felt like I took a huge bong rip. I get confused easily and even lightheaded sometimes. At this point it’s my only symptom other than a stuffy nose. I honestly took me a while to type this because of the fog.,COVID19positive,1609459300,Humenoid,6,1,True,2020-12-31 16:01:40,i tested positive today but i’ve had symptoms for about four days (the worst day being day 2). is anyone else experiencing ridiculous brain fog? ever since this afternoon i have felt like i took a huge bong rip. i get confused easily and even lightheaded sometimes. at this point it’s my only symptom other than a stuffy nose. i honestly took me a while to type this because of the fog.,"[i, tested, positive, today, but, i, ve, had, symptoms, for, about, four, days, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, t...","[i, tested, positive, today, but, i, ve, had, symptom, for, about, four, day, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, too..."
1,1,Tested positive after being extremely careful and no idea where I got it.,"This morning I received a positive test result. I also have high anxiety and I’m really scared. I’m not sure when my symptoms started - I thought I had a sinus infection about a week ago, but I completely lost my taste and smell on Monday. Mild congestion, and that’s about it. Does anyone have any idea where I could be in the timeline of things? Could I just have a mild case?",COVID19positive,1609459651,maddog1606,9,1,True,2020-12-31 16:07:31,"this morning i received a positive test result. i also have high anxiety and i’m really scared. i’m not sure when my symptoms started - i thought i had a sinus infection about a week ago, but i completely lost my taste and smell on monday. mild congestion, and that’s about it. does anyone have any idea where i could be in the timeline of things? could i just have a mild case?","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptoms, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, does, anyone, have, any, idea, where, i, could, be, in, the, timeline, ...","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptom, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, doe, anyone, have, any, idea, where, i, could, be, in, the, timeline, of..."
2,2,Happy NYE! Several family members tested positive today.,"After begging over and over for our families to stay away from each other during Christmas/ Thanksgiving, countless people on my side, and my husbands side tested positive today. Of course, everyone got together on Christmas and thanksgiving. My Mother in law, father in law, brother in law, plus everyone else at their Christmas party tested positive. My MIL sounds AWFUL, dyspnea, shortness of ...",COVID19positive,1609460017,lemonprim3,2,1,True,2020-12-31 16:13:37,"after begging over and over for our families to stay away from each other during christmas/ thanksgiving, countless people on my side, and my husbands side tested positive today. of course, everyone got together on christmas and thanksgiving. my mother in law, father in law, brother in law, plus everyone else at their christmas party tested positive. my mil sounds awful, dyspnea, shortness of ...","[after, begging, over, and, over, for, our, families, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husbands, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, posit...","[after, begging, over, and, over, for, our, family, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husband, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, positive..."
3,3,Don’t trust rapid negative results,"Hi all,\n\nHappy New Year’s Eve!\n\nSo I think I caught the virus from indoor small family get together on Christmas Eve. Most of my family members got tested a few days before meeting up so it’s strange that I picked it up there but I been pretty good at staying home since Covid hit. My job has us working remotely since March so I only leave the house for errands etc.\n\nAnyhow on Monday (12...",COVID19positive,1609460574,TurnipSuccessful2188,7,1,True,2020-12-31 16:22:54,"hi all, new year’s eve! i think i caught the virus from indoor small family get together on christmas eve. most of my family members got tested a few days before meeting up so it’s strange that i picked it up there but i been pretty good at staying home since covid hit. my job has us working remotely since march so i only leave the house for errands etc. on monday (12/28) i woke up with a cou...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, members, got, tested, a, few, days, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, has, us, working, remotely, since, march, so, i, only, leave, th...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, member, got, tested, a, few, day, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, ha, u, working, remotely, since, march, so, i, only, leave, the, h..."
4,4,What if I caught Covid while waiting to get the vaccine?,"I got my first dose of the Moderna Vaccine, but I had to wait in a room with about 15 people. The room was big. I had to get close to a couple of people. I got the vaccine on Tuesday night. Today I have a dry scratchy throat. I am very scared. How does that work? Will the vaccine help if I caught it exactly before the injection? I was wearing 5 masks and goggles and gloves.",COVID19positive,1609460716,AloneHeat,11,1,True,2020-12-31 16:25:16,"i got my first dose of the moderna vaccine, but i had to wait in a room with about 15 people. the room was big. i had to get close to a couple of people. i got the vaccine on tuesday night. today i have a dry scratchy throat. i am very scared. how does that work? will the vaccine help if i caught it exactly before the injection? i was wearing 5 masks and goggles and gloves.","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, was, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, does, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, w...","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, wa, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, doe, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, wa,..."
...,...,...,...,...,...,...,...,...,...,...,...,...,...
16796,19481,"Just tested positive for COVID fully vaccinated, when my isolation is over am I able to see my boyfriend who tested positive a few days later than me and will still be in isolation?",I am not sure how this all works. I tested positive Sunday for COVID despite being fully vaccinated. My boyfriend started showing symptoms later and tested positive Tuesday. Am I able to see him when my isolation is over and he is still in isolation or should I wait until he is out?,COVID19positive,1631227446,bunnygirl1716,7,1,True,2021-09-09 15:44:06,i am not sure how this all works. i tested positive sunday for covid despite being fully vaccinated. my boyfriend started showing symptoms later and tested positive tuesday. am i able to see him when my isolation is over and he is still in isolation or should i wait until he is out?,"[i, am, not, sure, how, this, all, works, i, tested, positive, sunday, for, covid, despite, being, fully, vaccinated, my, boyfriend, started, showing, symptoms, later, and, tested, positive, tuesday, am, i, able, to, see, him, when, my, isolation, is, over, and, he, is, still, in, isolation, or, should, i, wait, until, he, is, out]","[i, am, not, sure, how, this, all, work, i, tested, positive, sunday, for, covid, despite, being, fully, vaccinated, my, boyfriend, started, showing, symptom, later, and, tested, positive, tuesday, am, i, able, to, see, him, when, my, isolation, is, over, and, he, is, still, in, isolation, or, should, i, wait, until, he, is, out]"
16797,19482,"Has anyone tried taking cough syrup/Buckleys with Covid, and their body just throwing it back up?","Back in 2019, December 27 to be exact, I was the sickest I had ever been in my entire life. My chest hurt, it was hard to breathe, I coughed so much I popped blood vessels in my eyes, and I had no sense of smell. Completely bed ridden for 2 days, needing help to walk to the bathroom, sleeping 20h of the day.\n\nI was 25.\n\nThere's a part of me that thinks it was Covid, but seeing how it was i...",COVID19positive,1631227752,PsydemonCat,5,1,True,2021-09-09 15:49:12,"back in 2019, december 27 to be exact, i was the sickest i had ever been in my entire life. my chest hurt, it was hard to breathe, i coughed so much i popped blood vessels in my eyes, and i had no sense of smell. completely bed ridden for 2 days, needing help to walk to the bathroom, sleeping 20h of the day. was 25. a part of me that thinks it was covid, but seeing how it was in december, the ...","[back, in, 2019, december, 27, to, be, exact, i, was, the, sickest, i, had, ever, been, in, my, entire, life, my, chest, hurt, it, was, hard, to, breathe, i, coughed, so, much, i, popped, blood, vessels, in, my, eyes, and, i, had, no, sense, of, smell, completely, bed, ridden, for, 2, days, needing, help, to, walk, to, the, bathroom, sleeping, 20h, of, the, day, was, 25, a, part, of, me, that,...","[back, in, 2019, december, 27, to, be, exact, i, wa, the, sickest, i, had, ever, been, in, my, entire, life, my, chest, hurt, it, wa, hard, to, breathe, i, coughed, so, much, i, popped, blood, vessel, in, my, eye, and, i, had, no, sense, of, smell, completely, bed, ridden, for, 2, day, needing, help, to, walk, to, the, bathroom, sleeping, 20h, of, the, day, wa, 25, a, part, of, me, that, think..."
16798,19483,Alcohol after covid,"Just tried having a drink for the first time after recovering from covid about a month ago. Had food with the alcohol, but it seems to be hitting harder than normal. Curious what others have experienced.",COVID19positive,1631229654,waster02,12,1,True,2021-09-09 16:20:54,"just tried having a drink for the first time after recovering from covid about a month ago. had food with the alcohol, but it seems to be hitting harder than normal. curious what others have experienced.","[just, tried, having, a, drink, for, the, first, time, after, recovering, from, covid, about, a, month, ago, had, food, with, the, alcohol, but, it, seems, to, be, hitting, harder, than, normal, curious, what, others, have, experienced]","[just, tried, having, a, drink, for, the, first, time, after, recovering, from, covid, about, a, month, ago, had, food, with, the, alcohol, but, it, seems, to, be, hitting, harder, than, normal, curious, what, others, have, experienced]"
16799,19484,Covid,Can you get reinfected again with covid after recently recovering from covid?My sister tested negative for covid like in the first or second week of August after dealing with it and her friend was tested positive today. She’s been spending time with her at the gym and at work since they both work together. My mom is still recovering from pneumonia but we were all tested negative a couple of we...,COVID19positive,1631230273,Huge_Commercial_9976,7,1,True,2021-09-09 16:31:13,can you get reinfected again with covid after recently recovering from covid?my sister tested negative for covid like in the first or second week of august after dealing with it and her friend was tested positive today. she’s been spending time with her at the gym and at work since they both work together. my mom is still recovering from pneumonia but we were all tested negative a couple of we...,"[can, you, get, reinfected, again, with, covid, after, recently, recovering, from, covid, my, sister, tested, negative, for, covid, like, in, the, first, or, second, week, of, august, after, dealing, with, it, and, her, friend, was, tested, positive, today, she, s, been, spending, time, with, her, at, the, gym, and, at, work, since, they, both, work, together, my, mom, is, still, recovering, f...","[can, you, get, reinfected, again, with, covid, after, recently, recovering, from, covid, my, sister, tested, negative, for, covid, like, in, the, first, or, second, week, of, august, after, dealing, with, it, and, her, friend, wa, tested, positive, today, she, s, been, spending, time, with, her, at, the, gym, and, at, work, since, they, both, work, together, my, mom, is, still, recovering, fr..."


In [49]:
def lemitizing_from_DataFrame(dataFrame):
    text = tokenizing_from_DataFrame(dataFrame)
    return [lemmatizer.lemmatize(i) for i in text]

In [50]:
df['token_lem'] = list(map(lemitizing_from_DataFrame,df['post']))

In [63]:
# df.head()

In [58]:
# Print only the tokens that lemmatization changed.

def lemitize_check(row):
    # `row` is a single DataFrame row (e.g. df.iloc[i]); compare its tokens pairwise.
    for token, lemma in zip(row['token'], row['token_lem']):
        if token != lemma:
            print(token, lemma)

In [59]:
# df['selftext'].iloc[44]

In [62]:
# [lemitize_check(df.iloc[i]) for i in range(len(df))]

### Stemming
When we "stem" text, we strip suffixes from each word by rule to approximate its base form. Stemming tends to be cruder than lemmatization: the result is often not a real word (for example, `positive` becomes `posit`).
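
To make the difference concrete, here is a minimal sketch that contrasts rule-based suffix stripping with a dictionary lookup. It is a toy illustration only: `toy_stem`, `toy_lemmatize`, `SUFFIXES`, and `TOY_LEMMAS` are hypothetical names, and the rules are far simpler than the Porter algorithm used in this notebook.

```python
# Toy illustration only: this is NOT the Porter algorithm used in the notebook.
SUFFIXES = ("ing", "ed", "s")  # hypothetical rule list, checked in order


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


# Hypothetical lookup table standing in for a lemmatizer's vocabulary.
TOY_LEMMAS = {"tested": "test", "days": "day", "families": "family", "this": "this"}


def toy_lemmatize(word):
    """Return the dictionary form if known, otherwise the word unchanged."""
    return TOY_LEMMAS.get(word, word)


words = ["tested", "days", "families", "this"]
stems = [toy_stem(w) for w in words]
lemmas = [toy_lemmatize(w) for w in words]
for word, stem, lemma in zip(words, stems, lemmas):
    print(word, stem, lemma)
```

The blind rules turn `this` into `thi` and `families` into `familie`, while the lookup returns real words. That is the trade-off: stemming needs no vocabulary but can produce non-words.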

In [70]:
Pstemmizer = PorterStemmer()

In [61]:
# Tokenize each document in a column and stem every token.
def stemizer_from_dataFrame(dataf):
    corpus = list(dataf)
    Pstemmizer = PorterStemmer()
    return [[Pstemmizer.stem(i) for i in word_tokenize(doc)] for doc in corpus]

In [71]:
def stemming_from_DataFrame(dataFrame):
    text = tokenizing_from_DataFrame(dataFrame)
    return [Pstemmizer.stem(i) for i in text]

In [72]:
# Stem the tokens of every post.
df['token_stem'] = list(map(stemming_from_DataFrame,df['post']))

In [73]:
df.head()

Unnamed: 0,index,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post,token,token_lem,token_stem
0,0,35/M DAY 4,I tested positive today but I’ve had symptoms for about four days (the worst day being day 2). Is anyone else experiencing ridiculous brain fog? Ever since this afternoon I have felt like I took a huge bong rip. I get confused easily and even lightheaded sometimes. At this point it’s my only symptom other than a stuffy nose. I honestly took me a while to type this because of the fog.,COVID19positive,1609459300,Humenoid,6,1,True,2020-12-31 16:01:40,i tested positive today but i’ve had symptoms for about four days (the worst day being day 2). is anyone else experiencing ridiculous brain fog? ever since this afternoon i have felt like i took a huge bong rip. i get confused easily and even lightheaded sometimes. at this point it’s my only symptom other than a stuffy nose. i honestly took me a while to type this because of the fog.,"[i, tested, positive, today, but, i, ve, had, symptoms, for, about, four, days, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, t...","[i, tested, positive, today, but, i, ve, had, symptom, for, about, four, day, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, too...","[i, test, posit, today, but, i, ve, had, symptom, for, about, four, day, the, worst, day, be, day, 2, is, anyon, els, experienc, ridicul, brain, fog, ever, sinc, thi, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confus, easili, and, even, lighthead, sometim, at, thi, point, it, s, my, onli, symptom, other, than, a, 
stuffi, nose, i, honestli, took, me, a, while, to, type..."
1,1,Tested positive after being extremely careful and no idea where I got it.,"This morning I received a positive test result. I also have high anxiety and I’m really scared. I’m not sure when my symptoms started - I thought I had a sinus infection about a week ago, but I completely lost my taste and smell on Monday. Mild congestion, and that’s about it. Does anyone have any idea where I could be in the timeline of things? Could I just have a mild case?",COVID19positive,1609459651,maddog1606,9,1,True,2020-12-31 16:07:31,"this morning i received a positive test result. i also have high anxiety and i’m really scared. i’m not sure when my symptoms started - i thought i had a sinus infection about a week ago, but i completely lost my taste and smell on monday. mild congestion, and that’s about it. does anyone have any idea where i could be in the timeline of things? could i just have a mild case?","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptoms, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, does, anyone, have, any, idea, where, i, could, be, in, the, timeline, ...","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptom, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, doe, anyone, have, any, idea, where, i, could, be, in, the, timeline, of...","[thi, morn, i, receiv, a, posit, test, result, i, also, have, high, anxieti, and, i, m, realli, scare, i, m, not, sure, when, my, symptom, start, i, thought, i, had, a, sinu, infect, about, a, week, ago, but, i, complet, lost, my, tast, and, smell, on, monday, mild, congest, and, that, 
s, about, it, doe, anyon, have, ani, idea, where, i, could, be, in, the, timelin, of, thing, could, i, just, ..."
2,2,Happy NYE! Several family members tested positive today.,"After begging over and over for our families to stay away from each other during Christmas/ Thanksgiving, countless people on my side, and my husbands side tested positive today. Of course, everyone got together on Christmas and thanksgiving. My Mother in law, father in law, brother in law, plus everyone else at their Christmas party tested positive. My MIL sounds AWFUL, dyspnea, shortness of ...",COVID19positive,1609460017,lemonprim3,2,1,True,2020-12-31 16:13:37,"after begging over and over for our families to stay away from each other during christmas/ thanksgiving, countless people on my side, and my husbands side tested positive today. of course, everyone got together on christmas and thanksgiving. my mother in law, father in law, brother in law, plus everyone else at their christmas party tested positive. my mil sounds awful, dyspnea, shortness of ...","[after, begging, over, and, over, for, our, families, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husbands, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, posit...","[after, begging, over, and, over, for, our, family, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husband, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, positive...","[after, beg, over, and, over, for, our, famili, to, stay, away, from, each, other, dure, christma, thanksgiv, countless, peopl, on, my, side, and, my, husband, side, test, posit, today, of, cours, everyon, got, togeth, on, christma, and, thanksgiv, my, mother, 
in, law, father, in, law, brother, in, law, plu, everyon, els, at, their, christma, parti, test, posit, my, mil, sound, aw, dyspnea, sh..."
3,3,Don’t trust rapid negative results,"Hi all,\n\nHappy New Year’s Eve!\n\nSo I think I caught the virus from indoor small family get together on Christmas Eve. Most of my family members got tested a few days before meeting up so it’s strange that I picked it up there but I been pretty good at staying home since Covid hit. My job has us working remotely since March so I only leave the house for errands etc.\n\nAnyhow on Monday (12...",COVID19positive,1609460574,TurnipSuccessful2188,7,1,True,2020-12-31 16:22:54,"hi all, new year’s eve! i think i caught the virus from indoor small family get together on christmas eve. most of my family members got tested a few days before meeting up so it’s strange that i picked it up there but i been pretty good at staying home since covid hit. my job has us working remotely since march so i only leave the house for errands etc. on monday (12/28) i woke up with a cou...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, members, got, tested, a, few, days, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, has, us, working, remotely, since, march, so, i, only, leave, th...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, member, got, tested, a, few, day, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, ha, u, working, remotely, since, march, so, i, only, leave, the, h...","[hi, all, new, year, s, eve, i, think, i, caught, the, viru, from, indoor, small, famili, get, togeth, on, christma, eve, most, of, my, famili, member, got, test, a, few, day, befor, meet, up, so, it, s, strang, that, i, pick, it, up, there, but, i, been, pretti, good, at, 
stay, home, sinc, covid, hit, my, job, ha, us, work, remot, sinc, march, so, i, onli, leav, the, hous, for, errand, etc, o..."
4,4,What if I caught Covid while waiting to get the vaccine?,"I got my first dose of the Moderna Vaccine, but I had to wait in a room with about 15 people. The room was big. I had to get close to a couple of people. I got the vaccine on Tuesday night. Today I have a dry scratchy throat. I am very scared. How does that work? Will the vaccine help if I caught it exactly before the injection? I was wearing 5 masks and goggles and gloves.",COVID19positive,1609460716,AloneHeat,11,1,True,2020-12-31 16:25:16,"i got my first dose of the moderna vaccine, but i had to wait in a room with about 15 people. the room was big. i had to get close to a couple of people. i got the vaccine on tuesday night. today i have a dry scratchy throat. i am very scared. how does that work? will the vaccine help if i caught it exactly before the injection? i was wearing 5 masks and goggles and gloves.","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, was, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, does, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, w...","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, wa, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, doe, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, wa,...","[i, got, my, first, dose, of, the, moderna, vaccin, but, i, had, to, wait, in, a, room, with, about, 15, peopl, the, room, wa, big, i, had, to, get, close, to, a, coupl, of, peopl, i, got, the, vaccin, on, tuesday, night, today, i, have, a, dri, scratchi, throat, i, am, veri, scare, how, doe, that, work, 
will, the, vaccin, help, if, i, caught, it, exactli, befor, the, inject, i, wa, wear, 5, m..."


### Compare token to token_stem

In [76]:
corpus_token=list(df['token'])

In [77]:
corpus_stem=list(df['token_stem'])

In [82]:
corpus_stem[0][1]

'test'

In [88]:
for i in range(40):
    if corpus_stem[0][i] != corpus_token[0][i]:
        print(( corpus_stem[0][i], corpus_token[0][i]))

('test', 'tested')
('posit', 'positive')
('symptom', 'symptoms')
('day', 'days')
('be', 'being')
('anyon', 'anyone')
('els', 'else')
('experienc', 'experiencing')
('ridicul', 'ridiculous')
('sinc', 'since')
('thi', 'this')
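
The loop above indexes the first 40 positions directly, which would raise an `IndexError` on a post with fewer than 40 tokens. A minimal sketch of the same comparison using `zip`, which stops at the shorter list, is shown here. `changed_pairs` is a hypothetical helper name, and the sample lists are copied from the first post's output above.

```python
def changed_pairs(tokens, stems):
    """Return (stem, token) pairs, in order, where stemming changed the token."""
    return [(stem, token) for token, stem in zip(tokens, stems) if stem != token]


# First tokens of the first post and their stems, copied from the output above.
tokens = ["i", "tested", "positive", "today", "but", "i", "ve", "had", "symptoms"]
stems = ["i", "test", "posit", "today", "but", "i", "ve", "had", "symptom"]

pairs = changed_pairs(tokens, stems)
print(pairs)
```

With the notebook's columns, the call would be `changed_pairs(corpus_token[0], corpus_stem[0])`.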


## Remove stop words

In [92]:
# Remove English stop words from a list of tokens.
# no_stop_words = [token for token in word if token not in stopwords.words('english')]

In [109]:
df

Unnamed: 0,index,title,selftext,subreddit,created_utc,author,num_comments,score,is_self,timestamp,post,token,token_lem,token_stem,token_noStopWords
0,0,35/M DAY 4,I tested positive today but I’ve had symptoms for about four days (the worst day being day 2). Is anyone else experiencing ridiculous brain fog? Ever since this afternoon I have felt like I took a huge bong rip. I get confused easily and even lightheaded sometimes. At this point it’s my only symptom other than a stuffy nose. I honestly took me a while to type this because of the fog.,COVID19positive,1609459300,Humenoid,6,1,True,2020-12-31 16:01:40,i tested positive today but i’ve had symptoms for about four days (the worst day being day 2). is anyone else experiencing ridiculous brain fog? ever since this afternoon i have felt like i took a huge bong rip. i get confused easily and even lightheaded sometimes. at this point it’s my only symptom other than a stuffy nose. i honestly took me a while to type this because of the fog.,"[i, tested, positive, today, but, i, ve, had, symptoms, for, about, four, days, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, t...","[i, tested, positive, today, but, i, ve, had, symptom, for, about, four, day, the, worst, day, being, day, 2, is, anyone, else, experiencing, ridiculous, brain, fog, ever, since, this, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confused, easily, and, even, lightheaded, sometimes, at, this, point, it, s, my, only, symptom, other, than, a, stuffy, nose, i, honestly, too...","[i, test, posit, today, but, i, ve, had, symptom, for, about, four, day, the, worst, day, be, day, 2, is, anyon, els, experienc, ridicul, brain, fog, ever, sinc, thi, afternoon, i, have, felt, like, i, took, a, huge, bong, rip, i, get, confus, easili, and, even, lighthead, sometim, at, thi, point, it, s, my, onli, symptom, other, than, a, 
stuffi, nose, i, honestli, took, me, a, while, to, type...",tested positive today i’ve symptoms four days (the worst day day 2). anyone else experiencing ridiculous brain fog? ever since afternoon felt like took huge bong rip. get confused easily even lightheaded sometimes. point it’s symptom stuffy nose. honestly took type fog.
1,1,Tested positive after being extremely careful and no idea where I got it.,"This morning I received a positive test result. I also have high anxiety and I’m really scared. I’m not sure when my symptoms started - I thought I had a sinus infection about a week ago, but I completely lost my taste and smell on Monday. Mild congestion, and that’s about it. Does anyone have any idea where I could be in the timeline of things? Could I just have a mild case?",COVID19positive,1609459651,maddog1606,9,1,True,2020-12-31 16:07:31,"this morning i received a positive test result. i also have high anxiety and i’m really scared. i’m not sure when my symptoms started - i thought i had a sinus infection about a week ago, but i completely lost my taste and smell on monday. mild congestion, and that’s about it. does anyone have any idea where i could be in the timeline of things? could i just have a mild case?","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptoms, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, does, anyone, have, any, idea, where, i, could, be, in, the, timeline, ...","[this, morning, i, received, a, positive, test, result, i, also, have, high, anxiety, and, i, m, really, scared, i, m, not, sure, when, my, symptom, started, i, thought, i, had, a, sinus, infection, about, a, week, ago, but, i, completely, lost, my, taste, and, smell, on, monday, mild, congestion, and, that, s, about, it, doe, anyone, have, any, idea, where, i, could, be, in, the, timeline, of...","[thi, morn, i, receiv, a, posit, test, result, i, also, have, high, anxieti, and, i, m, realli, scare, i, m, not, sure, when, my, symptom, start, i, thought, i, had, a, sinu, infect, about, a, week, ago, but, i, complet, lost, my, tast, and, smell, on, monday, mild, congest, and, that, 
s, about, it, doe, anyon, have, ani, idea, where, i, could, be, in, the, timelin, of, thing, could, i, just, ...","morning received positive test result. also high anxiety i’m really scared. i’m sure symptoms started - thought sinus infection week ago, completely lost taste smell monday. mild congestion, that’s it. anyone idea could timeline things? could mild case?"
2,2,Happy NYE! Several family members tested positive today.,"After begging over and over for our families to stay away from each other during Christmas/ Thanksgiving, countless people on my side, and my husbands side tested positive today. Of course, everyone got together on Christmas and thanksgiving. My Mother in law, father in law, brother in law, plus everyone else at their Christmas party tested positive. My MIL sounds AWFUL, dyspnea, shortness of ...",COVID19positive,1609460017,lemonprim3,2,1,True,2020-12-31 16:13:37,"after begging over and over for our families to stay away from each other during christmas/ thanksgiving, countless people on my side, and my husbands side tested positive today. of course, everyone got together on christmas and thanksgiving. my mother in law, father in law, brother in law, plus everyone else at their christmas party tested positive. my mil sounds awful, dyspnea, shortness of ...","[after, begging, over, and, over, for, our, families, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husbands, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, posit...","[after, begging, over, and, over, for, our, family, to, stay, away, from, each, other, during, christmas, thanksgiving, countless, people, on, my, side, and, my, husband, side, tested, positive, today, of, course, everyone, got, together, on, christmas, and, thanksgiving, my, mother, in, law, father, in, law, brother, in, law, plus, everyone, else, at, their, christmas, party, tested, positive...","[after, beg, over, and, over, for, our, famili, to, stay, away, from, each, other, dure, christma, thanksgiv, countless, peopl, on, my, side, and, my, husband, side, test, posit, today, of, cours, everyon, got, togeth, on, christma, and, thanksgiv, my, mother, 
in, law, father, in, law, brother, in, law, plu, everyon, els, at, their, christma, parti, test, posit, my, mil, sound, aw, dyspnea, sh...","begging families stay away christmas/ thanksgiving, countless people side, husbands side tested positive today. course, everyone got together christmas thanksgiving. mother law, father law, brother law, plus everyone else christmas party tested positive. mil sounds awful, dyspnea, shortness breath every word. still think it’s “just like flu.” side family, grandmother lupus, epilepsy, copd rece..."
3,3,Don’t trust rapid negative results,"Hi all,\n\nHappy New Year’s Eve!\n\nSo I think I caught the virus from indoor small family get together on Christmas Eve. Most of my family members got tested a few days before meeting up so it’s strange that I picked it up there but I been pretty good at staying home since Covid hit. My job has us working remotely since March so I only leave the house for errands etc.\n\nAnyhow on Monday (12...",COVID19positive,1609460574,TurnipSuccessful2188,7,1,True,2020-12-31 16:22:54,"hi all, new year’s eve! i think i caught the virus from indoor small family get together on christmas eve. most of my family members got tested a few days before meeting up so it’s strange that i picked it up there but i been pretty good at staying home since covid hit. my job has us working remotely since march so i only leave the house for errands etc. on monday (12/28) i woke up with a cou...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, members, got, tested, a, few, days, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, has, us, working, remotely, since, march, so, i, only, leave, th...","[hi, all, new, year, s, eve, i, think, i, caught, the, virus, from, indoor, small, family, get, together, on, christmas, eve, most, of, my, family, member, got, tested, a, few, day, before, meeting, up, so, it, s, strange, that, i, picked, it, up, there, but, i, been, pretty, good, at, staying, home, since, covid, hit, my, job, ha, u, working, remotely, since, march, so, i, only, leave, the, h...","[hi, all, new, year, s, eve, i, think, i, caught, the, viru, from, indoor, small, famili, get, togeth, on, christma, eve, most, of, my, famili, member, got, test, a, few, day, befor, meet, up, so, it, s, strang, that, i, pick, it, up, there, but, i, been, pretti, good, at, 
stay, home, sinc, covid, hit, my, job, ha, us, work, remot, sinc, march, so, i, onli, leav, the, hous, for, errand, etc, o...","hi all, new year’s eve! think caught virus indoor small family get together christmas eve. family members got tested days meeting it’s strange picked pretty good staying home since covid hit. job us working remotely since march leave house errands etc. monday (12/28) woke cough chest pain felt different immediately went urgent care get tested. waiting approx. 2 hour get tested cold, dr tells s..."
4,4,What if I caught Covid while waiting to get the vaccine?,"I got my first dose of the Moderna Vaccine, but I had to wait in a room with about 15 people. The room was big. I had to get close to a couple of people. I got the vaccine on Tuesday night. Today I have a dry scratchy throat. I am very scared. How does that work? Will the vaccine help if I caught it exactly before the injection? I was wearing 5 masks and goggles and gloves.",COVID19positive,1609460716,AloneHeat,11,1,True,2020-12-31 16:25:16,"i got my first dose of the moderna vaccine, but i had to wait in a room with about 15 people. the room was big. i had to get close to a couple of people. i got the vaccine on tuesday night. today i have a dry scratchy throat. i am very scared. how does that work? will the vaccine help if i caught it exactly before the injection? i was wearing 5 masks and goggles and gloves.","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, was, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, does, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, w...","[i, got, my, first, dose, of, the, moderna, vaccine, but, i, had, to, wait, in, a, room, with, about, 15, people, the, room, wa, big, i, had, to, get, close, to, a, couple, of, people, i, got, the, vaccine, on, tuesday, night, today, i, have, a, dry, scratchy, throat, i, am, very, scared, how, doe, that, work, will, the, vaccine, help, if, i, caught, it, exactly, before, the, injection, i, wa,...","[i, got, my, first, dose, of, the, moderna, vaccin, but, i, had, to, wait, in, a, room, with, about, 15, peopl, the, room, wa, big, i, had, to, get, close, to, a, coupl, of, peopl, i, got, the, vaccin, on, tuesday, night, today, i, have, a, dri, scratchi, throat, i, am, veri, scare, how, doe, that, work, 
will, the, vaccin, help, if, i, caught, it, exactli, befor, the, inject, i, wa, wear, 5, m...","got first dose moderna vaccine, wait room 15 people. room big. get close couple people. got vaccine tuesday night. today dry scratchy throat. scared. work? vaccine help caught exactly injection? wearing 5 masks goggles gloves."
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
16796,19481,"Just tested positive for COVID fully vaccinated, when my isolation is over am I able to see my boyfriend who tested positive a few days later than me and will still be in isolation?",I am not sure how this all works. I tested positive Sunday for COVID despite being fully vaccinated. My boyfriend started showing symptoms later and tested positive Tuesday. Am I able to see him when my isolation is over and he is still in isolation or should I wait until he is out?,COVID19positive,1631227446,bunnygirl1716,7,1,True,2021-09-09 15:44:06,i am not sure how this all works. i tested positive sunday for covid despite being fully vaccinated. my boyfriend started showing symptoms later and tested positive tuesday. am i able to see him when my isolation is over and he is still in isolation or should i wait until he is out?,"[i, am, not, sure, how, this, all, works, i, tested, positive, sunday, for, covid, despite, being, fully, vaccinated, my, boyfriend, started, showing, symptoms, later, and, tested, positive, tuesday, am, i, able, to, see, him, when, my, isolation, is, over, and, he, is, still, in, isolation, or, should, i, wait, until, he, is, out]","[i, am, not, sure, how, this, all, work, i, tested, positive, sunday, for, covid, despite, being, fully, vaccinated, my, boyfriend, started, showing, symptom, later, and, tested, positive, tuesday, am, i, able, to, see, him, when, my, isolation, is, over, and, he, is, still, in, isolation, or, should, i, wait, until, he, is, out]","[i, am, not, sure, how, thi, all, work, i, test, posit, sunday, for, covid, despit, be, fulli, vaccin, my, boyfriend, start, show, symptom, later, and, test, posit, tuesday, am, i, abl, to, see, him, when, my, isol, is, over, and, he, is, still, in, isol, or, should, i, wait, until, he, is, out]",sure works. tested positive sunday covid despite fully vaccinated. boyfriend started showing symptoms later tested positive tuesday. able see isolation still isolation wait out?
16797,19482,"Has anyone tried taking cough syrup/Buckleys with Covid, and their body just throwing it back up?","Back in 2019, December 27 to be exact, I was the sickest I had ever been in my entire life. My chest hurt, it was hard to breathe, I coughed so much I popped blood vessels in my eyes, and I had no sense of smell. Completely bed ridden for 2 days, needing help to walk to the bathroom, sleeping 20h of the day.\n\nI was 25.\n\nThere's a part of me that thinks it was Covid, but seeing how it was i...",COVID19positive,1631227752,PsydemonCat,5,1,True,2021-09-09 15:49:12,"back in 2019, december 27 to be exact, i was the sickest i had ever been in my entire life. my chest hurt, it was hard to breathe, i coughed so much i popped blood vessels in my eyes, and i had no sense of smell. completely bed ridden for 2 days, needing help to walk to the bathroom, sleeping 20h of the day. was 25. a part of me that thinks it was covid, but seeing how it was in december, the ...","[back, in, 2019, december, 27, to, be, exact, i, was, the, sickest, i, had, ever, been, in, my, entire, life, my, chest, hurt, it, was, hard, to, breathe, i, coughed, so, much, i, popped, blood, vessels, in, my, eyes, and, i, had, no, sense, of, smell, completely, bed, ridden, for, 2, days, needing, help, to, walk, to, the, bathroom, sleeping, 20h, of, the, day, was, 25, a, part, of, me, that,...","[back, in, 2019, december, 27, to, be, exact, i, wa, the, sickest, i, had, ever, been, in, my, entire, life, my, chest, hurt, it, wa, hard, to, breathe, i, coughed, so, much, i, popped, blood, vessel, in, my, eye, and, i, had, no, sense, of, smell, completely, bed, ridden, for, 2, day, needing, help, to, walk, to, the, bathroom, sleeping, 20h, of, the, day, wa, 25, a, part, of, me, that, think...","[back, in, 2019, decemb, 27, to, be, exact, i, wa, the, sickest, i, had, ever, been, in, my, entir, life, my, chest, hurt, it, wa, hard, to, breath, i, cough, so, much, i, pop, blood, vessel, in, my, eye, and, 
i, had, no, sens, of, smell, complet, bed, ridden, for, 2, day, need, help, to, walk, to, the, bathroom, sleep, 20h, of, the, day, wa, 25, a, part, of, me, that, think, it, wa, covid, bu...","back 2019, december 27 exact, sickest ever entire life. chest hurt, hard breathe, coughed much popped blood vessels eyes, sense smell. completely bed ridden 2 days, needing help walk bathroom, sleeping 20h day. 25. part thinks covid, seeing december, world tells wasn't. thing is, also lived heavy tourism city, especially china. typical tourists. end day, i'm wondering.. whenever tried taking b..."
16798,19483,Alcohol after covid,"Just tried having a drink for the first time after recovering from covid about a month ago. Had food with the alcohol, but it seems to be hitting harder than normal. Curious what others have experienced.",COVID19positive,1631229654,waster02,12,1,True,2021-09-09 16:20:54,"just tried having a drink for the first time after recovering from covid about a month ago. had food with the alcohol, but it seems to be hitting harder than normal. curious what others have experienced.","[just, tried, having, a, drink, for, the, first, time, after, recovering, from, covid, about, a, month, ago, had, food, with, the, alcohol, but, it, seems, to, be, hitting, harder, than, normal, curious, what, others, have, experienced]","[just, tried, having, a, drink, for, the, first, time, after, recovering, from, covid, about, a, month, ago, had, food, with, the, alcohol, but, it, seems, to, be, hitting, harder, than, normal, curious, what, others, have, experienced]","[just, tri, have, a, drink, for, the, first, time, after, recov, from, covid, about, a, month, ago, had, food, with, the, alcohol, but, it, seem, to, be, hit, harder, than, normal, curiou, what, other, have, experienc]","tried drink first time recovering covid month ago. food alcohol, seems hitting harder normal. curious others experienced."
16799,19484,Covid,Can you get reinfected again with covid after recently recovering from covid?My sister tested negative for covid like in the first or second week of August after dealing with it and her friend was tested positive today. She’s been spending time with her at the gym and at work since they both work together. My mom is still recovering from pneumonia but we were all tested negative a couple of we...,COVID19positive,1631230273,Huge_Commercial_9976,7,1,True,2021-09-09 16:31:13,can you get reinfected again with covid after recently recovering from covid?my sister tested negative for covid like in the first or second week of august after dealing with it and her friend was tested positive today. she’s been spending time with her at the gym and at work since they both work together. my mom is still recovering from pneumonia but we were all tested negative a couple of we...,"[can, you, get, reinfected, again, with, covid, after, recently, recovering, from, covid, my, sister, tested, negative, for, covid, like, in, the, first, or, second, week, of, august, after, dealing, with, it, and, her, friend, was, tested, positive, today, she, s, been, spending, time, with, her, at, the, gym, and, at, work, since, they, both, work, together, my, mom, is, still, recovering, f...","[can, you, get, reinfected, again, with, covid, after, recently, recovering, from, covid, my, sister, tested, negative, for, covid, like, in, the, first, or, second, week, of, august, after, dealing, with, it, and, her, friend, wa, tested, positive, today, she, s, been, spending, time, with, her, at, the, gym, and, at, work, since, they, both, work, together, my, mom, is, still, recovering, fr...","[can, you, get, reinfect, again, with, covid, after, recent, recov, from, covid, my, sister, test, neg, for, covid, like, in, the, first, or, second, week, of, august, after, deal, with, it, and, her, friend, wa, test, posit, today, she, s, been, spend, time, with, her, at, the, gym, and, at, work, 
sinc, they, both, work, togeth, my, mom, is, still, recov, from, pneumonia, but, we, were, all, ...",get reinfected covid recently recovering covid?my sister tested negative covid like first second week august dealing friend tested positive today. she’s spending time gym work since work together. mom still recovering pneumonia tested negative couple weeks ago.


In [101]:
print(stopwords.words('english'))

['i', 'me', 'my', 'myself', 'we', 'our', 'ours', 'ourselves', 'you', "you're", "you've", "you'll", "you'd", 'your', 'yours', 'yourself', 'yourselves', 'he', 'him', 'his', 'himself', 'she', "she's", 'her', 'hers', 'herself', 'it', "it's", 'its', 'itself', 'they', 'them', 'their', 'theirs', 'themselves', 'what', 'which', 'who', 'whom', 'this', 'that', "that'll", 'these', 'those', 'am', 'is', 'are', 'was', 'were', 'be', 'been', 'being', 'have', 'has', 'had', 'having', 'do', 'does', 'did', 'doing', 'a', 'an', 'the', 'and', 'but', 'if', 'or', 'because', 'as', 'until', 'while', 'of', 'at', 'by', 'for', 'with', 'about', 'against', 'between', 'into', 'through', 'during', 'before', 'after', 'above', 'below', 'to', 'from', 'up', 'down', 'in', 'out', 'on', 'off', 'over', 'under', 'again', 'further', 'then', 'once', 'here', 'there', 'when', 'where', 'why', 'how', 'all', 'any', 'both', 'each', 'few', 'more', 'most', 'other', 'some', 'such', 'no', 'nor', 'not', 'only', 'own', 'same', 'so', 'than', '
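
The cell that built `token_noStopWords` is not shown. The column's contents suggest the post was split on whitespace and filtered against the stop word list, since words with attached punctuation such as `(the` survive. The following is a minimal sketch under that assumption. `remove_stop_words` is a hypothetical helper, and `STOP_WORDS` is a small stand-in for `stopwords.words('english')` so the example runs without NLTK data.

```python
# Small hypothetical stand-in for stopwords.words('english').
STOP_WORDS = {"i", "but", "had", "for", "about", "the", "being", "is"}


def remove_stop_words(post, stop_words=STOP_WORDS):
    """Drop whitespace-separated words found in stop_words and rejoin the rest."""
    return " ".join(word for word in post.split() if word not in stop_words)


post = "i tested positive today but i’ve had symptoms for about four days (the worst day being day 2)."
cleaned = remove_stop_words(post)
print(cleaned)
```

Filtering the `token` column instead of the raw `post` string would also drop cases like `(the`, because those tokens no longer carry punctuation.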

## Check nulls one last time

In [119]:
df.isnull().sum()

index                0
title                0
selftext             0
subreddit            0
created_utc          0
author               0
num_comments         0
score                0
is_self              0
timestamp            0
post                 0
token                0
token_lem            0
token_stem           0
token_noStopWords    0
dtype: int64

In [118]:
df.dropna(inplace=True)

## Save text-processed doc

In [120]:
df.to_csv('../datasets/text_processed_covid19positive.csv')
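
One caveat when loading this file later: CSV has no list type, so list-valued columns such as `token` are stored as text. The sketch below uses only the standard library and assumes each list is written as its Python string representation. It shows the round trip and one way to recover the list with `ast.literal_eval`.

```python
import ast
import csv
import io

tokens = ["i", "tested", "positive"]

# Write one row whose second field is the string form of a list.
buffer = io.StringIO()
csv.writer(buffer).writerow(["0", str(tokens)])

# Read it back: the field is now a string, not a list.
buffer.seek(0)
row = next(csv.reader(buffer))
restored = ast.literal_eval(row[1])  # safely parse the list literal
print(type(row[1]).__name__, restored)
```

After reading the saved file with pandas, the same parsing could be applied per column, for example `df['token'].apply(ast.literal_eval)`.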