# Original Data Dataframe

This notebook gathers the collected data and organizes it into separate DataFrames, one per subreddit. Basic descriptive statistics are then computed on each DataFrame. The notebook will be automatically updated as more data is gathered by the API web scrapers.

Note:
1. I began collecting the mental health subreddit data after a few iterations of collecting the armed forces subreddits.
2. I did not want to perform any pre-processing on this data here; this notebook only organizes it and outputs it to the EDA folder.


## Import Data

In [44]:
#Libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import os

In [45]:
#Import data

#create function to import and create grouped csv files

def process_and_output_csvs(input_folder, output_folder):
    # Create the output folder if it doesn't exist
    os.makedirs(output_folder, exist_ok=True)

    all_data = []
    
    # Iterate through each file in the input folder
    for filename in os.listdir(input_folder):
        if filename.endswith('.csv'):
            file_path = os.path.join(input_folder, filename)
            try:
                # Read the CSV file and append to the list
                data = pd.read_csv(file_path)
                all_data.append(data)
            except Exception as e:
                print(f"Error processing file {filename}: {e}")
    # Check if all_data is not empty
    if not all_data:
        print("No data to process. Check if the folder contains CSV files or if they are readable.")
        return

    # Concatenate all dataframes
    concatenated_df = pd.concat(all_data, ignore_index=True)
    
    # Drop duplicate rows
    concatenated_df.drop_duplicates(inplace=True)

    # Group by subreddit and create/update separate files
    for subreddit, group in concatenated_df.groupby('subreddit'):
        output_file = os.path.join(output_folder, f"{subreddit}.csv")
        if os.path.exists(output_file):
            # Read existing file, drop duplicates with the new data, and then append
            existing_data = pd.read_csv(output_file)
            combined_data = pd.concat([existing_data, group]).drop_duplicates()
            combined_data.to_csv(output_file, index=False)
        else:
            # If file doesn't exist, write with header
            group.to_csv(output_file, index=False)

In [46]:
input_folder = '../1_DATA_COLLECTION/data/'
output_folder = '../2_EDA/data/'
process_and_output_csvs(input_folder, output_folder)
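
The per-subreddit split that `process_and_output_csvs` performs can be illustrated on toy data. This is a minimal sketch, not the scraper output: the `posts` frame and its values are made up, and a temporary directory stands in for the EDA folder.

```python
import os
import tempfile

import pandas as pd

# Toy stand-in for the scraped posts; the values are invented for illustration.
posts = pd.DataFrame({
    "subreddit": ["Army", "Army", "USMC", "Army"],
    "title": ["post a", "post b", "post c", "post a"],
})

# Drop exact duplicate rows (the last row repeats the first one).
posts = posts.drop_duplicates()

with tempfile.TemporaryDirectory() as out_dir:
    # Write one CSV per subreddit, named after the subreddit value.
    for subreddit, group in posts.groupby("subreddit"):
        group.to_csv(os.path.join(out_dir, f"{subreddit}.csv"), index=False)

    written = sorted(os.listdir(out_dir))
    army_rows = len(pd.read_csv(os.path.join(out_dir, "Army.csv")))

print(written)    # ['Army.csv', 'USMC.csv']
print(army_rows)  # 2
```

Note that the output file name takes its capitalization directly from the `subreddit` column.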


# Subreddit Statistics

## Army Statistics

In [47]:
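#Import data
#The file is written as f"{subreddit}.csv" and the subreddit value is 'Army',
#so match that capitalization (required on case-sensitive filesystems)
army_df = pd.read_csv('../2_EDA/data/Army.csv')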

#Check data
army_df.shape

(1411, 118)

In [48]:
army_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,Army,This was a convo I had with one of my buddies ...,t2_3vmh30ad,False,,0,False,If you could create a new MRE based on a Fast ...,[],...,,,,,,,,,,
1,,Army,BLUF: how do you overcome imposter syndrome?\n...,t2_9mqncmmb,False,,0,False,how do you even Army?,[],...,,,,,,,,,,
2,,Army,"Long story short, my estranged (soon to be ex)...",t2_ag69n7u7,False,,0,False,Command Directed No-Contact Order?,[],...,,,,,,,,,,
3,,Army,"\nMy husband is 35T, and just graduated AIT. W...",t2_lb56g2zm,False,,0,False,Anyone 35T?,[],...,,,,,,,,,,
4,,Army,I could use some advice on going recruiting. I...,t2_i4rellgt,False,,0,False,Thinking of going recruiter as brand new E5,[],...,,,,,,,,,,
5,,Army,At my new duty station in Massachusetts and al...,t2_msv2hc5h,False,,0,False,"Well everyone, I’m alone for the holidays and ...",[],...,,,,,,,,,,
6,,Army,Does anyone have experience with submitting th...,t2_9t3i5665,False,,0,False,DD Form 368 has started its way up. Looking fo...,[],...,,,,,,,,,,
7,,Army,Also what does ‘Standard Excess’ refer to for ...,t2_s45e278w,False,,0,False,What does ‘Permanent Change of Station’ refer ...,[],...,,,,,,,,,,
8,,Army,Any tips or advice. I'm in osut about to turn ...,t2_odrxc0fs,False,,0,False,Advice for 17 year old,[],...,,,,,,,,,,
9,,Army,Would anyone be able to describe the process o...,t2_128y15,False,,0,False,Space Force NCO to Army Warrant Officer,[],...,,,,,,,,,,


In [49]:
army_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
1401,,Army,"Hello Everybody,\n\n&amp;#x200B;\n\nI have Que...",t2_3g13gdg,False,,0,False,Green To Gold ADO,[],...,,,,,,,,,,
1402,,Army,I have a “buddy” who smoked pot the second day...,t2_p1ahrwd2,False,,0,False,Block Leave UA,[],...,,,,,,,,,,
1403,,Army,"So when I first came to ait, I had signed up f...",t2_9cknp6ny,False,,0,False,Leaving Ait,[],...,,,,,,,,,,
1404,,Army,15R at Ft Eustis. Anyone here down to play hit...,t2_73oq3pex,False,,0,False,Made it to AIT looking for some guys to play s...,[],...,,,,,,,,,,
1405,,Army,Hey y’all I’m a 17F and recently enlisted in t...,t2_hzy2zrbd,False,,0,False,Switching over from reserves to active,[],...,,,,,,,,,,
1406,,Army,Just got a 30k bonus as a dumb SPC. Any sugges...,t2_fhsjacf2,False,,0,False,Money advice,[],...,,,,,,,,,,
1407,,Army,"15D aircraft power train repairer. SPC, been i...",t2_pk2un9j5t,False,,0,False,What are my options?,[],...,,,,,,,,,,
1408,,Army,Same as title. Joined DEP today but watched my...,t2_f11wxbnm,False,,0,False,Joined DEP today but having second thoughts. C...,[],...,,,,,,,,,,
1409,,Army,Have h/w in a week. Beefed up during the holid...,t2_ecut3h7x,False,,0,False,Any tips to pass height weight/tape,[],...,,,,,,,,,,
1410,,Army,,t2_fdaozdoo,False,,0,False,Does intense acne where a male would have a be...,[],...,,,,,True,,,,,


In [50]:
army_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list      object
crosspost_parent           object
collections               float64
Length: 118, dtype: object
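
Because the `dtypes` listing is truncated for 118 columns, a more compact summary can help. The sketch below uses a small made-up frame (`toy`, hypothetical) rather than `army_df`; it counts columns per dtype and computes the share of missing values per column.

```python
import numpy as np
import pandas as pd

# Hypothetical miniature frame standing in for a subreddit DataFrame.
toy = pd.DataFrame({
    "selftext": ["text", None, "more text", None],
    "gilded": [0, 0, 0, 0],
    "approved_at_utc": [np.nan] * 4,
})

# Number of columns per dtype, a compact alternative to the full listing.
dtype_counts = toy.dtypes.astype(str).value_counts()

# Fraction of missing values per column, largest first.
missing_share = toy.isna().mean().sort_values(ascending=False)

print(dtype_counts)
print(missing_share)
```

The same two lines could be applied to `army_df` or any of the other frames, assuming they are loaded as above.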

## Marine Statistics

In [51]:
usmc_df = pd.read_csv('../2_EDA/data/USMC.csv')

usmc_df.shape

(2602, 118)

In [52]:
usmc_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,USMC,"Say about 4 of these per platoon, this could b...",t2_b5wp1wrw,False,,0,False,Thoughts on this?,[],...,{'images': [{'source': {'url': 'https://extern...,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
1,,USMC,,t2_jtw07360x,False,,0,False,"Kareem Nikoui at Pumpkin Rock Trail in Norco, ...",[],...,,True,"{'u4r57pz4d48c1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': 'u4r57pz4d48c1', 'id':...",,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2,,USMC,I was reminiscing about the days before the wa...,t2_1v59fmwl,False,,0,False,Who was in Kuwait in 2003 when the first air r...,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
3,,USMC,"Okinawa, Japan Summer 1988 toured a bit with m...",t2_8k13yaso,False,,0,False,TBT-,[],...,,True,"{'7z0w8pxj648c1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': '7z0w8pxj648c1', 'id':...",,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
4,,USMC,"Hello everyone, to preface this I’m not a mari...",t2_15giq1d5,False,,0,False,Trying to find information about my grandfather,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
5,,USMC,Are they still stocked next to the duty huts o...,t2_ciwdqgh1p,False,,0,False,Xyience,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
6,,USMC,,t2_vqsnwh2p,False,,0,False,Who else remembers watching this guy on the Hi...,[],...,{'images': [{'source': {'url': 'https://extern...,,,,,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
7,,USMC,"For those who've used the GI Bill, would you r...",t2_442i5uw4,False,,0,False,GI Bill BAH and Dorm,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
8,,USMC,Ladies and gentleman it is with great pleasure...,t2_2rne3tva,False,,0,False,I'm Free,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
9,,USMC,What was the scoring system for rifle qual in ...,t2_nr078offv,False,,0,False,Rifle qual scoring in the '70's,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,


In [53]:
usmc_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
2592,,USMC,You guys ready for that post 96 / post leave d...,t2_i12nfibl,False,,0,False,Post Leave,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
2593,,USMC,The fitness threads remind me that I have a re...,t2_56c0emq1,False,,0,False,Reunions,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
2594,,USMC,"Gents, 1stLt coming up at the end of my first ...",t2_3ybqvdp0,False,,0,False,Raider Support Battalion,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
2595,,USMC,It seems a lot of us (but not all) share the s...,t2_7mkf1mcs,False,,0,False,Tired of it,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
2596,,USMC,,t2_7silw2k1,False,,0,False,Keep the population number at the same amount ...,[],...,{'images': [{'source': {'url': 'https://previe...,,,,,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2597,,USMC,"Devil dogs with DDD how bad is it, and how bad...",t2_9rgra7h7,False,,0,False,Degenerative Disc Disease,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
2598,,USMC,https://theaviationgeekclub.com/the-usmc-deems...,t2_t8aqt,False,,0,False,"Yes I know I’m a nerd, however, suck it Army",[],...,{'images': [{'source': {'url': 'https://previe...,,,,True,13206c00-553b-11e7-b72e-0ea26d094710,,,,
2599,,USMC,The National Anthem for Chargers v Buffalo gam...,t2_13ts76,False,,0,False,"How ‘bout some attention to detail there, NAVY!",[],...,{'images': [{'source': {'url': 'https://previe...,,,,True,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2600,,USMC,SGT Plank had his neck snapped whilst standing...,t2_50yovzkx,False,,0,False,SGT Plank,[],...,{'images': [{'source': {'url': 'https://previe...,,,,True,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2601,,USMC,So when fresh faced idiot babies (2nd Lts) go ...,t2_65sde9g,False,,0,False,"TBS Spear (peer) Eval list of attributes, anyo...",[],...,,,,,True,0637986a-553b-11e7-9183-0e42775a86da,,,,


In [54]:
usmc_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list     float64
crosspost_parent          float64
collections               float64
Length: 118, dtype: object

## Schizophrenia Statistics

In [55]:
schizo_df = pd.read_csv('../2_EDA/data/schizophrenia.csv')
schizo_df.shape

(2004, 118)

In [56]:
schizo_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,schizophrenia,I think I'm in danger of actually hurting some...,t2_qnmv04e2y,False,,0,False,I think I'm going to lose it,[],...,,,,,,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
1,,schizophrenia,Holidays blow. Brain blows. Got money saved up...,t2_7kcdoafi,False,,0,False,Taking the jump.,[],...,,,,,,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
2,,schizophrenia,"Hope you’re all having happy, stress free, hol...",t2_o4638i2d,False,,0,False,Happy Selfie-Sunday!,[],...,,,,,,0001fbbc-b350-11eb-909f-0eae5e287fb7,,,,
3,,schizophrenia,This is my first time posting here. I’ve been ...,t2_tmpjtb4b,False,,0,False,Merry Christmas and don’t forget to thank your...,[],...,,,,,,7b58ddae-b34f-11eb-8a94-0e540082c8f1,,,,
4,,schizophrenia,I noticed development of my symptoms today but...,t2_42y71xbb,False,,0,False,Selfie Sunday,[],...,,,,,,0001fbbc-b350-11eb-909f-0eae5e287fb7,,,,
5,,schizophrenia,seriously why like there's so much to talk abo...,t2_qmx9wjmff,False,,0,False,why the shit is this sub just like face pics l...,[],...,,,,,,7b58ddae-b34f-11eb-8a94-0e540082c8f1,,,,
6,,schizophrenia,,t2_osdz3xpvx,False,,0,False,A pumpkinish time memory of now nature,[],...,,,,,,376816ae-b350-11eb-adc6-0e0d3bd20d2f,,,,
7,,schizophrenia,I was reading Ren Hang’s old depression journa...,t2_hfs594bc,False,,0,False,Ren Hang possibly had schizophrenia,[],...,,True,"{'r1ywrm9m7c8c1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': 'r1ywrm9m7c8c1', 'id':...",,376816ae-b350-11eb-adc6-0e0d3bd20d2f,,,,
8,,schizophrenia,Hey there. I have bipolar but I used to hear v...,t2_8ip2z8wv,False,,0,False,Does this sound like it or something else,[],...,,,,,,dccd2c58-b34e-11eb-ae5b-0e48f7f519b5,,,,
9,,schizophrenia,Happy Birthday to Jesus!,t2_j4jqg,False,,0,False,Singing about good things!,[],...,,,,,,0001fbbc-b350-11eb-909f-0eae5e287fb7,,,,


In [57]:
schizo_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
1994,,schizophrenia,I used to be able to drink caffeine and feel n...,t2_jq53ckx0s,False,,0,False,Caffeine and antipsychotics,[],...,,,,,,de51fc7e-b34f-11eb-8843-0e7ee5dad4c7,,,,
1995,,schizophrenia,,t2_3xificw5,False,,0,False,Eternal Rot,[],...,,,,,,376816ae-b350-11eb-adc6-0e0d3bd20d2f,,,,
1996,,schizophrenia,On my medical charts it says I have schizophre...,t2_jduqxzcpt,False,,0,False,Diagnosed with paranoid schizophrenia but I’ve...,[],...,,,,,,de51fc7e-b34f-11eb-8843-0e7ee5dad4c7,,,,
1997,,schizophrenia,"Like, whenever someone is a conspiracy theoris...",t2_5qkm38ze2,False,,0,False,"Does anyone else find ""schizo memes"" dehumanis...",[],...,,,,,,189cc42e-b34e-11eb-9bbd-0eaf23a01099,,,,
1998,,schizophrenia,My doctor won’t change my meds and I am unmoti...,t2_eltev,False,,0,False,How do people manage lack of motivation?,[],...,,,,,True,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
1999,,schizophrenia,,t2_qku2i5ze,False,,0,False,"selfie Sunday, just my name in a now broken Ar...",[],...,,,,,True,7b58ddae-b34f-11eb-8a94-0e540082c8f1,,,,
2000,,schizophrenia,"Upon being diagnosed, I made and acknowledged ...",t2_9fpzpinq,False,,0,False,Willpower and logics helps,[],...,,,,,True,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
2001,,schizophrenia,Attached below is todays video link to my “On ...,t2_1buxpqk3,False,,0,False,"Schizophrenia and inherent strengths, on YouTube",[],...,,,,,True,ef98928a-b350-11eb-9dae-0ed0c28c524b,,,,
2002,,schizophrenia,get check? i might relapse,t2_uig8k3d5,False,,0,False,I dont have schizophrenia. After relapse on ni...,[],...,,,,,True,a87835d4-f961-11eb-93fb-a627bfada68b,,,,
2003,,schizophrenia,Attached below is todays video link to my “On ...,t2_1buxpqk3,False,,0,False,"Schizophrenia and the persecutory, on YouTube",[],...,,,,,True,ef98928a-b350-11eb-9dae-0ed0c28c524b,,,,


In [58]:
schizo_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list      object
crosspost_parent           object
collections               float64
Length: 118, dtype: object

## Bipolar Statistics

In [59]:
bipolar_df = pd.read_csv('../2_EDA/data/bipolar.csv')
bipolar_df.shape

(2001, 118)

In [60]:
bipolar_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,bipolar,I rarely had pressured speech in my manic epis...,t2_80gkngcwc,False,,0,False,Not much of a talker,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
1,,bipolar,I have been with BD since my diagnosis at 20 (...,t2_kdy196ssr,False,,0,False,Everything is good,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
2,,bipolar,"Hi all! I have type2, and lately I've been ext...",t2_2g032mka,False,,0,False,Paranoia around medications,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
3,,bipolar,So I am more mentally healthy than I have ever...,t2_4b3aq1by,False,,0,False,Thinking about applying at a psych hospital te...,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
4,,bipolar,It’s like living in a dream.. A dream that you...,t2_hkrrusky,False,,0,False,Lost..,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
5,,bipolar,Fyi: I got diagnosed with Bipolar 2 this year....,t2_pxvl1hpgq,False,,0,False,Can you feel a depressive episode coming on?,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
6,,bipolar,So I have bipolar disorder. It’s no secret in ...,t2_6d6otf1vl,False,,0,False,My mother used bipolar as an adjective at dinn...,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
7,,bipolar,So basically as the title says. I feel like fo...,t2_hbxdx2hi,False,,0,False,Does anyone have problems with keeping in touc...,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
8,,bipolar,Since I've being diagnosed officially and prop...,t2_jplnk3u5,False,,0,False,I Miss Manic Episodes,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
9,,bipolar,I don't know what's wrong with me.. I feel off...,t2_ba8cfgd2,False,,0,False,Not feeling good,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,


In [61]:
bipolar_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
1991,,bipolar,(Edit: here I am feeling guilty for venting ab...,t2_j84358080,False,,0,False,Was proud of myself until I got laughed at,"[{'e': 'text', 't': 'Rant '}, {'a': ':angry-cr...",...,,,,,,e3c5a90c-37c5-11ed-be75-f6def89c9511,,,,
1992,,bipolar,Ever since my diagnosis back in early November...,t2_mt76kb3up,False,,0,False,I'm probably getting fired tomorrow,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
1993,,bipolar,"Hi everyone, I want to first start off by sayi...",t2_7ey0rvck3,False,,0,False,Are auditory hallucinations normal?,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
1994,,bipolar,How does your mania present? I don’t have symp...,t2_csy74lgt,False,,0,False,Mania,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
1995,,bipolar,\n***this post might be all over the place bec...,t2_8uag18br,False,,0,False,Feeling Stagnant But Not…,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
1996,,bipolar,Okay so i have this thought that since our ext...,t2_12qcnmr,False,,0,False,Bi polar people did you manage to start your o...,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
1997,,bipolar,may contain trigger warnings like religious tr...,t2_mu97lo10h,False,,0,False,i hate my mom for making me go to a wrong kind...,"[{'e': 'text', 't': 'Rant '}, {'a': ':angry-cr...",...,,,,,,e3c5a90c-37c5-11ed-be75-f6def89c9511,,,,
1998,,bipolar,"i’m new to taking vraylar, and i’ve noticed I’...",t2_lsxubrjf,False,,0,False,waking up,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
1999,,bipolar,During my major manic episode that landed me i...,t2_7wsnq371,False,,0,False,How do you know when you are manic? does your ...,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
2000,,bipolar,"Well, I don’t think I’m the first to say this ...",t2_vbuydy8o,False,,0,False,Hard Admitting to Myself,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,True,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,


In [62]:
bipolar_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list     float64
crosspost_parent          float64
collections                object
Length: 118, dtype: object
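
The `dtypes` outputs above are not identical across subreddits: for example, `crosspost_parent_list` is `object` for Army and schizophrenia but `float64` for USMC and bipolar. A column that is entirely empty in one file is typically read as `float64`, which would explain the difference. The sketch below shows one way such mismatches could be listed; the two frames are hypothetical miniatures, not the real data.

```python
import pandas as pd

# Hypothetical miniature frames standing in for army_df and usmc_df.
army_toy = pd.DataFrame({"title": ["a"], "crosspost_parent": ["t3_abc"]})
usmc_toy = pd.DataFrame({"title": ["b"], "crosspost_parent": [float("nan")]})

# One column of dtype names per frame, aligned on the column name.
dtype_table = pd.DataFrame({
    "army": army_toy.dtypes.astype(str),
    "usmc": usmc_toy.dtypes.astype(str),
})

# Keep only the columns whose dtype differs between the frames.
mismatched = dtype_table[dtype_table.nunique(axis=1) > 1]
print(mismatched)
```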

# Organize Data in a Dataframe

In [63]:
#concatenate dataframes (ignore_index avoids repeated index labels across subreddits)
df = pd.concat([army_df, usmc_df, schizo_df, bipolar_df], ignore_index=True)

print(f'Master DF: {df.shape},\n Army: {army_df.shape},\n USMC: {usmc_df.shape},\n Schizo: {schizo_df.shape},\n Bipolar: {bipolar_df.shape}')

Master DF: (8018, 118),
 Army: (1411, 118),
 USMC: (2602, 118),
 Schizo: (2004, 118),
 Bipolar: (2001, 118)
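
The reported shapes are consistent: 1411 + 2602 + 2004 + 2001 = 8018 rows. A check like this could be automated; the sketch below uses small made-up frames (the `parts` dictionary is hypothetical) to show the idea.

```python
import pandas as pd

# Hypothetical miniature frames, one per subreddit.
parts = {
    "Army": pd.DataFrame({"subreddit": ["Army"] * 3}),
    "USMC": pd.DataFrame({"subreddit": ["USMC"] * 2}),
}

# Stack the frames and renumber the rows from 0.
combined = pd.concat(list(parts.values()), ignore_index=True)

# The combined frame should hold exactly the sum of the individual row counts.
expected_rows = sum(len(part) for part in parts.values())
assert len(combined) == expected_rows

# Per-subreddit counts should match the size of each input frame.
per_subreddit = combined["subreddit"].value_counts().to_dict()
print(per_subreddit)
```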


### Output the raw DataFrame to the EDA folder for further processing and cleaning

In [64]:
#output to csv
df.to_csv('../2_EDA/data/00_master.csv', index=False)