# Original Data Dataframe

This notebook gathers the collected data and organizes it into separate DataFrames, one per subreddit. Basic descriptive checks (shape, head, tail, and dtypes) are then run on each DataFrame. The notebook will be updated automatically as more information is gathered from the API web scrapers.

Note:
1. I began collecting the mental health subreddit data after a few iterations of collecting the armed forces subreddits.
2. I did not want to perform any pre-processing on this data here; the notebook only organizes the data and outputs it to the EDA folder.


## Import Data

In [1]:
#Libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import os

In [2]:
#Import data

# Function that reads the raw CSV files and writes one grouped CSV file per subreddit

def process_and_output_csvs(input_folder, output_folder):
    # Create the output folder if it doesn't exist
    os.makedirs(output_folder, exist_ok=True)

    all_data = []
    
    # Iterate through each file in the input folder
    for filename in os.listdir(input_folder):
        if filename.endswith('.csv'):
            file_path = os.path.join(input_folder, filename)
            try:
                # Read the CSV file and append to the list
                data = pd.read_csv(file_path)
                all_data.append(data)
            except Exception as e:
                print(f"Error processing file {filename}: {e}")
    # Check if all_data is not empty
    if not all_data:
        print("No data to process. Check if the folder contains CSV files or if they are readable.")
        return

    # Concatenate all dataframes
    concatenated_df = pd.concat(all_data, ignore_index=True)
    
    # Drop duplicate rows
    concatenated_df.drop_duplicates(inplace=True)

    # Group by subreddit and create/update separate files
    for subreddit, group in concatenated_df.groupby('subreddit'):
        output_file = os.path.join(output_folder, f"{subreddit}.csv")
        if os.path.exists(output_file):
            # Read the existing file, combine it with the new rows, drop duplicates, and overwrite the file
            existing_data = pd.read_csv(output_file)
            combined_data = pd.concat([existing_data, group]).drop_duplicates()
            combined_data.to_csv(output_file, index=False)
        else:
            # If file doesn't exist, write with header
            group.to_csv(output_file, index=False)
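One thing to keep in mind with the function above: `drop_duplicates()` with no arguments only removes rows that match in every column. If a post is scraped twice and any field changed between scrapes, both copies are kept. A minimal sketch of an alternative follows, assuming the scraped data has a unique post identifier column. The column names `id` and `score` and the helper `merge_on_key` are hypothetical and are not taken from this notebook.

```python
import pandas as pd

def merge_on_key(existing, new, key="id"):
    """Append new rows to existing ones, keeping one row per value of `key`."""
    combined = pd.concat([existing, new], ignore_index=True)
    # keep="last" keeps the most recently appended copy of each post
    deduped = combined.drop_duplicates(subset=[key], keep="last")
    return deduped.reset_index(drop=True)

# Toy data: the column names "id" and "score" are assumptions for illustration
existing = pd.DataFrame(
    {"id": ["a1", "b2"], "subreddit": ["Army", "Army"], "score": [10, 5]}
)
new = pd.DataFrame(
    {"id": ["b2", "c3"], "subreddit": ["Army", "Army"], "score": [7, 1]}
)

# Full-row comparison keeps both copies of post "b2" because the score changed
full_row = pd.concat([existing, new], ignore_index=True).drop_duplicates()

# Key-based comparison keeps a single copy of post "b2"
by_key = merge_on_key(existing, new)

print(len(full_row), len(by_key))
```

Whether the first or the last scraped copy should win depends on the analysis; `keep="first"` would preserve the earliest snapshot instead.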

In [3]:
input_folder = '../1_DATA_COLLECTION/data/'
output_folder = '../2_EDA/data/'
process_and_output_csvs(input_folder, output_folder)


# Subreddit Statistics

## Army Statistics

In [4]:
#Import data
army_df = pd.read_csv('../2_EDA/data/army.csv')

#Check data
army_df.shape

(1611, 118)

In [5]:
army_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,Army,This was a convo I had with one of my buddies ...,t2_3vmh30ad,False,,0,False,If you could create a new MRE based on a Fast ...,[],...,,,,,,,,,,
1,,Army,BLUF: how do you overcome imposter syndrome?\n...,t2_9mqncmmb,False,,0,False,how do you even Army?,[],...,,,,,,,,,,
2,,Army,"Long story short, my estranged (soon to be ex)...",t2_ag69n7u7,False,,0,False,Command Directed No-Contact Order?,[],...,,,,,,,,,,
3,,Army,"\nMy husband is 35T, and just graduated AIT. W...",t2_lb56g2zm,False,,0,False,Anyone 35T?,[],...,,,,,,,,,,
4,,Army,I could use some advice on going recruiting. I...,t2_i4rellgt,False,,0,False,Thinking of going recruiter as brand new E5,[],...,,,,,,,,,,
5,,Army,At my new duty station in Massachusetts and al...,t2_msv2hc5h,False,,0,False,"Well everyone, I’m alone for the holidays and ...",[],...,,,,,,,,,,
6,,Army,Does anyone have experience with submitting th...,t2_9t3i5665,False,,0,False,DD Form 368 has started its way up. Looking fo...,[],...,,,,,,,,,,
7,,Army,Also what does ‘Standard Excess’ refer to for ...,t2_s45e278w,False,,0,False,What does ‘Permanent Change of Station’ refer ...,[],...,,,,,,,,,,
8,,Army,Any tips or advice. I'm in osut about to turn ...,t2_odrxc0fs,False,,0,False,Advice for 17 year old,[],...,,,,,,,,,,
9,,Army,Would anyone be able to describe the process o...,t2_128y15,False,,0,False,Space Force NCO to Army Warrant Officer,[],...,,,,,,,,,,


In [6]:
army_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
1601,,Army,"Dear SMA,\n\nHow to fix the army in a few easy...",t2_i1wz4nw5,False,,0,False,How to fix the army,[],...,,,,,,,,,,
1602,,Army,The garrison CSM reached out to me on Hots&amp...,t2_5fsyr,False,,0,False,Fort Huachuca barracks program,[],...,,True,"{'jeho2tneqwac1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': 'jeho2tneqwac1', 'id':...",,,,,,
1603,,Army,,t2_bk8vm0qr,False,,0,False,This Was Our Shower In 2003. Bring Your Own Water,[],...,{'images': [{'source': {'url': 'https://previe...,,,,,,,,,
1604,,Army,So I need some advice because no one in my uni...,t2_r82wpdnl,False,,0,False,Medical Issues Being Ignored (Reserves),[],...,,,,,,,,,,
1605,,Army,"Hi all, \n\nI process these types of actions a...",t2_nro58u6,False,,0,False,Initial THC drug positive,[],...,,,,,,,,,,
1606,,Army,Anyone currently in the Army Guard or from the...,t2_ix555eqsg,False,,0,False,92W Army MOS,[],...,,,,,,,,,,
1607,,Army,,t2_aedw4,False,,0,False,SECDEF is about to have to check in with staff...,[],...,{'images': [{'source': {'url': 'https://extern...,,,,,,,,,
1608,,Army,Currently stationed overseas and am ETSing in ...,t2_v5wtx0ve,False,,0,False,ALC,[],...,,,,,True,,,,,
1609,,Army,I was wondering if there's anyone here who has...,t2_8r871m4d,False,,0,False,"BPTA, Poland Rotation",[],...,,,,,True,,,,,
1610,,Army,,t2_fdaozdoo,False,,0,False,Does intense acne where a male would have a be...,[],...,,,,,True,,,,,


In [7]:
army_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list      object
crosspost_parent           object
collections               float64
Length: 118, dtype: object
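The checks run in this section (shape, head, tail, dtypes) could be complemented by a few summary counts per subreddit. The sketch below is one possible helper, not part of the original notebook. The name `summarize` and the toy data are assumptions for illustration.

```python
import pandas as pd

def summarize(df):
    """Return a few basic descriptive figures for one subreddit DataFrame."""
    return {
        "rows": len(df),
        "columns": df.shape[1],
        # Columns in which every value is missing
        "empty_columns": int(df.isna().all().sum()),
        # Rows that repeat an earlier row in every column
        "duplicate_rows": int(df.duplicated().sum()),
    }

# Toy frame standing in for a subreddit file; the values are made up
toy_df = pd.DataFrame(
    {
        "subreddit": ["Army", "Army", "Army"],
        "selftext": ["first post", None, "first post"],
        "approved_at_utc": [float("nan"), float("nan"), float("nan")],
    }
)

summary = summarize(toy_df)
print(summary)
```

Calling `summarize(army_df)` would give the same figures for the real data.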

## Marine Statistics

In [8]:
usmc_df = pd.read_csv('../2_EDA/data/USMC.csv')

usmc_df.shape

(2803, 118)

In [9]:
usmc_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,USMC,"Say about 4 of these per platoon, this could b...",t2_b5wp1wrw,False,,0,False,Thoughts on this?,[],...,{'images': [{'source': {'url': 'https://extern...,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
1,,USMC,,t2_jtw07360x,False,,0,False,"Kareem Nikoui at Pumpkin Rock Trail in Norco, ...",[],...,,True,"{'u4r57pz4d48c1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': 'u4r57pz4d48c1', 'id':...",,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2,,USMC,I was reminiscing about the days before the wa...,t2_1v59fmwl,False,,0,False,Who was in Kuwait in 2003 when the first air r...,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
3,,USMC,"Okinawa, Japan Summer 1988 toured a bit with m...",t2_8k13yaso,False,,0,False,TBT-,[],...,,True,"{'7z0w8pxj648c1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': '7z0w8pxj648c1', 'id':...",,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
4,,USMC,"Hello everyone, to preface this I’m not a mari...",t2_15giq1d5,False,,0,False,Trying to find information about my grandfather,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
5,,USMC,Are they still stocked next to the duty huts o...,t2_ciwdqgh1p,False,,0,False,Xyience,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
6,,USMC,,t2_vqsnwh2p,False,,0,False,Who else remembers watching this guy on the Hi...,[],...,{'images': [{'source': {'url': 'https://extern...,,,,,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
7,,USMC,"For those who've used the GI Bill, would you r...",t2_442i5uw4,False,,0,False,GI Bill BAH and Dorm,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
8,,USMC,Ladies and gentleman it is with great pleasure...,t2_2rne3tva,False,,0,False,I'm Free,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
9,,USMC,What was the scoring system for rifle qual in ...,t2_nr078offv,False,,0,False,Rifle qual scoring in the '70's,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,


In [10]:
usmc_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
2793,,USMC,,t2_2to4h5rn,False,,0,False,"What running the ""Gauntlet"" was like back duri...",[],...,{'images': [{'source': {'url': 'https://extern...,,,,,f173d5ec-553a-11e7-ae19-0e9955593edc,,,,
2794,,USMC,In the IRR. I check my MOL every 6 months or s...,t2_lprzgpg,False,,0,False,Promotion Question,[],...,{'images': [{'source': {'url': 'https://previe...,,,,,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2795,,USMC,This has been a common recent dispute between ...,t2_8zvwkshf,False,,0,False,Which Tattoo Policy Applies?,[],...,,,,,,0637986a-553b-11e7-9183-0e42775a86da,,,,
2796,,USMC,For all you marines out there that are getting...,t2_qaz0tq6h8,False,,0,False,It’s really happening we’re going to war again.,[],...,,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
2797,,USMC,,t2_kestbit,False,,0,False,January 2024 Active Duty Cutting Scores Released,[],...,{'images': [{'source': {'url': 'https://extern...,,,,,34f329d0-553b-11e7-a3ff-0e825ef5e402,,,,
2798,,USMC,While shooting down a bunch of drones targetin...,t2_4owk1,False,,0,False,Entire crew of destroyer receives CARs,[],...,{'images': [{'source': {'url': 'https://extern...,,,,True,13206c00-553b-11e7-b72e-0ea26d094710,,,,
2799,,USMC,https://theaviationgeekclub.com/the-usmc-deems...,t2_t8aqt,False,,0,False,"Yes I know I’m a nerd, however, suck it Army",[],...,{'images': [{'source': {'url': 'https://previe...,,,,True,13206c00-553b-11e7-b72e-0ea26d094710,,,,
2800,,USMC,The National Anthem for Chargers v Buffalo gam...,t2_13ts76,False,,0,False,"How ‘bout some attention to detail there, NAVY!",[],...,{'images': [{'source': {'url': 'https://previe...,,,,True,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2801,,USMC,SGT Plank had his neck snapped whilst standing...,t2_50yovzkx,False,,0,False,SGT Plank,[],...,{'images': [{'source': {'url': 'https://previe...,,,,True,49648516-8c0f-11e2-b1d6-12313d1839b0,,,,
2802,,USMC,So when fresh faced idiot babies (2nd Lts) go ...,t2_65sde9g,False,,0,False,"TBS Spear (peer) Eval list of attributes, anyo...",[],...,,,,,True,0637986a-553b-11e7-9183-0e42775a86da,,,,


In [11]:
usmc_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list     float64
crosspost_parent          float64
collections               float64
Length: 118, dtype: object

## Schizophrenia Statistics

In [12]:
schizo_df = pd.read_csv('../2_EDA/data/schizophrenia.csv')
schizo_df.shape

(2204, 118)

In [13]:
schizo_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,schizophrenia,I think I'm in danger of actually hurting some...,t2_qnmv04e2y,False,,0,False,I think I'm going to lose it,[],...,,,,,,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
1,,schizophrenia,Holidays blow. Brain blows. Got money saved up...,t2_7kcdoafi,False,,0,False,Taking the jump.,[],...,,,,,,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
2,,schizophrenia,"Hope you’re all having happy, stress free, hol...",t2_o4638i2d,False,,0,False,Happy Selfie-Sunday!,[],...,,,,,,0001fbbc-b350-11eb-909f-0eae5e287fb7,,,,
3,,schizophrenia,This is my first time posting here. I’ve been ...,t2_tmpjtb4b,False,,0,False,Merry Christmas and don’t forget to thank your...,[],...,,,,,,7b58ddae-b34f-11eb-8a94-0e540082c8f1,,,,
4,,schizophrenia,I noticed development of my symptoms today but...,t2_42y71xbb,False,,0,False,Selfie Sunday,[],...,,,,,,0001fbbc-b350-11eb-909f-0eae5e287fb7,,,,
5,,schizophrenia,seriously why like there's so much to talk abo...,t2_qmx9wjmff,False,,0,False,why the shit is this sub just like face pics l...,[],...,,,,,,7b58ddae-b34f-11eb-8a94-0e540082c8f1,,,,
6,,schizophrenia,,t2_osdz3xpvx,False,,0,False,A pumpkinish time memory of now nature,[],...,,,,,,376816ae-b350-11eb-adc6-0e0d3bd20d2f,,,,
7,,schizophrenia,I was reading Ren Hang’s old depression journa...,t2_hfs594bc,False,,0,False,Ren Hang possibly had schizophrenia,[],...,,True,"{'r1ywrm9m7c8c1': {'status': 'valid', 'e': 'Im...","{'items': [{'media_id': 'r1ywrm9m7c8c1', 'id':...",,376816ae-b350-11eb-adc6-0e0d3bd20d2f,,,,
8,,schizophrenia,Hey there. I have bipolar but I used to hear v...,t2_8ip2z8wv,False,,0,False,Does this sound like it or something else,[],...,,,,,,dccd2c58-b34e-11eb-ae5b-0e48f7f519b5,,,,
9,,schizophrenia,Happy Birthday to Jesus!,t2_j4jqg,False,,0,False,Singing about good things!,[],...,,,,,,0001fbbc-b350-11eb-909f-0eae5e287fb7,,,,


In [14]:
schizo_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
2194,,schizophrenia,Aliens fighting robots. There’s a war going on...,t2_72me7aot,False,,0,False,"Aliens,Robots(cyborgs), magic",[],...,,,,,,a87835d4-f961-11eb-93fb-a627bfada68b,,,,
2195,,schizophrenia,,t2_k82qz,False,,0,False,Rachel Star Withers: How I Navigate Life With ...,[],...,,,,,,c9d488c2-b3f7-11eb-9ebe-0e028e7fa7fb,,,,
2196,,schizophrenia,So I have a delusion that there’s something se...,t2_towb139l,False,,0,False,Internal voices coming out of your mouth?,[],...,,,,,,de51fc7e-b34f-11eb-8843-0e7ee5dad4c7,,,,
2197,,schizophrenia,Can you have a schizo spectrum disorder and be...,t2_i7x7q5bp,False,,0,False,psychosis and awareness,[],...,,,,,True,dccd2c58-b34e-11eb-ae5b-0e48f7f519b5,,,,
2198,,schizophrenia,My doctor won’t change my meds and I am unmoti...,t2_eltev,False,,0,False,How do people manage lack of motivation?,[],...,,,,,True,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
2199,,schizophrenia,,t2_qku2i5ze,False,,0,False,"selfie Sunday, just my name in a now broken Ar...",[],...,,,,,True,7b58ddae-b34f-11eb-8a94-0e540082c8f1,,,,
2200,,schizophrenia,"Upon being diagnosed, I made and acknowledged ...",t2_9fpzpinq,False,,0,False,Willpower and logics helps,[],...,,,,,True,467cb704-b34f-11eb-85d3-0ed0dc5434c5,,,,
2201,,schizophrenia,Attached below is todays video link to my “On ...,t2_1buxpqk3,False,,0,False,"Schizophrenia and inherent strengths, on YouTube",[],...,,,,,True,ef98928a-b350-11eb-9dae-0ed0c28c524b,,,,
2202,,schizophrenia,get check? i might relapse,t2_uig8k3d5,False,,0,False,I dont have schizophrenia. After relapse on ni...,[],...,,,,,True,a87835d4-f961-11eb-93fb-a627bfada68b,,,,
2203,,schizophrenia,Attached below is todays video link to my “On ...,t2_1buxpqk3,False,,0,False,"Schizophrenia and the persecutory, on YouTube",[],...,,,,,True,ef98928a-b350-11eb-9dae-0ed0c28c524b,,,,


In [15]:
schizo_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list      object
crosspost_parent           object
collections               float64
Length: 118, dtype: object

## Bipolar Statistics

In [16]:
bipolar_df = pd.read_csv('../2_EDA/data/bipolar.csv')
bipolar_df.shape

(2201, 118)

In [17]:
bipolar_df.head(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
0,,bipolar,I rarely had pressured speech in my manic epis...,t2_80gkngcwc,False,,0,False,Not much of a talker,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
1,,bipolar,I have been with BD since my diagnosis at 20 (...,t2_kdy196ssr,False,,0,False,Everything is good,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
2,,bipolar,"Hi all! I have type2, and lately I've been ext...",t2_2g032mka,False,,0,False,Paranoia around medications,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
3,,bipolar,So I am more mentally healthy than I have ever...,t2_4b3aq1by,False,,0,False,Thinking about applying at a psych hospital te...,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
4,,bipolar,It’s like living in a dream.. A dream that you...,t2_hkrrusky,False,,0,False,Lost..,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
5,,bipolar,Fyi: I got diagnosed with Bipolar 2 this year....,t2_pxvl1hpgq,False,,0,False,Can you feel a depressive episode coming on?,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
6,,bipolar,So I have bipolar disorder. It’s no secret in ...,t2_6d6otf1vl,False,,0,False,My mother used bipolar as an adjective at dinn...,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
7,,bipolar,So basically as the title says. I feel like fo...,t2_hbxdx2hi,False,,0,False,Does anyone have problems with keeping in touc...,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
8,,bipolar,Since I've being diagnosed officially and prop...,t2_jplnk3u5,False,,0,False,I Miss Manic Episodes,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
9,,bipolar,I don't know what's wrong with me.. I feel off...,t2_ba8cfgd2,False,,0,False,Not feeling good,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,


In [18]:
bipolar_df.tail(10)

Unnamed: 0,approved_at_utc,subreddit,selftext,author_fullname,saved,mod_reason_title,gilded,clicked,title,link_flair_richtext,...,preview,is_gallery,media_metadata,gallery_data,author_cakeday,link_flair_template_id,poll_data,crosspost_parent_list,crosspost_parent,collections
2191,,bipolar,Are there others out there like me? \n\n\nI'...,t2_5jb6o,False,,0,False,Why does nothing work for me?,"[{'e': 'text', 't': 'Rant '}, {'a': ':angry-cr...",...,,,,,,e3c5a90c-37c5-11ed-be75-f6def89c9511,,,,
2192,,bipolar,it’s been about two months since i’ve started ...,t2_in8mo79kd,False,,0,False,how do i know if my feelings will last?,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
2193,,bipolar,I'm a uni student with Bipolar type II. I used...,t2_lup1qexv,False,,0,False,Life is going downhill,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
2194,,bipolar,"When I saw my psychiatrist in November, she up...",t2_iyd0rac,False,,0,False,I just feel indifferent to everything lately,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
2195,,bipolar,"Hi, so I’d like advice on traveling and having...",t2_l5lorqka2,False,,0,False,Advice on traveling/being off routine,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
2196,,bipolar,My wife is going with me (44m) because she mig...,t2_7cj3h9tn,False,,0,False,I have an appointment with an ECT neurologist,"[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
2197,,bipolar,"Hey everyone, just wanted to share I finally g...",t2_q7pk48eng,False,,0,False,4+ years and finally diagnosed,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,
2198,,bipolar,"“Look at the bright side,” is something my mot...",t2_mkpql,False,,0,False,BP superpower,"[{'e': 'text', 't': 'Discussion '}, {'a': ':ne...",...,,,,,,8885c358-e44e-11ec-9bf1-c6e6c22e188f,,,,
2199,,bipolar,I have been diagnosed with Bipolar II for arou...,t2_crpm9rff,False,,0,False,"Had a Psychotic Episode, Unsure of What To Do","[{'e': 'text', 't': 'Support/Advice '}, {'a': ...",...,,,,,,3a0100b8-1a7b-11e3-9746-12313b04c5c2,,,,
2200,,bipolar,"Well, I don’t think I’m the first to say this ...",t2_vbuydy8o,False,,0,False,Hard Admitting to Myself,"[{'e': 'text', 't': 'Just Sharing'}]",...,,,,,True,19cabdb6-e9ed-11ec-b83a-029c2d035f04,,,,


In [19]:
bipolar_df.dtypes

approved_at_utc           float64
subreddit                  object
selftext                   object
author_fullname            object
saved                        bool
                           ...   
link_flair_template_id     object
poll_data                  object
crosspost_parent_list     float64
crosspost_parent          float64
collections                object
Length: 118, dtype: object
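The dtype listings differ between subreddits: for example, `crosspost_parent` is `object` for Army and schizophrenia but `float64` for USMC and bipolar, and `collections` is `object` only for bipolar. A column that is entirely empty in one file is read as `float64`, which is the likely cause. The sketch below shows one way to list such columns before concatenating. The helper name `dtype_mismatches` and the toy values are assumptions.

```python
import pandas as pd

def dtype_mismatches(frames):
    """Return the columns whose dtype is not identical across all frames."""
    # One column per frame, one row per column name, values are dtype names
    dtypes = pd.DataFrame(
        {name: df.dtypes.astype(str) for name, df in frames.items()}
    )
    differs = dtypes.nunique(axis=1) > 1
    return dtypes[differs]

# Toy frames: "t3_abc" is a made-up value, not taken from the data
toy_frames = {
    "army": pd.DataFrame({"subreddit": ["Army"], "crosspost_parent": ["t3_abc"]}),
    "usmc": pd.DataFrame({"subreddit": ["USMC"], "crosspost_parent": [float("nan")]}),
}

mismatches = dtype_mismatches(toy_frames)
print(mismatches)
```

With the real data, the call would be `dtype_mismatches({"army": army_df, "usmc": usmc_df, "schizo": schizo_df, "bipolar": bipolar_df})`.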

# Organize Data in a DataFrame

In [20]:
#concatenate dataframes
df = pd.concat([army_df, usmc_df, schizo_df, bipolar_df])

print(f'Master DF: {df.shape},\n Army: {army_df.shape},\n USMC: {usmc_df.shape},\n Schizo: {schizo_df.shape},\n Bipolar: {bipolar_df.shape}')

Master DF: (8819, 118),
 Army: (1611, 118),
 USMC: (2803, 118),
 Schizo: (2204, 118),
 Bipolar: (2201, 118)
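The printed shapes are consistent: 1611 + 2803 + 2204 + 2201 = 8819 rows. Note that `pd.concat` without `ignore_index=True` keeps each frame's own index, so index labels repeat in the master DataFrame. That is harmless when the file is written with `index=False`, but it matters for any label-based lookup. The sketch below, with a hypothetical helper `concat_checked` and toy data, makes both points explicit.

```python
import pandas as pd

def concat_checked(frames):
    """Concatenate frames with a fresh index and verify the row count."""
    combined = pd.concat(frames, ignore_index=True)
    expected_rows = sum(len(frame) for frame in frames)
    if len(combined) != expected_rows:
        raise ValueError(f"expected {expected_rows} rows, got {len(combined)}")
    return combined

# Toy frames standing in for the per-subreddit DataFrames
toy_a = pd.DataFrame({"subreddit": ["Army", "Army"], "title": ["t1", "t2"]})
toy_b = pd.DataFrame({"subreddit": ["USMC"], "title": ["t3"]})

plain = pd.concat([toy_a, toy_b])          # index labels 0, 1, 0
checked = concat_checked([toy_a, toy_b])   # index labels 0, 1, 2

print(plain.index.is_unique, checked.index.is_unique)
```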


## Output raw DataFrame to EDA folder for further processing and cleaning

In [21]:
#output to csv
df.to_csv('../2_EDA/data/00_master.csv', index=False)
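Since later notebooks depend on this file, it may be worth reading it back once to confirm that it round-trips. The following is a sketch under stated assumptions: the helper `write_and_verify` and the toy file name are hypothetical, and a temporary directory is used so the example does not touch the project folders.

```python
import os
import tempfile

import pandas as pd

def write_and_verify(df, path):
    """Write df to CSV without the index, read it back, and compare shapes."""
    df.to_csv(path, index=False)
    # low_memory=False reads the file in one pass, so each column's dtype
    # is inferred from the whole column rather than chunk by chunk
    reloaded = pd.read_csv(path, low_memory=False)
    if reloaded.shape != df.shape:
        raise ValueError(f"shape changed: {df.shape} -> {reloaded.shape}")
    return reloaded

# Toy frame standing in for the master DataFrame
toy_df = pd.DataFrame(
    {"subreddit": ["Army", "bipolar"], "title": ["A post", "Another post"]}
)

with tempfile.TemporaryDirectory() as tmp_dir:
    toy_path = os.path.join(tmp_dir, "00_master_toy.csv")
    reloaded = write_and_verify(toy_df, toy_path)

print(reloaded.shape)
```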