# Summary
This notebook provides a basic analysis of the comments: example comments, how many comments come from templates, and how many unique comments remain.

In [2]:
import pandas
import numpy as np
import folium
from folium import plugins

The comments are loaded in from the Data Cleanup notebook.

In [3]:
# dtype=False disables dtype inference; the string 'false' would not
data = pandas.read_json('./data/comments_with_attachments.json', orient='records', dtype=False)

Below is a printout of the pandas DataFrame schema and the first comment.

In [4]:
print("Loaded %d comments." % len(data))
display(data[:1])

Loaded 13236 comments.


Unnamed: 0,doc.attachment_download,doc.attachment_download -href,doc.attachment_name,doc.category,doc.city,doc.comment_body,doc.country,doc.name,doc.state,doc.zip
0,,,,,United States,"Dear Assistant General Counsel Hilary Malawer,...",Parent/Relative,Heather Hirsch,MN,55016


A random sample of five doc.comment_body values from the dataset.

In [5]:
with pandas.option_context('display.max_colwidth', 5000):
    display(data[['doc.comment_body']].sample(5))

Unnamed: 0,doc.comment_body
5758,"Dear Assistant General Counsel Hilary Malawer,\n\nAll Department of Education civil rights regulations and guidance documents are important and necessary. Far from being burdensome, current civil rights rules and regulations benefit schools and students by providing a clear framework that, when followed, allow all students an equal opportunity to learn in a safe and welcoming environment regardless of sex, race, color, national origin, disability status, English proficiency, sexual orientation, or gender identity.\nI urge the Department to keep in its current form 34 C.F.R. pts. 1 thru 1299 , which include regulations governing the Secretary and the offices for Civil Rights; Elementary and Secondary Education; Special Education and Rehabilitative Services; Career, Technical, and Adult Education; Post-Secondary Education; Educational Research and Improvement; and the National Council on Disability. \n\nI also urge the Department to preserve all current significant guidance documents, including guidance on sexual, racial, and disability-based harassment (including guidance on sexual violence); access to athletic opportunities; gender equity in career and technical education; single-sex schools; equal access to educational resources; nondiscriminatory school discipline; racial diversity programs; the rights of students with disabilities in charter schools; restraint and seclusion of students with disabilities; and the rights of English language learners. I urge you to keep current regulations and guidance in place, and to continue enforcing these critical civil rights laws so that all students have an equal opportunity to learn and thrive.\n\nSincerely,\ndani ortolano\n New York, NY 10002"
10176,Devois needs to show that she cares about other human beings.
3472,"Dear Assistant General Counsel Hilary Malawer,\n\nAll Department of Education civil rights regulations and guidance documents are important and necessary. Far from being burdensome, current civil rights rules and regulations benefit schools and students by providing a clear framework that, when followed, allow all students an equal opportunity to learn in a safe and welcoming environment regardless of sex, race, color, national origin, disability status, English proficiency, sexual orientation, or gender identity.\nI urge the Department to keep in its current form 34 C.F.R. pts. 1 thru 1299 , which include regulations governing the Secretary and the offices for Civil Rights; Elementary and Secondary Education; Special Education and Rehabilitative Services; Career, Technical, and Adult Education; Post-Secondary Education; Educational Research and Improvement; and the National Council on Disability. \n\nI also urge the Department to preserve all current significant guidance documents, including guidance on sexual, racial, and disability-based harassment (including guidance on sexual violence); access to athletic opportunities; gender equity in career and technical education; single-sex schools; equal access to educational resources; nondiscriminatory school discipline; racial diversity programs; the rights of students with disabilities in charter schools; restraint and seclusion of students with disabilities; and the rights of English language learners. I urge you to keep current regulations and guidance in place, and to continue enforcing these critical civil rights laws so that all students have an equal opportunity to learn and thrive.\n\nSincerely,\nLinda L. Pearce Pearce\n Brentwood, TN 37027"
3642,"Dear Assistant General Counsel Hilary Malawer,\n\nAll Department of Education civil rights regulations and guidance documents are important and necessary. Far from being burdensome, current civil rights rules and regulations benefit schools and students by providing a clear framework that, when followed, allow all students an equal opportunity to learn in a safe and welcoming environment regardless of sex, race, color, national origin, disability status, English proficiency, sexual orientation, or gender identity.\nI urge the Department to keep in its current form 34 C.F.R. pts. 1 thru 1299 , which include regulations governing the Secretary and the offices for Civil Rights; Elementary and Secondary Education; Special Education and Rehabilitative Services; Career, Technical, and Adult Education; Post-Secondary Education; Educational Research and Improvement; and the National Council on Disability. \n\nI also urge the Department to preserve all current significant guidance documents, including guidance on sexual, racial, and disability-based harassment (including guidance on sexual violence); access to athletic opportunities; gender equity in career and technical education; single-sex schools; equal access to educational resources; nondiscriminatory school discipline; racial diversity programs; the rights of students with disabilities in charter schools; restraint and seclusion of students with disabilities; and the rights of English language learners. I urge you to keep current regulations and guidance in place, and to continue enforcing these critical civil rights laws so that all students have an equal opportunity to learn and thrive.\n\nSincerely,\nRachel Ruthenberg\n Clinton Township, MI 48038"
12666,"Dear Assistant General Counsel Hilary Malawer,\n\nAll Department of Education civil rights regulations and guidance documents are important and necessary. Far from being burdensome, current civil rights rules and regulations benefit schools and students by providing a clear framework that, when followed, allow all students an equal opportunity to learn in a safe and welcoming environment regardless of sex, race, color, national origin, disability status, English proficiency, sexual orientation, or gender identity.\nI urge the Department to keep in its current form 34 C.F.R. pts. 1 thru 1299 , which include regulations governing the Secretary and the offices for Civil Rights; Elementary and Secondary Education; Special Education and Rehabilitative Services; Career, Technical, and Adult Education; Post-Secondary Education; Educational Research and Improvement; and the National Council on Disability. \n\nI also urge the Department to preserve all current significant guidance documents, including guidance on sexual, racial, and disability-based harassment (including guidance on sexual violence); access to athletic opportunities; gender equity in career and technical education; single-sex schools; equal access to educational resources; nondiscriminatory school discipline; racial diversity programs; the rights of students with disabilities in charter schools; restraint and seclusion of students with disabilities; and the rights of English language learners. I urge you to keep current regulations and guidance in place, and to continue enforcing these critical civil rights laws so that all students have an equal opportunity to learn and thrive.\n\nSincerely,\nKimberly Idrovo\n Uniontown, PA 15401"


Below are the common strings used in pattern matching to see how many similar comments exist. The list is handpicked, and the matching follows the approach used [here](https://github.com/j2kao/fcc_nn_research/blob/master/proc_17_108_analysis_01_level_0_manual_tagging.ipynb) for FCC comments.

In [6]:
common_strings = "Dear Assistant General Counsel Hilary Malawer,\n\nAll Department of Education civil rights regulations and guidance documents are important and necessary"
common_string2 = "Dear Assistant General Counsel Hilary Malawer,\n\nCurrent federal regulations and guidance help all studentsregardless of sex, race, color, sexual orientation,"
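To see how many comments each template accounts for, the matches can be counted per pattern before anything is dropped. The following is a minimal sketch on a toy DataFrame; the `templates` dict, its labels, the shortened patterns, and the sample rows are assumptions made for illustration and are not taken from the dataset.

```python
import pandas as pd

# Hypothetical, shortened template openings used only for this example.
templates = {
    "template_1": "All Department of Education civil rights regulations",
    "template_2": "Current federal regulations and guidance help all students",
}

# Toy stand-in for the real comments DataFrame.
toy = pd.DataFrame({"doc.comment_body": [
    "Dear X,\n\nAll Department of Education civil rights regulations are important.",
    "dear x,\n\nall department of education civil rights regulations matter.",
    "Current federal regulations and guidance help all students.",
    "Please keep Title IX guidance in place.",
    None,
]})

def count_template_matches(frame, patterns, column="doc.comment_body"):
    """Return {label: number of rows whose text contains the pattern}."""
    text = frame[column]
    return {
        # regex=False treats the pattern as literal text; na=False skips nulls.
        label: int(text.str.contains(pattern, case=False, regex=False, na=False).sum())
        for label, pattern in patterns.items()
    }

template_counts = count_template_matches(toy, templates)
print(template_counts)
```

On the real data, the same function could be called with the full template strings in place of the shortened ones.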

In [7]:
dup_removed = data.copy()

# Find rows containing the common strings (literal, case-insensitive match;
# rows with a null comment body are treated as non-matching)
invalid_indexes = []
invalid_indexes.extend(data[data['doc.comment_body'].str.contains(common_strings, case=False, na=False, regex=False)].index.values)
invalid_indexes.extend(data[data['doc.comment_body'].str.contains(common_string2, case=False, na=False, regex=False)].index.values)

dup_removed.drop(invalid_indexes, inplace=True)
dup_removed = dup_removed.reset_index(drop=True)

# Saving the dataset with duplicates removed for later use
dup_removed.to_json('./data/comments_duplicates_removed.json')

print("Total comments after removing duplicates: ", len(dup_removed))

Total comments after removing duplicates:  3879
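The count above treats everything that does not match a template as distinct. A complementary check is to count exact repeats directly. The sketch below is one possible approach, shown on toy data; the `normalize` helper and the sample rows are assumptions. Note that template comments ending in a different signature would still count as distinct under this exact-match approach.

```python
import pandas as pd

# Toy stand-in for the real comments DataFrame.
toy = pd.DataFrame({"doc.comment_body": [
    "Keep the current guidance.",
    "keep the current   guidance. ",
    "Do not rescind Title IX protections.",
    None,
]})

def normalize(text):
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(str(text).lower().split())

# Drop null bodies first, then compare the normalized text.
bodies = toy["doc.comment_body"].dropna().map(normalize)
unique_count = bodies.nunique()
repeat_counts = bodies.value_counts()  # most repeated comment first

print("Unique comments:", unique_count)
print(repeat_counts.head())
```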


Another look at the dataset, now with the most common comments removed.

In [8]:
with pandas.option_context('display.max_colwidth', 5000):
    display(dup_removed[['doc.comment_body']].sample(50))

Unnamed: 0,doc.comment_body
2728,Do not turn your back on sexual assault survivors.
369,
375,
2664,"I recommend the required referral of all sexual based offenses (rape/sex assault, fondling, incest, stat. rape) as defined in the Clery Act be required to be fully investigated by a sworn law enforcement agency. Any disciplinary action should also be required to be processed through the criminal and/or civil courts of the applicable jurisdiction. Colleges / universities SHOULD NOT be allowed to process these types of offenses solely on their own. Any university/college disciplinary action should be informed by the outcomes of a law enforcement investigation and subject to the same burdens of proof."
16,"Dear Assistant General Counsel Hilary Malawer,\n\nEducation is a lifeline for disadvantaged children and making it safe and accessible is an immense issue. I'm shocked that you would call basic standards of safety and deecency ""burdensome."" All Department of Education civil rights regulations and guidance documents are important and necessary. Far from being burdensome, current civil rights rules and regulations benefit schools and students by providing a clear framework that, when followed, allow all students an equal opportunity to learn in a safe and welcoming environment regardless of sex, race, color, national origin, disability status, English proficiency, sexual orientation, or gender identity.\nI urge the Department to keep in its current form 34 C.F.R. pts. 1 thru 1299 , which include regulations governing the Secretary and the offices for Civil Rights; Elementary and Secondary Education; Special Education and Rehabilitative Services; Career, Technical, and Adult Education; Post-Secondary Education; Educational Research and Improvement; and the National Council on Disability. \n\nI also urge the Department to preserve all current significant guidance documents, including guidance on sexual, racial, and disability-based harassment (including guidance on sexual violence); access to athletic opportunities; gender equity in career and technical education; single-sex schools; equal access to educational resources; nondiscriminatory school discipline; racial diversity programs; the rights of students with disabilities in charter schools; restraint and seclusion of students with disabilities; and the rights of English language learners. I urge you to keep current regulations and guidance in place, and to continue enforcing these critical civil rights laws so that all students have an equal opportunity to learn and thrive.\n\nSincerely,\nFrederica Sandler\n New Orleans, LA 70118"
1362,"I am writing as a professional in the sexual violence prevention field. I have a Master of Public Health and 7 years of experience specifically in campus sexual violence prevention. \n\nI believe that the Title IX guidance from the DOE, while needing some improvement, should NOT be rescinded entirely. From my experience, schools that were struggling to implement the current guidance were most often due to the ""growing pains"" of learning new guidance (it takes time for any system to effectively implement new guidance), or it was a reflection upon the staff at that university (a lack of training and/or poor decision-making by individuals). I want to protect the rights of complainants and respondents at universities, and this can be done without completely overhauling the current guidance. \nI want to share my thoughts regarding the ""preponderance of evidence"" standard for campus proceedings. \n\nFirst, sexual assault reporting is incredibly low, both to police and to campus officials. (See http://bjs.ojp.usdoj.gov/content/pub/pdf/rsarp00.pdf and https://www.ncjrs.gov/pdffiles1/nij/grants/221153.pdf) \n\nOf the few sexual assaults that are reported to police, very few are found to be false (see https://www.nsvrc.org/sites/default/files/Publications_NSVRC_Overview_False-Reporting.pdf). In fact, the false report rate is on par with that of other violent crimes. (We do not yet have data about the percentage of false reports to campus officials.) In my experience, false reports are also easily discovered through basic investigation.\n\nIt is also incredibly rare for perpetrators to see any consequences. When they do, it is more common that a campus gives them a ""slap on the wrist"" (removal from a club, attending an educational event, writing a paper, etc). 
It is rarer still for a respondent to see consequences of any weight (such as suspension or expulsion).\n\nSo, it follows that the likelihood that someone reported a sexual assault (rare) that was false (even rarer) where the respondent received consequences of any weight (rarer STILL) is incredibly low. It is especially stark when you compare these numbers to the vast number of victims/survivors who never report their sexual assault, or who do report but never see justice (by a campus OR the criminal justice system). \n\nClearly, we still want to prevent this incredibly unlikely occurrence, and provide due process to respondents. But DOE should not remove the preponderance of evidence standard for campus sexual assault investigations.\n\nIt is considered best practice by higher education associations and professionals across the nation to use the preponderance of the evidence standard for campus adjudication processes, because the consequences for being found ""responsible"" by a university are much different from that of being found ""guilty"" by the criminal justice system. As stated above, the consequences handed down by a university are typically mild. Even in the case of expulsion, which does create some obstacles for a student, very few actually have ""sexual assault"" stated as the reason for their expulsion on their transcripts.\n\nBy contrast, the criminal justice system has much more serious consequences, such as fines, jail, or prison. In this case, using the ""beyond a reasonable doubt"" standard makes sense because the stakes are much higher.\n\nAs stated by current Title IX guidance, it should also be considered a form of discrimination to utilize a higher standard of proof for sexual assault cases than for other civil rights violations. 
Raising the standard of proof would considerably hinder the already rare ability of a university to bring perpetrators to justice and keep their campus safe; and considering that the vast majority of sexual assault victims are women, this should be considered discrimination against woman-identified students.\n\nTitle IX guidance already requires that schools provide ""an adequate, reliable, and impartial investigation."" If there are concerns regarding this, then the Department of Education should illuminate the ways in which schools can ensure an impartial investigation (like how to provide equal resources to the complainant and respondent) while maintaining trauma-informed practices (for example, streamlining the process so the victim does not have to continually re-tell their story). An impartial process and trauma-informed practice are not at odds. \n\nAs complicated and fraught as the criminal justice system is, I also believe it would create a much larger burden on universities to ask them to move towards replicating that system, which also fails victims miserably. Moving in this direction would lead to even fewer reports than campuses see today.\n\nTo conclude, I strongly urge Secretary DeVos and the DOE to keep current Title IX Guidance, and to consult experts in higher education, victims and survivors, and advocates to learn how to improve upon that guidance. \n\nThank you for your time."
2326,We cannot risk losing any accommodations in our schools for kids with learning differences or we will see an immediate increase in dropouts; then crime rates will soar. \nI promise it is not worth the phat bonus Betsy will receive. \n
1557,"Hi my name is David Green. I would like to introduce a comment that I have in regards to WIOA regulations. Ive been working at Lighthouse Louisiana for the past 13 years. I lost my site about 15 years ago due to an eye condition call retinitis pigmentosa, or RP.\n\nPrior to losing my sight, I was a very independent person. I had a lot of opportunities to choose where I wanted to work. I made choices as to where I went along in life.\nSince I lost my sight, a lot of opportunities that I once had, were no longer there. It was a very dark time in my life. I could not see a way out.\nSince then I have really made a big improvement in my life. I have found a place where I can be comfortable-where there's lots of training and opportunities for me to continue to be productive and successful. I got a job at Lighthouse Louisiana.\n\nComing to work at the Lighthouse- a lot of those opportunities re-appeared. I now saw choices that I had once before losing my sight, but now they were in reach. Working at a place like Lighthouse Louisiana really has been the best thing that happened to me since losing my sight. Lighthouse has really given me my independence, hope, and a sense of being the hard working American that I started out to be in life. I would like every person with a disability to have the same choices that I had. And be able to utilize them to their own discretion. \nIts all about choices. I lost my vision, but Im still fully capable of making my own choices. I dont want to be forced to go where other people think I should go. I want to be able to choose for myself, like I have been over the last 57 years of my life. I think every working American should be able to make their own choices. I really think its Un-American to have choices like that taken away from you. 
\nSo I would like all national officials to take a look at the WIOA language that is not good for a person with a disability and make changes so that everyone can make a choice of where they want to work and AbilityOne agencies like Lighthouse Louisiana arent discriminated against. \nDavid Green\nEmployee & Advocate\nLighthouse Louisiana\n"
1316,"I strongly request The U.S Department of education to withdraw the regulation appearing at 34 CFR Part 99, which was effective January 2012 and which greatly diminished the privacy protections of FERPA."
3854,"Ms. Hilary Malawer\nAssistant General Counsel, Office of the General Counsel\nU.S. Department of Education\n400 Maryland Ave SW., Room 6E231\nWashington, DC 20202\nRe: Docket ID: ED-2017-OS-0074\n\nDear Ms. Malawer:\n\nOur comment is in regards to regulations and sub-regulatory guidance issued by the U.S. Department of Education (DoEd), Rehabilitation Services Administration (RSA) for the purpose of implementing the integrated settings criteria under the definition of competitive integrated employment [34 CFR 361.5(c)(9)(ii) and 361.5(c)(32)(ii)] in the Workforce Innovation and Opportunity Act. These regulations and guidance are having an unintentional, but damaging, job-killing impact for people with significant disabilities. Specifically, RSAs guidance is indiscriminately disqualifying vocational rehabilitation job placements to certain nonprofit agencies (NPAs) based upon their participation in the congressionally-mandated U.S. AbilityOne Program. \nThe language in the integrated settings criteria publicized by RSA restricts access to quality competitive integrated jobs for people with disabilities and is inconsistent with other parts of the regulation, the departments longstanding practice and technical guidance. My states vocational rehabilitation (VR) agency is one of at least 19 states VR that has stopped referring and placing individuals with disabilities through NPAs that participate in the AbilityOne Program. \n{Phoenix-Huntsville} is an NPA participating in the AbilityOne Program and creates {hundreds} of jobs. Because referrals and placements from state vocational rehabilitation counselors have ceased, employment opportunities at my agency are going un-filled. Deserving individuals with significant disabilities are denied these opportunities and the ability to be a vital part of our community. 
\nWe request that the DoEd immediately rescinds the FAQ guidance (posted on DoEds website, https://www2.ed.gov/about/offices/list/osers/rsa/wioa/competitive-integrated-employment-faq.html ) related to the definition of integrated settings and issue clarifying guidance and that employment at community rehabilitation programs, including employment positions funded through the AbilityOne program, may be considered competitive integrated employment as long as it meets the criteria defined in RSA-TAC-06-01 and the WIOA (P.L. 113-128). \nThank you for the opportunity to comment on existing regulations that eliminate jobs, or inhibit job creation.\nSincerely,\nEarl Grilliot\nPhoenix of Huntsville\n2939 Johnson Rd.\nHuntsville, AL 35805\n\n"


Create a DataFrame of states and ZIP codes from the cleaned comments dataset (duplicate comments intact) to use in a map.

In [9]:
# Select the columns and drop null values in one step. dropna() returns a
# new DataFrame, which avoids the SettingWithCopyWarning raised when
# dropna(inplace=True) is called on a slice of data.
map_data = data[['doc.state', 'doc.zip']].dropna()


Generate a choropleth map by state showing where comments came from, and list the states and ZIP codes that sent the most comments. The duplicate_count column holds the number of comments per state or ZIP code.

In [10]:
state_data = map_data.groupby(['doc.state']).size().reset_index(name='duplicate_count')
state_data = state_data.sort_values('duplicate_count', ascending=False).reset_index(drop=True)

zip_data = map_data.groupby(['doc.zip']).size().reset_index(name='duplicate_count')
zip_data = zip_data.sort_values('duplicate_count', ascending=False).reset_index(drop=True)

display(state_data[:5], zip_data[:5])

Unnamed: 0,doc.state,duplicate_count
0,CA,1464
1,NY,727
2,WA,441
3,FL,423
4,PA,383


Unnamed: 0,doc.zip,duplicate_count
0,10025,18
1,11215,18
2,10011,15
3,10023,13
4,95060,12
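Grouping by ZIP code only works if each ZIP is written the same way in every row. The sketch below shows one way to normalize values before counting. It assumes the column might contain ZIP+4 strings, integers, or values that lost a leading zero; whether the real data has these problems has not been checked here. The `normalize_zip` helper and the `zip5` column are hypothetical names.

```python
import pandas as pd

# Toy rows illustrating the formats assumed above.
toy = pd.DataFrame({
    "doc.state": ["NY", "NY", "CA", "MA"],
    "doc.zip": ["10025", "10025-1234", 95060, "2139"],
})

def normalize_zip(value):
    """Return a 5-character ZIP string, or None if the value has no digits."""
    # Keep only the part before any ZIP+4 suffix, then keep its digits.
    digits = "".join(ch for ch in str(value).split("-")[0] if ch.isdigit())
    # Restore leading zeros and cap the length at five characters.
    return digits.zfill(5)[:5] if digits else None

toy["zip5"] = toy["doc.zip"].map(normalize_zip)
zip_counts = toy["zip5"].value_counts()
print(zip_counts)
```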


In [11]:
state_geo = './resources/us-states.json'

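# Named comment_map to avoid shadowing the built-in map()
comment_map = folium.Map(location=[48, -102], zoom_start=3)

# folium.Choropleth replaces the deprecated Map.choropleth() method
folium.Choropleth(
    geo_data=state_geo,
    name='map',
    data=state_data,
    columns=['doc.state', 'duplicate_count'],
    key_on='feature.id',
    fill_color='BuPu',
    fill_opacity=0.9,
    line_opacity=0.1
).add_to(comment_map)
folium.LayerControl().add_to(comment_map)

comment_map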
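Because `key_on='feature.id'` joins each doc.state value to a GeoJSON feature by its `id`, state values with no matching feature may not appear on the map. The sketch below checks for such values. It uses a toy GeoJSON string whose layout is only assumed to mirror ./resources/us-states.json, and `unmatched_keys` is a hypothetical helper.

```python
import json

# Toy GeoJSON; the real file is assumed to use state abbreviations as ids.
geojson_text = '''{"type": "FeatureCollection", "features": [
  {"type": "Feature", "id": "CA", "properties": {"name": "California"}, "geometry": null},
  {"type": "Feature", "id": "NY", "properties": {"name": "New York"}, "geometry": null}
]}'''

def unmatched_keys(geojson, keys):
    """Return the sorted keys that have no feature with the same id."""
    feature_ids = {feature.get("id") for feature in geojson["features"]}
    return sorted(set(keys) - feature_ids)

geo = json.loads(geojson_text)
# "ny" differs in case and "PR" has no feature in the toy file.
missing = unmatched_keys(geo, ["CA", "NY", "ny", "PR"])
print(missing)
```

With the real files, the keys would come from the doc.state column and the GeoJSON from `json.load` on the state file.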

Resources:
https://github.com/j2kao/fcc_nn_research/blob/master/proc_17_108_analysis_01_level_0_manual_tagging.ipynb