本文致力于对占领华尔街运动的相关推特进行研究分析。首先对推特内容进行清洗和描述,并选择2012年1月27日的11992条推特进行进一步分析,发现推特数和发布人数之间存在幂律分布关系,凌晨和晚间是推特发布高峰。同时提取积极与消极言论,建立情感模型,并构建出给用户推荐Hashtag的推荐系统。最后根据推特评论构建出关系网络,经分析发现发现绝大多数用户集中在网络边缘,意见领袖数量较少且比较分散。
2011年9月17日,上千名示威者聚集在美国纽约曼哈顿,试图占领华尔街。人们通过互联网组织起来,宣称要反对美国政治中权钱交易、两党政争以及社会不公正的状况。抗议活动逐渐升级,逐渐成为席卷全美的群众性社会运动。而推特是西方民众发表观点的重要平台,从中可一窥舆情态势。网民对占领华尔街运动持怎样的态度?推特的发布在时间等维度上有何特点?评论的网络又存在怎样的特征?带着诸多好奇与疑问,我们展开了对推特评论的分析。
本文的研究思路是从整体到具体、由浅入深、由单独个人到社交关联。首先对整体推特文本进行分析描述,观察特征后,选择其中具体一天的本文展开深入研究。
对具体某天的推特文本首先进行总体描述,而后深入挖掘文本,包括建立情感分析模型与推荐系统,最后构建评论网络,对其进行描述分析,观察网民之间的互动关系。
解决的问题:对推特数据进行总体分析,计算每一天有多少条tweets,提取某一天的数据dat,存到硬盘里。
过程:解压文件,在python中打开,查看总行数与变量名,处理错误换行,用pandas正确分列,查看每一天的tweets数量,随机选择了1月27日,将27日的tweets保存为一个新的dat格式文件,那天有11992条tweets。
解决的问题:对1月27日的dat数据进行清洗和描述,进行文本分析,建立情感分析模型。
过程:首先读取1月27日的数据,查看总行数与变量名,处理错误换行,删除无意义字符。然后对数据进行统计和描述,当天共有11992条推文发出,来自4785名用户,有9442条原创内容,发文数量最多的作者名为owsar,一天发布了469条;而人气最高的推文是用户OccupyWallStNYC发布的,得到89次转载。此外,我们还发现推特数和发布人数之间存在幂律分布关系,绝大部分人在这一天中发布的推特数都很少,只有少数人发布的推特达到两位数。将发布推特数和概率进行拟合,发现二者呈现反比关系。以一天内发布的时间和推特数分别为横纵坐标绘制散点图,我们可以看到这天凌晨和晚上20时左右是推特发布的两个高峰时段,早上10时是发布推特最少的时间。由于该日为工作日,上午10时为工作、学习时间,可能导致发推数量进入低谷。而凌晨和晚间是个人使用网络的集中时间段,这也与我们认知中的生活习惯相吻合。
接着我们对该日的推特的内容进行了清洗,把内容列单独保存成txt文件,进行文本分析。首先打开文件,观察前几页评论内容,发现对于占领华尔街运动网民的情绪和态度基本以负面为主,几乎没有看到正面言论,有一些是反讽的语句,所以我们将包含“fuck”“shut up”“jerk”等愤怒情绪的评论定义为“negative”,将理性表达观点、参与讨论的评论定义为“positive”,如“OWS has left an indelible impression on the GOP race; time will tell what effect it has on the Democratic side politics Water Has A “Memory” |Watch:tcot tlot ows p2 teaparty”。我们各选择了五句negative和positive的评论作为训练集,选择了朴素贝叶斯分类器,构建情感分析模型。然后我们随机选择了几句评论进行试验,例如i've fallen in love with the badass members of Poland\'s parliament protesting ACTA :)Anonymous OWS SOPA,虽然我们没有定义badass为negative,但是模型做出了正确的识别,情感为positive的评论也都能够正确识别。经计算,此模型的总准确率为60.000000%。
解决的问题1:构建给用户推荐Hashtag的推荐系统,将数据整理为人、hashtag、次数的格式,对以上数据构建基于物的相似性的推荐系统。
过程:首先对tweets文本中的数据进行整理,构建出{用户名:{hashtag:次数}}的字典,而后定义系列函数,构建基于物的相似性的推荐系统。
解决的问题2:构建评论网络,对评论网络进行描述和分析。
过程:首先从推特文本中利用正则表达式提取出@的用户,将@用户和原用户一一对应,构建出关系网络列表,其中共有5333个节点,8247条边,平均度是1.5464,并对网络的平均连接度、平均邻度、紧密中心度进行了计算。而后根据数据绘图,观察图形发现绝大多数用户集中在网络边缘,意见领袖数量较少且较为分散。27号一日之内,绝大多数推特得到转发的次数都很少。
1.通过对内容的分析,发现关于占领华尔街运动的推特内容以负面言论为主,其中不乏言辞激烈的语句,多见俚语脏话。
2.推特数和发布人数之间存在幂律分布关系,绝大部分人在这一天中发布的推特数都很少,只有少数人发布的推特达到两位数。而生活中许多事情也都符合这一关系,比如商品市场中广为应用的长尾理论。
3.经过对推特转发网络的分析,发现绝大多数用户集中在网络边缘。在我们所选取的27日,没有出现明显的意见领袖,没有诞生出现象级推文。
with open("/Users/YOLLY/desktop/cjc/ows-raw") as f:
tweets = f.readlines()
len(tweets)
6900830
varNames = tweets[0].replace('\n', '').split(',')
varNames
['"Twitter ID"', 'Text', '"Profile Image URL"', 'Day', 'Hour', 'Minute', '"Created At"', 'Geo', '"From User"', '"From User ID"', 'Language', '"To User"', '"To User ID"', 'Source']
with open("/Users/YOLLY/desktop/cjc/ows-raw", 'w') as f:
right_line = ''
blocks = []
for line in tweets:
right_line += line.replace('\n', ' ')
line_length = len(right_line.split(','))
if line_length >= 14:
blocks.append(right_line)
right_line = ''
for i in blocks:
f.write(i + '\n')
with open("/Users/YOLLY/desktop/cjc/ows-raw", 'rb') as f:
chunk = f.readlines()
len(chunk)
6602186
import csv
clean_lines = (line.replace('\x00','') \
for line in chunk[1:])
lines = csv.reader(clean_lines, delimiter=',', \
quotechar='"')
import pandas as pd
df = pd.read_csv("/Users/YOLLY/desktop/cjc/ows-raw",\
sep = ',', quotechar='"',error_bad_lines=False)
Skipping line 5051743: expected 14 fields, saw 15 Skipping line 5254718: expected 14 fields, saw 15 Skipping line 5281095: expected 14 fields, saw 15 Skipping line 5481759: expected 14 fields, saw 15 Skipping line 5482014: expected 14 fields, saw 15 Skipping line 5482532: expected 14 fields, saw 15 Skipping line 5516605: expected 14 fields, saw 15 Skipping line 5709055: expected 14 fields, saw 15 Skipping line 5796658: expected 14 fields, saw 15 Skipping line 5927412: expected 14 fields, saw 15 Skipping line 5927419: expected 14 fields, saw 15 Skipping line 5927421: expected 14 fields, saw 15 Skipping line 5927451: expected 14 fields, saw 15 Skipping line 5927478: expected 14 fields, saw 15 Skipping line 6229621: expected 14 fields, saw 16 Skipping line 6245861: expected 14 fields, saw 17 Skipping line 6278728: expected 14 fields, saw 15 Skipping line 6350262: expected 14 fields, saw 15 Skipping line 6387321: expected 14 fields, saw 15 Skipping line 6388879: expected 14 fields, saw 15
df[:3]
Twitter ID | Text | Profile Image URL | Day | Hour | Minute | Created At | Geo | From User | From User ID | Language | To User | To User ID | Source | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 121813144174727168 | RT @AnonKitsu: ALERT!!!!!!!!!!COPS ARE KETTLIN... | http://a2.twimg.com/profile_images/1539375713/... | 2011-10-06 | 5 | 4 | 2011-10-06 05:04:51 | N; | Anonops_Cop | 401240477 | en | NaN | 0 | <a href="http://twitter.com/">... |
1 | 121813146137657344 | @jamiekilstein @allisonkilkenny Interesting in... | http://a2.twimg.com/profile_images/1574715503/... | 2011-10-06 | 5 | 4 | 2011-10-06 05:04:51 | N; | KittyHybrid | 34532053 | en | jamiekilstein | 2149053 | <a href="http://twitter.com/">... |
2 | 121813150000619521 | @Seductivpancake Right! Those guys have a vict... | http://a1.twimg.com/profile_images/1241412831/... | 2011-10-06 | 5 | 4 | 2011-10-06 05:04:52 | N; | nerdsherpa | 95067344 | en | Seductivpancake | 19695580 | <a href="http://www.echofon.com/"... |
from collections import defaultdict
data_dict = defaultdict(int)
line_num = 0
tweets = csv.reader((line.replace('\x00','') for line in chunk[1:]), delimiter=',', quotechar='"')
for i in tweets:
line_num +=1
data_dict[i[3]] +=1 # i[3] 是日期列
data_dict.items()[:]
[('', 1), ('http://a0.twimg.com/profile_images/1662841731/tree_6_normal.jpg', 1), ('2012-01-23', 14896), ('http://a3.twimg.com/profile_images/1678532567/upAv-4edf08b0b4d66_normal.jpg', 1), ('http://a3.twimg.com/profile_images/1797168631/Blue_Flame_Eagle__normal.jpg', 1), (' ... http://t.co/xYApxZYN #google #OWS #US"', 1), ('2011-10-24', 64571), ('2011-12-10', 33987), ('2011-12-11', 26604), ('2011-12-12', 50448), ('2011-12-13', 39982), ('2011-12-14', 39399), ('2011-12-15', 36426), ('2011-12-16', 28107), ('2011-12-17', 37983), ('2011-12-18', 29022), ('2011-12-19', 20893), ('2012-02-16', 12837), ('2012-02-17', 12468), ('2012-02-14', 13580), ('2012-02-15', 13248), ('2012-02-12', 11178), ('2012-02-13', 13600), ('2012-02-10', 14083), ('2012-02-11', 12920), ('2012-02-18', 4859), ('2011-12-07', 45440), ('2011-12-06', 46917), ('2011-12-05', 42429), ('2011-12-04', 42326), ('2011-12-03', 35760), ('2011-12-02', 43135), ('2011-12-01', 48062), ('2011-11-20', 99365), ('2011-11-21', 87818), ('2011-11-22', 81802), ('2011-11-23', 61471), ('2011-11-24', 40522), ('2011-11-25', 40930), ('2011-12-09', 43030), ('2011-12-08', 40733), ('2012-02-05', 13656), ('2012-02-04', 16591), ('2012-02-07', 14940), ('2012-02-06', 14446), ('2012-02-01', 16578), ('2012-02-03', 14371), ('2012-02-02', 14995), ('2012-02-09', 13944), ('2012-02-08', 13451), ('2011-11-11', 48997), ('2011-10-31', 64256), ('2011-11-28', 53505), ('2012-01-28', 11530), ('2011-11-29', 40152), ('2012-01-26', 15159), ('2012-01-27', 11992), ('2012-01-24', 14290), ('2012-01-25', 18812), ('2012-01-22', 11948), ('2011-10-29', 75741), ('2012-01-20', 19840), ('2012-01-21', 15459), ('2011-10-28', 87216), ('http://a2.twimg.com/profile_images/1706563831/tw_13557041_1324496816_normal.jpg', 1), ('2011-11-30', 59289), ('http://a2.twimg.com/profile_images/1545568775/dont_upload__1__normal.jpg', 1), ('2011-10-23', 64808), ('2011-10-22', 70368), ('2011-10-21', 75256), ('2011-10-20', 73159), ('http://a3.twimg.com/profile_images/1736694761/upAv-4f06b39e8faf7_normal.jpg', 1), ('2011-10-27', 137172), ('2012-01-31', 18682), ('2012-01-30', 27198), ('2011-10-26', 141729), ('http://a3.twimg.com/profile_images/1165782455/l_12d207f66b2b47a1a3ac1111c3b30f5f_normal.jpg', 1), ('2011-11-26', 44523), ('2011-11-27', 42614), ('http://a2.twimg.com/profile_images/1557875906/ghostshell02_normal.jpg', 1), ('2012-01-01', 33192), ('2012-01-02', 19529), ('2012-01-03', 22790), ('2012-01-04', 16827), ('2012-01-05', 11227), ('2012-01-06', 10770), ('2012-01-07', 14432), ('2012-01-08', 15209), ('2012-01-09', 16589), ('2011-11-01', 62286), ('http://a3.twimg.com/profile_images/1630750314/hh_normal.png', 3), ('2011-10-12', 67753), ('2011-10-13', 98954), ('2011-10-10', 78619), ('2011-10-11', 67596), ('2011-10-16', 165381), ('2011-10-17', 113628), ('2011-10-14', 148062), ('2011-10-15', 161802), ('2011-10-18', 103337), ('2011-10-19', 88874), ('http://a3.twimg.com/profile_images/1744758211/upAv-4f0b8929e74c8_normal.jpg', 1), ('http://a2.twimg.com/profile_images/1178248651/IMG000037_normal.jpg', 1), ('2012-01-17', 20692), ('2012-01-16', 15844), ('2012-01-15', 16264), ('2012-01-14', 15046), ('2012-01-13', 15683), ('2012-01-12', 17705), ('2012-01-11', 22665), ('2012-01-10', 16829), ('http://a0.twimg.com/profile_images/1429928340/1__28__normal.jpeg', 1), ('2012-01-19', 14533), ('2012-01-18', 18162), ('2011-11-06', 55511), ('2011-11-07', 52820), ('2011-11-04', 63529), ('2011-11-05', 65019), ('2011-11-02', 74554), ('2011-11-03', 92698), ('2011-10-07', 65238), ('2011-10-06', 49638), ('2011-10-09', 65097), ('2011-10-08', 65949), ('2011-11-08', 47453), ('2011-11-09', 46503), ('http://a1.twimg.com/profile_images/1227414827/Minas_veste_azul_2_normal.jpg', 1), ('http://a0.twimg.com/profile_images/1609943884/Occupy_USA-logo_normal.jpg', 1), ('http://a2.twimg.com/profile_images/1766564610/rainbow_body_squared_normal.jpg', 1), ('2011-12-30', 17367), ('2011-12-31', 14692), ('2011-11-15', 409075), ('2011-11-14', 59815), ('2011-11-17', 279349), ('2011-11-16', 126520), ('2011-10-30', 77813), ('2011-11-10', 52664), ('2011-11-13', 45470), ('2011-11-12', 38189), ('2011-11-19', 107747), ('2011-11-18', 154395), ('http://a0.twimg.com/profile_images/1687725111/hand_art_normal.jpg', 1), ('2011-12-26', 12515), ('2012-01-29', 27219), ('2011-12-29', 17100), ('2011-12-28', 19117), ('2011-12-25', 10538), ('2011-12-24', 14196), ('2011-12-27', 16772), ('2011-10-25', 74641), ('2011-12-21', 20091), ('2011-12-20', 24045), ('2011-12-23', 19031), ('2011-12-22', 21972)]
alist = ['2012-01-27']
df['Day'].isin(alist)
df[df['Day'].isin(alist)]
Twitter ID | Text | Profile Image URL | Day | Hour | Minute | Created At | Geo | From User | From User ID | Language | To User | To User ID | Source | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
6263754 | 162686269481156608 | The 27 Republican Bills That Aren't About Jobs... | http://a3.twimg.com/profile_images/1739660175/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:03 | N; | zengreen | 170208476 | en | NaN | 0 | <a href="http://www.tweetdeck.com"... |
6263755 | 162686280986140672 | For those interested in #FinancialFriday, the ... | http://a1.twimg.com/profile_images/1751422937/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:06 | N; | OccupyManJose | 386911819 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263756 | 162686293048954880 | The 27 Republican Bills That Aren't About Jobs... | http://a0.twimg.com/profile_images/1209829084/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:09 | N; | dohlink | 21453088 | en | NaN | 0 | <a href="http://www.tweetdeck.com"... |
6263757 | 162686297276817408 | RT @TheNewDeal: Dear Republicans, ALL YOUR ANC... | http://a1.twimg.com/profile_images/1361063617/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:10 | N; | CoolR1a | 289005244 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263758 | 162686299298476032 | #WTF?! G3t your FILTHY hand5 0ff my int3rnet! ... | http://a2.twimg.com/profile_images/1571529600/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:10 | N; | St0pFear | 372610625 | en | NaN | 0 | <a href="http://twitter.com/download/a... |
6263759 | 162686332718686208 | A new quarter of Occupy begins at UC Davis - h... | http://a1.twimg.com/profile_images/1768816974/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:18 | N; | symphily | 238744876 | en | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263760 | 162686364293406720 | RT @TheNewDeal: Dear Republicans, ALL YOUR ANC... | http://a0.twimg.com/profile_images/1162964653/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:26 | N; | Amy_Etkind | 15992672 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263761 | 162686372648452096 | RT @OccupyAustin: MT @kennethlipp: #Twitter ca... | http://a1.twimg.com/profile_images/1768327896/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:28 | N; | SilverJediShade | 333464348 | en | NaN | 0 | <a href="http://www.tweetdeck.com"... |
6263762 | 162686390763663360 | RT @TheNewDeal: Republicans are Worried About ... | http://a1.twimg.com/profile_images/1763408133/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:32 | N; | neutronneedle | 133609928 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263763 | 162686401643679744 | ICYMI: February 2009: UBS to Forfeit Secret Cl... | http://a2.twimg.com/profile_images/320055989/C... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:35 | N; | superguts | 56390535 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263764 | 162686401811451904 | 'Sorry, knowledge isn't free in your country' ... | http://a3.twimg.com/profile_images/1581242447/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:35 | N; | Daniel_James80 | 266994101 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263765 | 162686403224928256 | RT @TheNewDeal: Dear Republicans, ALL YOUR ANC... | http://a0.twimg.com/profile_images/1098021238/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:35 | N; | Violetrical | 67548940 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263766 | 162686404726501376 | MAY15 - MAY21 #NATOG8 . #OCCUPYCHICAGO http://... | http://a3.twimg.com/profile_images/1759977902/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:36 | N; | TrutherHipster | 465086652 | it | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263767 | 162686404730687488 | MAY15 - MAY21 #NATOG8 . #OCCUPYCHICAGO http://... | http://a1.twimg.com/profile_images/1782615823/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:36 | N; | Abdusters | 474932367 | it | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263768 | 162686404911054848 | RT @occupywallstnyc: You can now easily contac... | http://a2.twimg.com/profile_images/1134620212/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:36 | N; | micubita | 130863358 | en | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263769 | 162686405313691648 | MAY15 - MAY21 #NATOG8 . #OCCUPYCHICAGO http://... | http://a1.twimg.com/profile_images/1246546879/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:36 | N; | JuliusMassage | 253273286 | it | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263770 | 162686405934465024 | MAY15 - MAY21 #NATOG8 . #OCCUPYCHICAGO http://... | http://a2.twimg.com/profile_images/1248584533/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:36 | N; | WiziLeaks | 254250010 | it | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263771 | 162686408799174656 | Penny Stock Analyst Peter Leeds, publisher of ... | http://a0.twimg.com/profile_images/1608614983/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:37 | N; | OWSTimes | 399166033 | en | NaN | 0 | <a href="http://twitterfeed.com" ... |
6263772 | 162686421163978752 | Sen. Sanders Pledges to Introduce Legislation ... | http://a2.twimg.com/profile_images/1770144757/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:40 | N; | EcoJive | 338162588 | en | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263773 | 162686442370367488 | RT @TheNewDeal: Republicans are Worried About ... | http://a3.twimg.com/profile_images/1372742054/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:45 | N; | LipstickLibShow | 306872064 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263774 | 162686449727180800 | RT @TheNewDeal: Dear Republicans, ALL YOUR ANC... | http://a0.twimg.com/profile_images/1756094796/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:46 | N; | DutchessOfDork | 307535336 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263775 | 162686457931243520 | RT @phenytoin_E: If you have been swept away b... | http://a1.twimg.com/profile_images/1770495259/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:48 | N; | letfreedomspk | 148421672 | en | NaN | 0 | <a href="http://www.tweetcaster.com&qu... |
6263776 | 162686462863749120 | HEADS UP #occupy @Occupy_DC: Occupy the Alfalf... | http://a0.twimg.com/profile_images/1759086170/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:49 | N; | kleinjos | 371749764 | en | NaN | 0 | <a href="http://twitter.com/#!/downloa... |
6263777 | 162686484011417600 | Jumping on an outreach/actions meeting call w ... | http://a3.twimg.com/profile_images/1652774957/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:55 | N; | OccupyFreedomLA | 394186223 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263778 | 162686496111988736 | RT @OpWallStreet: #OccupyWallStreet #OWS : BR... | http://a3.twimg.com/profile_images/1781280880/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:57 | N; | mom4solidarity | 381050725 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263779 | 162686499178029056 | It's catching. RT @AnonOps: #Polish Parliamen... | http://a1.twimg.com/profile_images/1757455275/... | 2012-01-27 | 0 | 0 | 2012-01-27 00:00:58 | N; | Being_James | 15137167 | en | NaN | 0 | <a href="http://www.hootsuite.com"... |
6263780 | 162686517024800768 | RT @TheNewDeal: Dear Republicans, ALL YOUR ANC... | http://a1.twimg.com/sticky/default_profile_ima... | 2012-01-27 | 0 | 1 | 2012-01-27 00:01:02 | N; | craigsdaughter | 346153355 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263781 | 162686546225541120 | Jan Brewer pointing finger better than Obama b... | http://a1.twimg.com/profile_images/1780860320/... | 2012-01-27 | 0 | 1 | 2012-01-27 00:01:09 | N; | NoObamaZombie | 474176608 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263782 | 162686554500902912 | RT @OpWallStreet: #OccupyWallStreet #OWS : BR... | http://a3.twimg.com/profile_images/1719301054/... | 2012-01-27 | 0 | 1 | 2012-01-27 00:01:11 | N; | Wildey2 | 386653565 | en | NaN | 0 | <a href="http://twitter.com/">... |
6263783 | 162686555981488128 | RT @TheNewDeal: Dear Republicans, ALL YOUR ANC... | http://a2.twimg.com/profile_images/1782441286/... | 2012-01-27 | 0 | 1 | 2012-01-27 00:01:12 | N; | dilekdt | 193699557 | en | NaN | 0 | <a href="http://twitter.com/">... |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
6275716 | 163047679369297920 | RT @DecolonizeWS: Don't #Decolonize Your Mind,... | http://a3.twimg.com/profile_images/636387153/P... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:10 | N; | realtucsonn8tiv | 105675535 | en | NaN | 0 | <a href="http://twitter.com/">... |
6275717 | 163047688454152192 | @WeGotEd I am curating all the objections to #... | http://a2.twimg.com/profile_images/1668244520/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:12 | N; | kbucketusa | 260377929 | en | WeGotEd | 37666933 | <a href="http://www.socialoomph.com&qu... |
6275718 | 163047692996579328 | RT @JCActive: #OWS, #99% of USA, dare to go #b... | http://a2.twimg.com/profile_images/1661525666/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:13 | N; | WhoMakesTheFood | 423013004 | en | NaN | 0 | <a href="http://www.tweets60.com/"... |
6275719 | 163047714664361984 | RT @LCranston1939: Daily Kos: Wall Street dono... | http://a3.twimg.com/profile_images/1756080735/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:19 | N; | hotcoffeeandpie | 86164231 | en | NaN | 0 | <a href="http://levelupstudio.com/plum... |
6275720 | 163047741340123136 | [Dr. Pat] occupy london http://t.co/JkmurNoD ... | http://a3.twimg.com/profile_images/1744758211/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:26 | N; | occupyFlickr | 428174621 | it | NaN | 0 | <a href="http://props.org/" rel=&... |
6275721 | 163047753486839808 | RT @Orion_Anon: FBI arrests cops for rampant a... | http://a0.twimg.com/profile_images/1748716852/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:28 | N; | JamiaKelly | 377540405 | en | NaN | 0 | <a href="http://twitter.com/">... |
6275722 | 163047774840037376 | #RonPaul live in Lewiston, Maine - Re-tweet P... | http://a3.twimg.com/profile_images/1765779062/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:33 | N; | DebunkTheMedia | 171389416 | en | NaN | 0 | <a href="http://www.tweetdeck.com"... |
6275723 | 163047788861591552 | RT @Occupy55: #OO #OWS #OSF #OSJ #StopACTA #Tw... | http://a0.twimg.com/profile_images/1782845562/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:36 | N; | The99Percenters | 16988154 | en | NaN | 0 | <a href="http://twitter.com/">... |
6275724 | 163047804837691392 | RT @TheNewDeal: Eric Cantor Busted Insider Tra... | http://a0.twimg.com/profile_images/1152427324/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:40 | N; | myredmondonline | 65036335 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275725 | 163047807102631936 | #SOPA, #PIPA, #ACTA, HR 1981 Vs Internet Free... | http://a0.twimg.com/profile_images/1629408902/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:41 | N; | best_of_iran | 408019422 | it | NaN | 0 | <a href="http://www.bestoftheinternets... |
6275726 | 163047828044783616 | RT @99_film: Penn Badgley Sticks It To Bloombe... | http://a0.twimg.com/profile_images/1783890767/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:46 | N; | Krrrrrrristen | 325340415 | en | NaN | 0 | <a href="http://www.tweetcaster.com&qu... |
6275727 | 163047837049962496 | RT @TheNewDeal: GOP Presidential Candidates Te... | http://a1.twimg.com/profile_images/1223669720/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:48 | N; | careerfed | 19262912 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275728 | 163047844473872384 | RT @preciousliberty: Mark Levin caller - a neu... | http://a1.twimg.com/profile_images/1530342870/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:50 | N; | NNealWhitefield | 43598363 | en | NaN | 0 | <a href="http://www.hootsuite.com"... |
6275729 | 163047875482370048 | non-toxic, edible #food? right on, birke baeh... | http://a3.twimg.com/profile_images/1181177217/... | 2012-01-27 | 23 | 56 | 2012-01-27 23:56:57 | N; | completelybaked | 27754979 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275730 | 163047920806014976 | RT @TheNewDeal: THE ECONOMY IS IMPROVING UNDER... | http://a2.twimg.com/profile_images/1379751985/... | 2012-01-27 | 23 | 57 | 2012-01-27 23:57:08 | N; | 1Spinnellie | 21301931 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275731 | 163047998413221888 | Curious who in the 99% has no aspirations to b... | http://a3.twimg.com/profile_images/1771198372/... | 2012-01-27 | 23 | 57 | 2012-01-27 23:57:26 | a:2:{s:11:"coordinates";a:2:{i:0;d:41.80630000... | cxi | 7192292 | en | NaN | 0 | <a href="http://twitter.com/#!/downloa... |
6275732 | 163048008366301184 | RT @kgosztola: Remembering Howard Zinn: Noam C... | http://a1.twimg.com/profile_images/1779796483/... | 2012-01-27 | 23 | 57 | 2012-01-27 23:57:29 | N; | monic2420 | 347020278 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275733 | 163048087890296832 | @GOPLeader http://t.co/kkzwoPMt Rt #OWS #p2 #tcot | http://a3.twimg.com/profile_images/1587482756/... | 2012-01-27 | 23 | 57 | 2012-01-27 23:57:48 | N; | Kevin_byDesign | 22533825 | en | GOPLeader | 18891923 | <a href="http://twitter.com/">... |
6275734 | 163048180102070272 | Read @CREDOMobile's update on their campaign t... | http://a0.twimg.com/profile_images/1003438717/... | 2012-01-27 | 23 | 58 | 2012-01-27 23:58:10 | N; | Mlary77 | 92143637 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275735 | 163048192986988544 | RT @TheNewDeal: GOP Presidential Candidates Te... | http://a2.twimg.com/profile_images/1674180859/... | 2012-01-27 | 23 | 58 | 2012-01-27 23:58:13 | N; | ibnkafka | 14250699 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275736 | 163048288482885632 | Occupy protesters disrupt NY foreclosure aucti... | http://a1.twimg.com/profile_images/1395669560/... | 2012-01-27 | 23 | 58 | 2012-01-27 23:58:35 | N; | Vcrow | 16489200 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275737 | 163048312797282304 | Thanks! RT @OccupyPeninsula "OccupyRWC" is now... | http://a3.twimg.com/profile_images/1780629177/... | 2012-01-27 | 23 | 58 | 2012-01-27 23:58:41 | N; | OccupyRWC | 390244758 | en | OccupyPeninsula | 399791922 | <a href="http://twitter.com/">... |
6275738 | 163048344015470592 | Oakland! #OO #OSF #OSJ #OWS #OccupyOakland Rem... | http://a2.twimg.com/profile_images/1713618046/... | 2012-01-27 | 23 | 58 | 2012-01-27 23:58:49 | N; | Occupy55 | 409846832 | en | NaN | 0 | <a href="http://twitter.com/">... |
6275739 | 163048358712324096 | ALL NEED2know this Journey has been a #Devine ... | http://a3.twimg.com/profile_images/1734635386/... | 2012-01-27 | 23 | 58 | 2012-01-27 23:58:52 | N; | Anne124 | 29853783 | en | NaN | 0 | <a href="http://twitter.com/#!/downloa... |
6275740 | 163048530280316928 | RT @TheNewDeal: THE ECONOMY IS IMPROVING UNDER... | http://a0.twimg.com/profile_images/1106458161/... | 2012-01-27 | 23 | 59 | 2012-01-27 23:59:33 | N; | akabob2019 | 81951934 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
6275741 | 163048533652549632 | Twas an excellent day to shut Chase Bank DOWN!... | http://a2.twimg.com/profile_images/1763300132/... | 2012-01-27 | 23 | 59 | 2012-01-27 23:59:35 | N; | rosie2656 | 38104433 | en | NaN | 0 | <a href="http://twitter.com/#!/downloa... |
6275742 | 163048592767070208 | RT @LCranston1939: Daily Kos: Wall Street dono... | http://a0.twimg.com/profile_images/1567858757/... | 2012-01-27 | 23 | 59 | 2012-01-27 23:59:48 | N; | morphusbfly | 122309663 | en | NaN | 0 | <a href="http://levelupstudio.com/plum... |
6275743 | 163048603663867904 | .@ClaudeAdair we #ows agree, comrade. We need... | http://a0.twimg.com/profile_images/1700651771/... | 2012-01-27 | 23 | 59 | 2012-01-27 23:59:51 | N; | TheOccupyCult | 409122325 | en | ClaudeAdair | 385736746 | <a href="http://twitter.com/">... |
6275744 | 163048604293013504 | RT @OWSAtlanta: We're not leaving. Give the Pi... | http://a3.twimg.com/profile_images/94202611/li... | 2012-01-27 | 23 | 59 | 2012-01-27 23:59:51 | N; | connectedways | 19837809 | en | NaN | 0 | <a href="http://twitter.com/#!/downloa... |
6275745 | 163048621397389312 | Get the #Renewable #Energy Story Straight, Big... | http://a3.twimg.com/profile_images/1181177217/... | 2012-01-27 | 23 | 59 | 2012-01-27 23:59:55 | N; | completelybaked | 27754979 | en | NaN | 0 | <a href="http://twitter.com/tweetbutto... |
11992 rows × 14 columns
df0127 = df[df['Day'].isin(alist)]
type(df0127)
pandas.core.frame.DataFrame
df0127.to_csv("/Users/YOLLY/desktop/cjc/tweet0127.dat")