-
bianjiang committed
Dec 17, 2014 -
bianjiang committed
Dec 17, 2014 -
refactor, add search_for function, need more work to create crawler d…
…ynamically per request
bi0nji0ng committedDec 17, 2014 -
bianjiang committed
Dec 17, 2014 -
bianjiang committed
Dec 17, 2014
-
Jiang Bian committed
Nov 22, 2014
-
Merge pull request #2 from cbuntain/master
Bug fixes when commands fail and need to be redistributed.
bianjiang committedApr 3, 2014 -
Added behavior for crawling specific tweet IDs both manually and in
batch mode.
cbuntain committedApr 3, 2014 -
Fixed two bugs. In one, the node queue size used an incorrect variable
name. In the other, the command to redistribute to other nodes wasn't being populated.
Cody Buntain committedApr 3, 2014
-
Merge pull request #1 from bitdeli-chef/master
Add a Bitdeli Badge to README
bianjiang committedFeb 5, 2014 -
bitdeli-chef committed
Feb 5, 2014
-
bianjiang committed
Nov 14, 2013 -
bianjiang committed
Nov 14, 2013 -
bianjiang committed
Nov 14, 2013 -
bianjiang committed
Nov 14, 2013 -
bianjiang committed
Nov 14, 2013 -
bianjiang committed
Nov 14, 2013 -
Releasing health topic follower dataset
bianjiang committedNov 14, 2013
-
add support to clear a node's queues to get you a fresh start
bianjiang committedNov 13, 2013 -
only keeps http proxies, free https proxies noramlly just won't work...
bianjiang committedNov 13, 2013
-
bianjiang committed
Nov 11, 2013
-
shouldn't clear out the crawler queue after restart...
bianjiang committedNov 10, 2013 -
bug in recreating failed crawler
bianjiang committedNov 10, 2013 -
continue migrating local multiprocessing.Queue to redis queue
bianjiang committedNov 10, 2013
-
rewrite local crawler queue using redis; using lifo queue so that con…
…trol commands can be grab asap;
bianjiang committedNov 9, 2013
-
handle dead crawler gracefully: cleanup residue resources; and schedu…
…le it to restart after 30 mins
bianjiang committedNov 8, 2013 -
clean up leftover, if a crawler gets restarted
bianjiang committedNov 8, 2013
-
close redis connection when its done; otherwise, it causes error
Jiang Bian committedNov 7, 2013
-
Merge branch 'master' of bitbucket.org:bianjiang/tweetf0rm
Jiang Bian committedNov 6, 2013 -
add in function to get the full user object from a list of user ids; …
…and add in GET_USERS_FROM_IDS command; see tweetf0rm/client.py;
Jiang Bian committedNov 6, 2013
-
change python2.7 to python in boostrap.sh and client.sh
bianjiang committedNov 5, 2013
-
update readme, and sample config file
Jiang Bian committedNov 4, 2013 -
Jiang Bian committed
Nov 4, 2013
-
use app_key as id, and add-in support for restart the entire crawler
Jiang Bian committedNov 2, 2013