-
replace import futures with import concurrent.futures
bianjiang committedJun 30, 2016
-
bianjiang committed
Dec 17, 2014 -
refactor, add search_for function, need more work to create crawler d…
…ynamically per request
bi0nji0ng committedDec 17, 2014 -
bianjiang committed
Dec 17, 2014
-
Added behavior for crawling specific tweet IDs both manually and in
batch mode.
cbuntain committedApr 3, 2014 -
Fixed two bugs. In one, the node queue size used an incorrect variable
name. In the other, the command to redistribute to other nodes wasn't being populated.
Cody Buntain committedApr 3, 2014
-
add support to clear a node's queues to get you a fresh start
bianjiang committedNov 13, 2013
-
bianjiang committed
Nov 11, 2013
-
shouldn't clear out the crawler queue after restart...
bianjiang committedNov 10, 2013 -
bug in recreating failed crawler
bianjiang committedNov 10, 2013 -
continue migrating local multiprocessing.Queue to redis queue
bianjiang committedNov 10, 2013
-
rewrite local crawler queue using redis; using lifo queue so that con…
…trol commands can be grab asap;
bianjiang committedNov 9, 2013
-
handle dead crawler gracefully: cleanup residue resources; and schedu…
…le it to restart after 30 mins
bianjiang committedNov 8, 2013 -
clean up leftover, if a crawler gets restarted
bianjiang committedNov 8, 2013
-
close redis connection when its done; otherwise, it causes error
Jiang Bian committedNov 7, 2013
-
add in function to get the full user object from a list of user ids; …
…and add in GET_USERS_FROM_IDS command; see tweetf0rm/client.py;
Jiang Bian committedNov 6, 2013
-
Jiang Bian committed
Nov 4, 2013
-
use app_key as id, and add-in support for restart the entire crawler
Jiang Bian committedNov 2, 2013
-
save empty file as well so we can track progress
Jiang Bian committedOct 31, 2013 -
adding auto-gzip to archived folder support
Jiang Bian committedOct 31, 2013
-
bianjiang committed
Oct 23, 2013 -
bianjiang committed
Oct 23, 2013 -
show full stack when error occurs, so we can catch what's wrong..
bianjiang committedOct 23, 2013 -
debug message to find out bad status error
bianjiang committedOct 23, 2013 -
bianjiang committed
Oct 23, 2013
-
bi0nji0ng committed
Oct 22, 2013 -
cmd_handlers specific for control adding new cmds based on the depth …
…of the network you want to crawl
bi0nji0ng committedOct 22, 2013 -
get rid of duplicated handlers
bi0nji0ng committedOct 22, 2013 -
bi0nji0ng committed
Oct 22, 2013 -
user_id is not necessary always be numbers in the code
bi0nji0ng committedOct 22, 2013 -
bi0nji0ng committed
Oct 22, 2013 -
bi0nji0ng committed
Oct 22, 2013 -
bi0nji0ng committed
Oct 22, 2013 -
bi0nji0ng committed
Oct 22, 2013