Skip to content

Latest commit

 

History

History
24 lines (16 loc) · 1.2 KB

README.md

File metadata and controls

24 lines (16 loc) · 1.2 KB

Gather and analyze data from social media

Check out the ipython notebooks for examples.

Main functionality:

  • network_search.py
  • message_classification.py
  • twitter_stream_classification.py

network_search.py (Check out example ipython notebook)

Explores the messages, locations and connection of a user account of interest. Information is stored in an sqlite database created for the source_node. One potential risk is the exponential growth for each network depth. The user can counteract this in two ways: i) Retrieve information from only the top x% connections and ii) limit search to a specified network depth.

message_classification.py (Check out example ipython notebook)

Trains a text classifier on labeled examples. Then classifies tweets stored in an sqlite database to identify intersting candidates for further inspection.

twitter_stream_classification.py (Example ipython notebook comning soon.)

Identify interesting candidate tweets from the (keyword filtered) twitter stream. Use a trained text classifier to keep only tweets which fall into classes of interest with a minimal prediction probability.