Step 1: Collect tweets from Twitter based on certain keywords that are relevant to ENDS. [Output file: tweetOutput.csv] (Completed)
Step 2: Filter out retweet (RT) tags. [Output files: allTweetsNoRtTag.txt (original), Step2Op4Staging.txt (working copy)] (Completed)
Step 3: Filter out spam tweets (obvious bots and advertisers). [Output files: unwantedTweets.txt, unwantedTweetsWithFrequency.txt] (Completed)
Step 4: Create a clean working data set with no spam tweets. [Output file: spamFreeData.txt] (Completed)
Step 5: Calculate user tweet frequency. [Output file: userFrequency.txt] (Completed)
Step 6: Review and shortlist users relevant to this research. (Pending)
Note: All input and output files are located in the Data folder.
Main code files are located at "ENDS/ENDSTweetAnalysis/ENDS/" in the repository.
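
Illustrative sketch of Steps 2 and 5: the snippet below is not taken from the repository. It only shows one way the retweet-tag removal and the per-user frequency count could look. It assumes that a retweet tag has the form "RT @user:" at the start of the tweet text and that each record is a (user, text) pair; the function names strip_rt_tag and user_frequency and the sample rows are hypothetical, and the real file formats may differ.

```python
import re
from collections import Counter

# Assumption: a retweet tag looks like "RT @user:" at the start of the text.
RT_TAG = re.compile(r"^\s*RT\s+@\w+:?\s*", re.IGNORECASE)


def strip_rt_tag(text):
    """Remove a leading 'RT @user:' marker and keep the rest of the tweet."""
    return RT_TAG.sub("", text)


def user_frequency(rows):
    """Count tweets per user; rows is an iterable of (user, text) pairs.

    Returns (user, count) pairs sorted by count, highest first.
    """
    counts = Counter(user for user, _ in rows)
    return counts.most_common()


# Hypothetical sample records, for illustration only.
rows = [
    ("alice", "RT @bob: trying a new vape flavor"),
    ("bob", "trying a new vape flavor"),
    ("alice", "e-cig shop opened downtown"),
]

cleaned = [(user, strip_rt_tag(text)) for user, text in rows]
freq = user_frequency(cleaned)
```

A sorted list of (user, count) pairs like freq is the kind of output that would make the Step 6 review easier, since the most active users appear first.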