Module to convert raw scraped data into a standardised format #5

adiah80 · 2020-05-12T22:17:04Z

Raw scraped data from Issue #4 needs to be processed before it can be used to train the models. We need a module that aggregates the raw data into a single dataset (a .csv file) containing the training features and labels.

Each tweet posted by an account the user follows should be treated as one data point. Every tweet the user interacted with (liked, retweeted, or commented on) should be labelled as a positive instance.
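
A minimal sketch of this labelling rule, in Python. The function name `label_tweets`, its parameters, and the ID format are hypothetical, since the raw output of the scraper is not specified here. It also assumes that tweets with no interaction count as negative instances, which the description above implies but does not state.

```python
def label_tweets(timeline_ids, liked_ids, retweeted_ids, commented_ids):
    """Map each tweet ID from followed accounts to 1 (interacted with) or 0.

    All arguments are iterables of tweet IDs (hypothetical input format).
    """
    # A tweet is positive if the user liked, retweeted, or commented on it.
    interacted = set(liked_ids) | set(retweeted_ids) | set(commented_ids)
    # Assumption: any tweet without an interaction is a negative instance.
    return {tweet_id: int(tweet_id in interacted) for tweet_id in timeline_ids}


labels = label_tweets(
    timeline_ids=["t1", "t2", "t3"],
    liked_ids=["t1"],
    retweeted_ids=[],
    commented_ids=["t3"],
)
print(labels)  # {'t1': 1, 't2': 0, 't3': 1}
```

Only tweets that appear in the followed-accounts timeline receive a label, so an interaction with a tweet from outside that timeline does not create a data point.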

Features should include the tweet text, the tweet's author, the global interaction metrics (like, retweet, and comment counts), and the time the tweet was posted.
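
One possible shape for the aggregated .csv, sketched in Python with only the standard library. The column names, the keys of the raw tweet dictionaries, and the `write_dataset` helper are all assumptions made for illustration; the real names depend on what the scraping module produces.

```python
import csv
import io

# Hypothetical column names for the features listed above, plus the label.
FIELDNAMES = [
    "text", "author", "like_count", "retweet_count",
    "comment_count", "tweeted_at", "label",
]


def write_dataset(raw_tweets, interacted_ids, out_file):
    """Write one CSV row per tweet; label is 1 if the user interacted with it."""
    writer = csv.DictWriter(out_file, fieldnames=FIELDNAMES)
    writer.writeheader()
    for tweet in raw_tweets:
        writer.writerow({
            "text": tweet["text"],
            "author": tweet["author"],
            "like_count": tweet["like_count"],
            "retweet_count": tweet["retweet_count"],
            "comment_count": tweet["comment_count"],
            "tweeted_at": tweet["tweeted_at"],
            "label": int(tweet["id"] in interacted_ids),
        })


# Made-up example records, standing in for scraped data.
raw = [
    {"id": "t1", "text": "Hello, world", "author": "alice", "like_count": 12,
     "retweet_count": 3, "comment_count": 1, "tweeted_at": "2020-05-12T10:00:00Z"},
    {"id": "t2", "text": "Another tweet", "author": "bob", "like_count": 0,
     "retweet_count": 0, "comment_count": 0, "tweeted_at": "2020-05-12T11:30:00Z"},
]

buffer = io.StringIO()  # stands in for an open .csv file
write_dataset(raw, interacted_ids={"t1"}, out_file=buffer)
print(buffer.getvalue())
```

When writing to a real file instead of a `StringIO` buffer, open it with `newline=""` so the `csv` module controls line endings. Tweet text containing commas or quotes is escaped by the writer automatically.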

More complex features can also be devised and included.

Akshat2430 · 2020-05-17T14:28:17Z

Dibs!

ajaysub110 · 2020-05-17T14:44:15Z

Can you please take up #9 first so that we have the scraping module ready?

adiah80 mentioned this issue May 12, 2020

Module for classifiers and interaction analysis #6

Open
