feat (data): script to request tweets from twitter API #75

guillaume-salle · 2022-03-22T13:13:20Z

Objective: build the database with data covering as many days as possible.

📖 Describe what you want

Update the dataset script so that it can request specific tweets from the Twitter API based on their date or ID.

The script MUST save ALL the tweets received into CSV files in the data/raw/twitter directory, with the date and ID of the first and last tweets specified (in the file name?). Possible formats:

data/raw/twitter/[candidat_name]_[startdate]_[enddate].csv
data/raw/twitter/[candidat_name]_[first_id_tweet]_[last_id_tweet].csv
data/raw/twitter/[candidat_name]/[startdate]_[enddate]_[first_id_tweet]_[last_id_tweet].csv
data/raw/twitter/week_#x/[candidat_name]_[startdate]_[enddate]_[first_id_tweet]_[last_id_tweet].csv
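To make the options above concrete, here is a minimal sketch of how the third layout (one directory per candidate) could be built. The helper name `build_filename` and its parameters are hypothetical and not part of the existing script; only the `data/raw/twitter` directory comes from this issue.

```python
from datetime import date
from pathlib import Path

# Target directory named in this issue.
RAW_DIR = Path("data/raw/twitter")


def build_filename(candidate: str, start: date, end: date,
                   first_id: int, last_id: int) -> Path:
    """Hypothetical helper: build the path for one candidate's CSV file.

    Follows the pattern
    [candidat_name]/[startdate]_[enddate]_[first_id_tweet]_[last_id_tweet].csv
    with dates written as YYYY-MM-DD (an assumption, the issue does not
    fix a date format).
    """
    name = f"{start.isoformat()}_{end.isoformat()}_{first_id}_{last_id}.csv"
    return RAW_DIR / candidate / name
```

Keeping both the dates and the tweet IDs in the name means a later run can be resumed from either one without opening the file.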

One point needs particular attention: the script should collect results in small chunks and save them incrementally, to avoid issues related to cache memory, disk space, or similar. Create a tmp directory where the small portions are stored and after that the script

This script should be designed to be launched periodically, every week (or every day?), and to collect a specified number of tweets about each candidate. The number of tweets per day and per candidate is yet to be determined.

✔️ Definition of done

A functioning script is written.
A format for the filename is chosen.
The script creates a tmp directory where it saves small chunks of the total results.
The script concatenates all the chunks into a final CSV file.
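The last two points could be sketched as follows, using only the standard library. This is an illustration under assumptions, not the implementation merged for this issue: the function names, the chunk file naming, and the column list `FIELDS` are all hypothetical.

```python
import csv
import shutil
from pathlib import Path
from typing import Dict, Iterable, Iterator, List

# Assumed columns; the real script may keep more fields per tweet.
FIELDS = ["id", "created_at", "text"]


def batched(rows: Iterable[Dict[str, str]], size: int) -> Iterator[List[Dict[str, str]]]:
    """Yield lists of at most `size` rows without loading everything at once."""
    chunk: List[Dict[str, str]] = []
    for row in rows:
        chunk.append(row)
        if len(chunk) == size:
            yield chunk
            chunk = []
    if chunk:
        yield chunk


def save_chunks(rows: Iterable[Dict[str, str]], tmp_dir: Path, size: int = 100) -> List[Path]:
    """Write the rows to numbered CSV chunk files inside tmp_dir."""
    tmp_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, chunk in enumerate(batched(rows, size)):
        path = tmp_dir / f"chunk_{i:05d}.csv"
        with path.open("w", newline="", encoding="utf-8") as fh:
            writer = csv.DictWriter(fh, fieldnames=FIELDS)
            writer.writeheader()
            writer.writerows(chunk)
        paths.append(path)
    return paths


def concatenate(tmp_dir: Path, final_path: Path) -> int:
    """Merge all chunk files into final_path, then delete tmp_dir.

    final_path must live outside tmp_dir. Returns the number of rows written.
    """
    final_path.parent.mkdir(parents=True, exist_ok=True)
    total = 0
    with final_path.open("w", newline="", encoding="utf-8") as out:
        writer = csv.DictWriter(out, fieldnames=FIELDS)
        writer.writeheader()
        # Zero-padded names make lexical order match write order.
        for path in sorted(tmp_dir.glob("chunk_*.csv")):
            with path.open(newline="", encoding="utf-8") as fh:
                for row in csv.DictReader(fh):
                    writer.writerow(row)
                    total += 1
    shutil.rmtree(tmp_dir)
    return total
```

Because each chunk is written to disk as soon as it is full, an interrupted run loses at most one chunk, and the chunks already in the tmp directory can still be concatenated afterwards.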

This script should FIRST be tested ONLY with small numbers of tweets requested from the API, in order to preserve the number of tweets we can still request: for instance, 1k tweets for 2 or 3 candidates. The person testing the script should carefully check the points above.

This script will be used for larger amounts after the pull request is validated.


madvid · 2022-03-22T15:04:54Z

Actually, you do not need to write a new script, only add a feature to the existing one.

madvid · 2022-03-24T11:46:03Z

The command:

poetry run python -m src data --download twitter --mention Melenchon --start_time '2022-03-18 8:00' --end_time '2022-03-18 22:00'
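How the existing script parses `--start_time` and `--end_time` is not shown in this thread. As a hedged illustration, the Twitter API v2 search endpoints expect timestamps in the form `YYYY-MM-DDTHH:mm:ssZ`, so values like the ones above would need a conversion along these lines. The function name is hypothetical, and treating the input as UTC is an assumption.

```python
from datetime import datetime, timezone


def to_api_timestamp(value: str) -> str:
    """Convert 'YYYY-MM-DD H:MM' into 'YYYY-MM-DDTHH:MM:SSZ'.

    Assumption: the command-line value is already expressed in UTC.
    """
    parsed = datetime.strptime(value, "%Y-%m-%d %H:%M").replace(tzinfo=timezone.utc)
    return parsed.strftime("%Y-%m-%dT%H:%M:%SZ")
```

For example, `to_api_timestamp("2022-03-18 8:00")` gives `"2022-03-18T08:00:00Z"`.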

madvid · 2022-03-25T11:25:16Z

At this time, the part concerning:

One point needs particular attention: the script should collect results in small chunks and save them incrementally, to avoid issues related to cache memory, disk space, or similar. Create a tmp directory where the small portions are stored and after that the script.

is not implemented yet.

guillaume-salle added the fixme (This issue will be soon fixed) label Mar 22, 2022

guillaume-salle added this to the 3 Create inference dataset milestone Mar 22, 2022

guillaume-salle added the feature (New feature) label Mar 22, 2022

guillaume-salle assigned madvid and guillaume-salle Mar 22, 2022

guillaume-salle mentioned this issue Mar 22, 2022

dvc (data): save tweets to build database & push to DVC #76

Closed

guillaume-salle mentioned this issue Mar 26, 2022

79 feature data add a feature to get the output of a model on given raw data #81

Merged

guillaume-salle removed their assignment Mar 27, 2022

ezalos modified the milestones: 3 Create inference dataset, Download Datas Mar 28, 2022

madvid linked a pull request Mar 28, 2022 that will close this issue

75 feat data script to request tweets from twitter api #83

Merged

12 tasks

madvid mentioned this issue Mar 28, 2022

75 feat data script to request tweets from twitter api #83

Merged

12 tasks

madvid closed this as completed in #83 Mar 29, 2022
