Upper and Lower bound of reading from Archives #1

Closed
LuchenTan opened this issue Jan 16, 2018 · 0 comments

Measure the maximum and minimum time needed to read tweets from the archives.
Task 1: Read each individual tweet JSON from the TREC 2016 data collection. Extract the "text" field, split the text on whitespace, and count the number of words. Record the time spent.
Task 2: Do Task 1 and also insert each individual tweet into MySQL. Record the time spent.
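
A minimal sketch of how the two tasks could be timed, not a definitive implementation. It assumes the archives are line-delimited JSON files (optionally gzip-compressed), which the issue does not state. The names `open_archive`, `time_tweets`, and `sink` are hypothetical. For Task 2, `sink` would be a function that performs the MySQL insert through whichever client library is in use; no specific driver is assumed here.

```python
import gzip
import json
import time


def open_archive(path):
    """Open a plain or gzip-compressed archive as text (assumed file layout)."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt", encoding="utf-8")
    return open(path, "r", encoding="utf-8")


def time_tweets(lines, sink=None):
    """Parse tweets, count whitespace-separated words, and time the loop.

    lines: iterable of strings, one JSON object per line (assumption).
    sink:  optional callable invoked once per tweet; for Task 2 this is
           where the MySQL insert would go (hypothetical hook).
    Returns (tweet_count, word_count, elapsed_seconds).
    """
    tweets = 0
    words = 0
    start = time.perf_counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines
        if not isinstance(record, dict):
            continue
        text = record.get("text")
        if text is None:
            continue  # records without a "text" field are not counted
        words += len(text.split())  # split() with no args splits on whitespace
        tweets += 1
        if sink is not None:
            sink(record)
    elapsed = time.perf_counter() - start
    return tweets, words, elapsed
```

Calling `time_tweets(open_archive(path))` would give the Task 1 timing, and calling it again with a `sink` that inserts into MySQL would give the Task 2 timing, so both measurements share the same read-and-parse loop.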