This repository has been archived by the owner on Feb 28, 2023. It is now read-only.

Scalability (# of messages)? #5

Closed
muesliq opened this issue Nov 7, 2016 · 2 comments


muesliq commented Nov 7, 2016

Is there a limit on the number of messages? I successfully tested the script with roughly 13k messages (about 1.3 MB) in one conversation.

The script seems to cache the messages. Would it be more scalable to write the messages to a file incrementally instead of caching them?

Mincka (Owner) commented Nov 7, 2016

There is no limit on the number of messages that can be retrieved.

I know someone who has been able to retrieve a conversation with more than 80 000 tweets and 900 images. 😆

The script builds the entire conversation in memory, but only the parsed content, so I think the size stays manageable for the majority of conversations. The goal was to reverse the order of the conversation at the end, so that it is written to the file in chronological order.
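The in-memory approach described above can be sketched roughly as follows. This is an illustration only, not the script's actual code: the function name `build_chronological` is hypothetical, and it assumes that batches arrive newest-first and that each batch is itself ordered newest-first.

```python
from typing import Iterable, List


def build_chronological(batches: Iterable[List[str]]) -> List[str]:
    """Collect every batch in memory, then reverse once at the end.

    Assumption: batches arrive newest-first, and the messages inside
    each batch are newest-first as well.
    """
    collected: List[str] = []
    for batch in batches:
        # Keep only the parsed messages, not the raw responses.
        collected.extend(batch)
    # A single reversal turns newest-first into chronological order.
    collected.reverse()
    return collected
```

Memory use grows linearly with the number of parsed messages, which is consistent with the remark that it stays small for most conversations.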

However, I think it would also be possible to prepend the reversed content of each batch of 20 tweets to a file directly. I've already added a -raw-output switch to write the HTML content to a file after each request.
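A minimal sketch of the prepend idea, assuming each batch is a newest-first list of already-parsed lines. The helper name `prepend_batch` is hypothetical and not part of the script. Since a file cannot be prepended to in place, the sketch writes the new batch to a temporary file, copies the existing output after it, and then swaps the two.

```python
import os
import shutil
import tempfile
from typing import List


def prepend_batch(path: str, batch_newest_first: List[str]) -> None:
    """Put one reversed batch in front of the existing output file."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, text=True)
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as tmp:
            # Older messages go first, so reverse the batch.
            for line in reversed(batch_newest_first):
                tmp.write(line + "\n")
            # Then stream the previously written (newer) content.
            if os.path.exists(path):
                with open(path, "r", encoding="utf-8") as old:
                    shutil.copyfileobj(old, tmp)
        # Swap the temporary file into place.
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

Only one batch is held in memory at a time, but every call rewrites the whole file, so total disk I/O grows roughly quadratically with the length of the conversation.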

I am not yet sure of the best design for supporting additional output types. I was thinking about splitting the HTTP receiver and the parser completely, so that it would be possible to work on the parsing "offline".
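One possible shape for that split, sketched under assumptions: the receiver saves each raw response to a numbered file, and a separate step parses the saved files later with no network access. The names `save_raw_batch` and `parse_offline`, the file naming scheme, and the `parse` callback are all hypothetical; the real HTML parsing is not shown.

```python
import os
from typing import Callable, Iterator, List


def save_raw_batch(raw_dir: str, index: int, html: str) -> str:
    """Receiver side: store one raw response, numbered in fetch order."""
    os.makedirs(raw_dir, exist_ok=True)
    path = os.path.join(raw_dir, f"batch_{index:06d}.html")
    with open(path, "w", encoding="utf-8") as f:
        f.write(html)
    return path


def parse_offline(raw_dir: str, parse: Callable[[str], List[str]]) -> Iterator[str]:
    """Parser side: yield messages chronologically from the saved files.

    Assumption: batches were fetched newest-first, so the highest index
    holds the oldest messages, and `parse` returns them newest-first.
    """
    for name in sorted(os.listdir(raw_dir), reverse=True):
        if not name.endswith(".html"):
            continue
        with open(os.path.join(raw_dir, name), encoding="utf-8") as f:
            messages = parse(f.read())
        yield from reversed(messages)
```

Because the parser only reads files, a new output type could be developed by re-running `parse_offline` with a different `parse` function against the same saved responses.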

@muhittin

> I know someone who has been able to retrieve a conversation with more than 80 000 tweets and 900 images. 😆

165k tweets, 2262 images, 60 videos from a conversation 😎
