Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Populate graph on chunks/streams as bot crawls #15

Open
AlexParra03 opened this issue Mar 18, 2019 · 0 comments
Open

Populate graph on chunks/streams as bot crawls #15

AlexParra03 opened this issue Mar 18, 2019 · 0 comments
Labels
feature Enhancement to help the application help wanted Extra attention is needed

Comments

@AlexParra03
Copy link
Owner

When the spider finished crawling, all the data is sent all at once, making the client-side waiting for a long time. A JSON with the vertices (domain names, and node ids) and edges (destination id and source id) is sent. Spider.WWWtoJSON() contains this logic

It will be better if instead the client loads the network/graph as the bot is crawling the web.

To accomplish this, we need to send the new things crawled (not the whole state over and over, just the differences).

I was thinking in creating a GET route such as /getNeighbors?node=domain.com , then getting all the neighbors of the nodes received in client, and build the graph on the client, or something similar. (It won't matter much if a few neighbors are sent on every request?).
Or making a request every X seconds for the diff., but keeping routes organized.

@AlexParra03 AlexParra03 added help wanted Extra attention is needed feature Enhancement to help the application labels Mar 18, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature Enhancement to help the application help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant