Timestamping the indexing #15

Open
TusharAgey opened this issue Oct 24, 2017 · 1 comment

Comments

@TusharAgey
Owner

The crawler will run periodically. On every run after the first, it should skip pages that are already indexed and unmodified, re-crawl and re-index pages that are already indexed but have since been modified, and crawl and index new pages that have not been indexed yet.
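The three cases above could be sketched roughly as follows. This is only an illustration, not the project's code: it assumes each indexed page has a stored "indexed at" timestamp and that the crawler can obtain some last-modified time for a page (for example from a file's modification time or an HTTP `Last-Modified` header). The names `Action`, `decide`, `record_indexed`, and the `indexed_at` dictionary are all hypothetical.

```python
from datetime import datetime, timezone
from enum import Enum
from typing import Dict, Optional


class Action(Enum):
    SKIP = "skip"        # already indexed and unmodified
    REINDEX = "reindex"  # already indexed but modified since
    INDEX = "index"      # not yet indexed


def decide(url: str,
           last_modified: Optional[datetime],
           indexed_at: Dict[str, datetime]) -> Action:
    """Pick what to do with a page on a repeat crawl.

    `indexed_at` is a hypothetical store mapping URL -> time of last indexing.
    """
    previous = indexed_at.get(url)
    if previous is None:
        return Action.INDEX
    # Assumption: if the modification time is unknown, re-index to be safe.
    if last_modified is None or last_modified > previous:
        return Action.REINDEX
    return Action.SKIP


def record_indexed(url: str,
                   indexed_at: Dict[str, datetime],
                   now: Optional[datetime] = None) -> None:
    """Timestamp the indexing of `url` (defaults to the current UTC time)."""
    indexed_at[url] = now or datetime.now(timezone.utc)
```

A crawler loop would call `decide` for each discovered page, crawl and index it unless the result is `Action.SKIP`, and then call `record_indexed`. Where the timestamps are persisted (database, file, index metadata) is left open here.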

@ypk4
Contributor

ypk4 commented Nov 2, 2017

This issue is similar to the "Selective indexing" issue (Issue #10).
