Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Commandline Options #1

Closed
domoench opened this issue Sep 4, 2014 · 3 comments
Closed

Add Commandline Options #1

domoench opened this issue Sep 4, 2014 · 3 comments

Comments

@domoench
Copy link
Owner

domoench commented Sep 4, 2014

  • Ability to specify start point of crawling
  • Ability to specify a config file denoting subpaths to ignore. For example, maybe the path www.site.com/users/ if you don't care to crawl every user page in the domain.
@domoench
Copy link
Owner Author

domoench commented Sep 5, 2014

First point done.

@domoench
Copy link
Owner Author

domoench commented Sep 5, 2014

Added ability to specify number of crawler threads in Commit 2b6c072

@domoench
Copy link
Owner Author

domoench commented Sep 6, 2014

Now that things are more efficient, the ability to specify domain paths to exclude from crawling seems unnecessary.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant