Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

allow to download all pages from a url that matches some rules. #166

Closed
videni opened this issue Oct 12, 2019 · 3 comments
Closed

allow to download all pages from a url that matches some rules. #166

videni opened this issue Oct 12, 2019 · 3 comments
Labels
wontfix This will not be worked on

Comments

@videni
Copy link

videni commented Oct 12, 2019

for example, I want to download all pages that start with https://graphql.org/learn/

https://graphql.org/learn/queries/
https://graphql.org/learn/schema/

I've been searching this tools for years, but haven't found one. the reason why I need this is that I can make ebooks with Calibre easily. I tested plenty of tools, what suprises me is that they all let us to add pages one by one manully instead of batching pages by a simple regular expression.

@nmaier
Copy link
Member

nmaier commented Oct 12, 2019

Do you want to download the pages itself, or things from those pages?

@videni
Copy link
Author

videni commented Oct 13, 2019

the pages itself, seems it is out of scope of this project.

@nmaier nmaier added the wontfix This will not be worked on label Oct 13, 2019
@nmaier
Copy link
Member

nmaier commented Oct 13, 2019

Well, the pages itself we could download, but it is indeed out of scope to download the pages AND any other resources that make the page display correctly (the CSS, javascript, etc).

@nmaier nmaier closed this as completed Oct 13, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

2 participants