Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integration with Mercury Parser? #4191

Open
jesse-troy opened this issue Nov 14, 2019 · 2 comments
Open

Integration with Mercury Parser? #4191

jesse-troy opened this issue Nov 14, 2019 · 2 comments
Labels

Comments

@jesse-troy
Copy link

I have noticed that the Mercury parser is pretty good at getting some content that Wallabag fails to get. An example of this would be newyorker.com.

Would it be possible to utilize the Mercury parser either within Wallabag or as an external service?

I'm not really a coder, so I don't know where to start work on this but I have an idea for how it could work. Could I have a container with Mercury running and Wallabag would send the URL to the Mercury container, which would return the full contents to be saved in Wallabag? This would just be an alternative to the built in fetcher.

@j0k3r
Copy link
Member

j0k3r commented Nov 14, 2019

This might be an idea.
Maybe we can define a setting to enable it as a fallback when the default parser can't retrieve the data.

But for example, Mercury parser can't handle paywalled content when you define your credentials in wallabag.
As Mercury parser can be either in a docker or in an AWS Lambda, we only need to define the url of the parser.

@j0k3r j0k3r added the Feature label Nov 14, 2019
@tcitworld
Copy link
Member

As Mercury parser can be either in a docker or in an AWS Lambda, we only need to define the url of the parser.

Also through CLI https://github.com/postlight/mercury-parser#the-command-line-parser

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants