New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Import arbitrary web pages. #1964

Closed
ralsina opened this Issue Aug 22, 2015 · 3 comments

Comments

Projects
None yet
2 participants
@ralsina
Member

ralsina commented Aug 22, 2015

There are conventions about where the "main content"of a page is.

A command could be created to use those conventions and get the "important" HTML out of a page and into a HTML input file for Nikola.

Yes, it will never be perfect, but it could work.

(Based on question by @jjconti in twitter)

@ralsina ralsina added the enhancement label Aug 22, 2015

@ralsina ralsina added this to the v7.6.5 milestone Aug 22, 2015

@jjconti

This comment has been minimized.

Show comment
Hide comment
@jjconti

jjconti Aug 22, 2015

Contributor

👍

Contributor

jjconti commented Aug 22, 2015

👍

@ralsina

This comment has been minimized.

Show comment
Hide comment
@ralsina

ralsina Aug 23, 2015

Member

Using this it should be rather simple: https://github.com/datalib/libextract

Member

ralsina commented Aug 23, 2015

Using this it should be rather simple: https://github.com/datalib/libextract

@ralsina ralsina self-assigned this Aug 23, 2015

@ralsina ralsina added the in progress label Aug 23, 2015

@ralsina

This comment has been minimized.

Show comment
Hide comment
@ralsina

ralsina Sep 2, 2015

Member

Implemented as a plugin.

Member

ralsina commented Sep 2, 2015

Implemented as a plugin.

@ralsina ralsina closed this Sep 2, 2015

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment