<script> and <iframe> tags should be returned as-is #45

Quantisan · 2012-07-29T09:24:17Z

No description provided.

bitboxer · 2013-03-31T07:51:07Z

👍 for this one.

mcepl · 2014-04-09T08:38:35Z

-1 from me ... html2text IMHO should be kept to the minimum. If you need anything more complicated, go and pre-/post-process its input/output.

bitboxer · 2014-04-09T15:36:42Z

The problem is that if you want to convert HTML from Wordpress to a Jekyll Markdown, you want to preserve script and iframe tags. They will be lost afterwards. You could create a parser that replaces them by a marker string and replace that marker string after the conversion, but it would be way nicer if this lib has an option for this. And less error prone.

mcepl · 2014-04-09T20:44:40Z

What in the world is the point of storing iframes in Jekyll? Anyway, some escaping of HTML elements ('<' => <) should be sufficient shouldn't it? That's what I meant as pre-/post-processing.

bitboxer · 2014-04-09T20:46:29Z

What is the point? Maybe I just want to preserve youtube iframes when converting my blog 😉 . Escape the HTML elements is really bad and is very error prone. Why do all this ugyl workarounds when html2text can do this easily.

Alir3z4 · 2014-04-09T22:58:09Z

Currently html2text does everything in one place, I guess @mcepl is right about pre-/post-processing. We need to implement such a functionality to enable other control that behavior and do what ever they want to without touching html2text directly and make the stuff dirty.

Of course we can pass any tag to prevent removing them and have an option on html2text but all these stuff would make it ugly as possible.

After all my -1 vote for this issue.

@bulletmark

…andard-input-when-running-under-python-3 Fix aaronsw#45 does not accept standard input when running under python 3 Fixes aaronsw#45 Thanks to: * Mark Blakeney @bulletmark * @djr7C4 * @willemw12

pombredanne pushed a commit to pombredanne/html2text that referenced this issue Oct 10, 2015

Update ChangeLog for issue aaronsw#45

50d863f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

<script> and <iframe> tags should be returned as-is #45

<script> and <iframe> tags should be returned as-is #45

Quantisan commented Jul 29, 2012

bitboxer commented Mar 31, 2013

mcepl commented Apr 9, 2014

bitboxer commented Apr 9, 2014

mcepl commented Apr 9, 2014

bitboxer commented Apr 9, 2014

Alir3z4 commented Apr 9, 2014

<script> and <iframe> tags should be returned as-is #45

<script> and <iframe> tags should be returned as-is #45

Comments

Quantisan commented Jul 29, 2012

bitboxer commented Mar 31, 2013

mcepl commented Apr 9, 2014

bitboxer commented Apr 9, 2014

mcepl commented Apr 9, 2014

bitboxer commented Apr 9, 2014

Alir3z4 commented Apr 9, 2014