Output from expand_urls causes invalid Atom feeds #198

mikl · 2011-10-03T23:17:26Z

In atom.xml, the following line is used to escape and output the content:

    <content type="html">{{ post.content | expand_urls: site.url | xml_escape }}</content>

However, the output from expand_urls is not escaped, so if you have a code block in your code, you will get something like this:

&lt;p&gt;Mine looks like this:&lt;/p&gt;

&lt;p&gt;<div class='bogus-wrapper'><notextile><figure class='code'><figcaption><span> (solr.xml)</span> <a href='/downloads/code/solr.xml'>download</a></figcaption>
 <div class="highlight"><table><tr><td class="gutter"><pre class="line-numbers"><span class='line-number'>1</span>
<span class='line-number'>2</span>
<span class='line-number'>3</span>
<span class='line-number'>4</span>
<span class='line-number'>5</span>

ie. with escaped and unescaped HTML mixed together.

As the spec for text type decrees, an element with type="html"must only contain escaped HTML. It is also allowed to set `type="xhtml" and then have unescaped markup instead, but then the whole thing must be unescaped and valid XHTML.

So, long story short, using code blocks or similar tags yields invalid Atom feeds.

The text was updated successfully, but these errors were encountered:

imathis · 2011-10-03T23:21:14Z

Interesting. What do you think is the best fix here?

fhemberger · 2011-10-04T06:27:33Z

How about wrapping all content in CDATA tags instead of escaping them by hand?
If you use type="xhtml" you obviously have to wrap the content in a div giving the correct XML namespace: http://www.xml.com/pub/a/2005/12/07/handling-atom-text-and-content-constructs.html

Also, the  tag is converted to <!\u2013 more \u2013>, which throws warnings for the feed, see http://validator.w3.org/feed/

I'm using code blocks in my feed and it's validated correctly.

mikl · 2011-10-04T09:06:28Z

The CDATA solution could be better, but we need to be aware that CDATA doesn't nest, so we need to escape CDATA-end-tags (]]>) in the rendered content, so a blog post involving those doesn't end the CDATA envelope prematurely :)

fhemberger · 2011-10-04T09:38:04Z

We should add this CDATA end tag to the xml_escape perhaps.

imathis · 2011-10-04T14:05:17Z

@fhemberger or @mikl think you can manage a pull request for this? It sounds like you guys have a better idea on this than I do.

fhemberger · 2011-10-04T14:20:19Z

@imathis Aye, I'll have a look later on ...

fhemberger closed this as completed in 6315527 Oct 4, 2011

mikl mentioned this issue Mar 30, 2012

Invalid Atom feeds for code includes… #510

Closed

briansimmons pushed a commit to briansimmons/octopress that referenced this issue Aug 20, 2013

Adds CDATA sections to atom.xml, fixes imathis#198

6241d28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output from expand_urls causes invalid Atom feeds #198

Output from expand_urls causes invalid Atom feeds #198

mikl commented Oct 3, 2011

imathis commented Oct 3, 2011

fhemberger commented Oct 4, 2011

mikl commented Oct 4, 2011

fhemberger commented Oct 4, 2011

imathis commented Oct 4, 2011

fhemberger commented Oct 4, 2011

Output from expand_urls causes invalid Atom feeds #198

Output from expand_urls causes invalid Atom feeds #198

Comments

mikl commented Oct 3, 2011

imathis commented Oct 3, 2011

fhemberger commented Oct 4, 2011

mikl commented Oct 4, 2011

fhemberger commented Oct 4, 2011

imathis commented Oct 4, 2011

fhemberger commented Oct 4, 2011