Skip to content

Different output when parsing HTML #790

@longcdf

Description

@longcdf

Hi,
I'm facing some strange behavior when trying to parse a HTML file using LibTidy.
I have a html file contains something likes this:

<li class="ikonpunkt span8"><a href="http://www.vegvesen.no/jobb/Ledige+stillinger" class="menuitem"><img alt="Ledige stillinger hvit" src="http://www.vegvesen.no/jobb/For+studenter+og+nyutdannede/nyheter/_attachment/872915/binary/1031072?_ts=14d1da193f0"/><h3>Ledige stillinger</h3></a></li>

I'm using LibTidy to parse this file.
And it randomly generate 2 outputs:
1 -

<li class="ikonpunkt span8"><a href="http://www.vegvesen.no/om+statens+vegvesen/kontakt+oss" class="menuitem"><img alt="Kontakt oss hvit" src="http://www.vegvesen.no/jobb/For+studenter+og+nyutdannede/nyheter/_attachment/873039/binary/1031133?_ts=14d1dd66378">
<h3>Kontakt oss</h3>
</a></li>

2-

<li class="ikonpunkt span8"><a href="http://www.vegvesen.no/om+statens+vegvesen/kontakt+oss" class="menuitem"><img alt="Kontakt oss hvit" src="http://www.vegvesen.no/jobb/For+studenter+og+nyutdannede/nyheter/_attachment/873039/binary/1031133?_ts=14d1dd66378"></a>
<h3><a href="http://www.vegvesen.no/om+statens+vegvesen/kontakt+oss" class="menuitem">Kontakt oss</a></h3>
</li>

It happens very randomly.
Could you please help me explain it and suggest a way to overcome this issue?
Thanks

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions