You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With the current UI, I haven't found a good way to extract the multiple pieces of data in front of the elements.
The best I have come up with is to select the "p" element and apply a regex on the annotation, but that will only allow you to retrieve one value (such as the street or the telephone number)
This pattern of putting "floating" text outside of an html element seems pretty common, is there a good way of extracting them?
<p>
<span>Nombre de empresa:</span> Grupo Escape
<br/><br/>
<span>Tel:</span> 5860 1232 1233, 5845 6457 6457
<br/><br/>
<span><input class="DefBtn" type="submit" value="Contáctenos" onclick="location.href='/contact.php?cid=687770';"/></span>
<br/><br/>
<span>Street:</span> Eje 10 mz-32 Lote 3
<br/><br/>
<span>Colonia:</span> colonia Santa Catarina
<br/><br/>
<span>Código postal:</span> 13100
<br/><br/>
<span>Cuidad:</span> Tlahuac, Distrito Federal
<br/><br/> <span>Web:</span> <a href="http://www.grupoescape.com.mx">www.grupoescape.com.mx</a>
<br/><br/> </p> <h2>Mapa</h2>
<p>
The text was updated successfully, but these errors were encountered:
ainsleyc
changed the title
Getting
Getting "floating" html outside of html elements
Apr 25, 2014
ainsleyc
changed the title
Getting "floating" html outside of html elements
Getting "floating" text outside of html elements
Apr 25, 2014
First off, awesome project!
Looking at this sample page, there is a block of data as shown below:
http://tlahuac.wired.com.mx/687770/grupo-escape.html
With the current UI, I haven't found a good way to extract the multiple pieces of data in front of the elements.
The best I have come up with is to select the "p" element and apply a regex on the annotation, but that will only allow you to retrieve one value (such as the street or the telephone number)
This pattern of putting "floating" text outside of an html element seems pretty common, is there a good way of extracting them?
The text was updated successfully, but these errors were encountered: