Sample XML:
<GENERAL_INFO><TITLE><![CDATA[Mobile Apple Devices (iPhones, iPads, and Smartwatches)]]></TITLE><SUMMARY><![CDATA[<p>This article highlights the key benefits and specifications of Apple iPhones, iPads, and Smartwatches.</p>]]></SUMMARY></GENERAL_INFO>
Code to fetch data from the XML:
from unstructured.partition.html import partition_html
# _html_text is assumed to hold the sample XML above as a string
_text = ' '.join([element.text for element in partition_html(text=_html_text)])
Is there any flag or function to enable extracting content from CDATA sections?
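A possible workaround, independent of any flag in unstructured: a minimal sketch using only the Python standard library, assuming the XML is well-formed (every CDATA section closed with `]]>`). `xml.etree.ElementTree` exposes CDATA content as ordinary element text, and `html.parser` can then strip the embedded `<p>` markup. The names `SAMPLE_XML`, `strip_tags`, and `extract_cdata_text` are hypothetical helpers made up for this illustration, not part of unstructured.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Well-formed copy of the sample XML (assumption: both CDATA sections end with "]]>").
SAMPLE_XML = (
    "<GENERAL_INFO>"
    "<TITLE><![CDATA[Mobile Apple Devices (iPhones, iPads, and Smartwatches)]]></TITLE>"
    "<SUMMARY><![CDATA[<p>This article highlights the key benefits and "
    "specifications of Apple iPhones, iPads, and Smartwatches.</p>]]></SUMMARY>"
    "</GENERAL_INFO>"
)


class _TextCollector(HTMLParser):
    """Collects only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def strip_tags(fragment: str) -> str:
    """Return the text of an HTML fragment with all tags removed (hypothetical helper)."""
    collector = _TextCollector()
    collector.feed(fragment)
    collector.close()  # flush any text still buffered by the parser
    return "".join(collector.chunks).strip()


def extract_cdata_text(xml_text: str) -> dict[str, str]:
    """Map each child tag of the root to its CDATA content as plain text (hypothetical helper)."""
    root = ET.fromstring(xml_text)  # the XML parser delivers CDATA as normal element text
    return {child.tag: strip_tags(child.text or "") for child in root}


fields = extract_cdata_text(SAMPLE_XML)
for tag, text in fields.items():
    print(f"{tag}: {text}")
```

Instead of `strip_tags`, the raw `child.text` of each element could be passed to `partition_html(text=...)` as in the snippet above, so that unstructured still handles the HTML inside the CDATA section.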
Thanks for the issue @PhaneendraGunda ! We'll discuss and follow up
PhaneendraGunda changed the title from "Unstrutured library is unable extract CDATA from the xml data" to "Unstrutured library is unable to extract CDATA from the xml data" on May 23, 2024.