You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Raw source data can have character data, especially unicode, escaped in various ways.
During import this needs to be resolved and flagged. The verbatim data should be preserved, but the interpreted data stored in the db anywhere but the verbatim table should have properly unescaped characters:
Using a UnescapedVerbatimRedord wrapper class that does most of the job and remembers if any values have been modified so we can flag an issue: eee5d67
Raw source data can have character data, especially unicode, escaped in various ways.
During import this needs to be resolved and flagged. The verbatim data should be preserved, but the interpreted data stored in the db anywhere but the verbatim table should have properly unescaped characters:
&
&
&
U+0026
\x26
\046
\u00A9
\u{2F804}
\xA9
Apache commons has libraries for this: https://commons.apache.org/proper/commons-text/javadocs/api-release/org/apache/commons/text/StringEscapeUtils.html
The text was updated successfully, but these errors were encountered: