Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Strange Rebase Test #321

Closed
wants to merge 6 commits into from
Closed

Conversation

joka921
Copy link
Member

@joka921 joka921 commented Apr 13, 2020

  • SPARQL also allows escape sequences in PrefixedNames like rdfs:l,abel
    These were previously unsupported, which is now fixed.

  • We now also transform escape sequences in sparql literals to their
    correct form during index build.

  • Several changes to the Index class unit tests had to be made, because
    they used knowledge base elements like a instead of which is no
    longer supported by any of the parsers.

  • Disable the CTRE parser for now, since it becomes awefully slow with the
    the fixes for the prefixed names. TODO: Maybe we want to reimplement the
    old and wrong behavior and make CTRE a general "WikidataUnsafe" parser.

  • Get rid of misleading warning in case of whitespace at the end of a TTL
    file. Previously there was a "parsing of ttl has failed, but there is still content left" warning, although
    the remainder of the ttl input was only whitespace.

  • Made the ad_utility::hash_set also use absl.
    now we have completely removed the dependency from google::sparsehash and
    migrated to absl.

-- now we have completely removed the dependency from google::sparsehash and migrated to absl.
-- Note that so far, no unescaping is performed.
- Transitive Paths use HashSets which now also have undefined orderings with absl. Tests don't rely on this anymore

- The case, where a prefix was used with an empty "content" (e.g. <a> wd: <b>) was broken before,
  luckily there was a unit test and this is now fixed.
…TL file.

- Previously there was a "parsing of ttl has failed, but there is still content left" warning, although
  the remainder of the ttl input was only whitespace.

- This was due to a bug in the Parser's skipWhitespace() function which failed if the input consisted of ONLY whitespace. This is now fixed.
…ashSet

# Conflicts:
#	src/parser/TurtleParser.cpp
#	src/parser/TurtleParser.h
- also apply the normalization of literals correctly during index build time
- Adapt the Index unit tests to "legal" knowledge bases
- Disable the CTRE parser for now, since it becomes awefully slow with the PnameNS and PnLocal changes for some reasons.
- TODO: Maybe we want to renable the CTRE Parser with the old "wrong"
  behavior as a very fast way to parse Wikidata
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant