Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JSON: comments for types, and property URLs #48

Merged
merged 2 commits into from Dec 11, 2013

Conversation

dbs
Copy link
Contributor

@dbs dbs commented Aug 2, 2013

Having noticed that all types in all.json had empty comments, and that properties were missing their URL property,
I pulled these two commits together. In all likelihood, this is a matter of the schema.org site drifting away from the
old version of the scraper.

Diffing the old all.json and the new all.json shows only the expected changes.

dbs added 2 commits August 1, 2013 21:17
It seems likely that, at some point over the past year, the HTML
structure of the type pages changed to use a <div
property="rdfs:comment"> element to embed the comment, which broke the
scraper.

This enables us to greatly simplify the scraping process, and to make it
more consistent with properties via the use of get_inner_html().

Signed-off-by: Dan Scott <dan@coffeecode.net>
schema.org recently added URLs that resolve for properties, so let's
track those as well.

Signed-off-by: Dan Scott <dan@coffeecode.net>
@Iheartweb
Copy link

+1

mhausenblas added a commit that referenced this pull request Dec 11, 2013
JSON: comments for types, and property URLs
@mhausenblas mhausenblas merged commit f67e711 into mhausenblas:master Dec 11, 2013
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants