Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Insert metadata in (dataset) pages, for (google)datasetsearch #335

Open
MBcode opened this issue Mar 4, 2022 · 4 comments
Open

Insert metadata in (dataset) pages, for (google)datasetsearch #335

MBcode opened this issue Mar 4, 2022 · 4 comments
Labels
enhancement New feature or request

Comments

@MBcode
Copy link
Contributor

MBcode commented Mar 4, 2022

Is your feature request related to a problem? Please describe.
I would like to find clowder datasets in https://datasetsearch.research.google.com/ and via other aggregators

Describe the solution you'd like
Insert json-ld into at least the dataset pages, and then an associated entry in the sitemap, so it can be crawled

Describe alternatives you've considered
Given the possibly very large numbers, we need to give just enough metadata to get in the right area, and then be able to follow the Linked-Data follow your nose pattern. This might mean allowing for (api) calls to get the file/etc metadata as needed.

Additional context
This will be done in stages, starting with the mapping of the dataset and file classes, which is in a draft-PR

@MBcode MBcode added the enhancement New feature or request label Mar 4, 2022
@MBcode
Copy link
Contributor Author

MBcode commented Mar 28, 2022

sitemap will come later, as well as some of the other possible metadata
Right now just starting w/two classes: File and Dataset, to product jsonld describing their attributes mapped to schema.org vocabulary terms In the end the two *.scala.html outputs can be taken from the output html and pasted into: https://validator.schema.org to that once the sitemap is there, all of those elements from the linked Datasets could end up in https://datasetsearch.research.google.com/

@MBcode MBcode changed the title Insert metadata in (dataset) pages, and add sitemap, for (google)datasetsearch Insert metadata in (dataset) pages, for (google)datasetsearch Mar 28, 2022
@MBcode MBcode mentioned this issue Mar 29, 2022
13 tasks
@MBcode
Copy link
Contributor Author

MBcode commented Apr 11, 2022

Dataset Files and the classes they hold instances of, all have to_jsonld methods, that get kicked off in the view; only other change was to signature of Utils.baseURL

@MBcode
Copy link
Contributor Author

MBcode commented Apr 15, 2022

After this is accepted, the next step is issue #351 to get the sitemap so the datasets can get crawled

@MBcode
Copy link
Contributor Author

MBcode commented Jul 14, 2022

pr comments have been closed for awhile, also not much sitemap feedback, so will look at other issues too

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant