Merge feature_update_graph_pattern #43
Commits on Apr 13, 2021
-
Commit e018a08
-
Commit d9e67c1
-
Commit 05741d9
Commits on Apr 22, 2021
-
Commit 4cfebac
Commits on Apr 24, 2021
-
Commit 109bc54
-
Commit 8378a34
Commits on Apr 30, 2021
-
Completely refactor d1lod package
The old Interface and Graph classes had a lot of cruft and were really confusing to keep straight. I've done a refactor that uses a different class structure that's a lot easier for me to understand. Hopefully it's easier for you too.

See the README for more info, including a fancy picture, but the Interface class is now wrapped into a top-level SlinkyClient class and the old Graph class is now tied into a SparqlTripleStore class. With the old setup, you had to instantiate an Interface and a Graph. Now you just instantiate a SlinkyClient and you're good to go.

Here's some more detail, copied from the README:

- `SlinkyClient`: Entrypoint class that manages a connection to DataONE, a triple store, and Redis for short-term persistence and delayed jobs
- `FilteredCoordinatingNodeClient`: A view into a Coordinating Node that can limit what content appears to be available based on a Solr query, e.g., a CN client that can only see datasets that are part of a specific EML project or in a particular region
- `SparqlTripleStore`: Handles inserting into and querying a generic SPARQL-compliant RDF triplestore via SPARQL queries. Designed to be used with multiple triple stores.
- `Processor`: Set of classes that convert documents of various formats (e.g., XML, JSON-LD) into a set of RDF statements

The old package code is left in the legacy submodule (to be deleted in the future) and its tests are still alive and working via pytest.
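To make the "one entry point" idea concrete, here is a minimal sketch of a client that owns a node client, a store, and a processor. Every name in it (`SketchClient`, `StaticNodeClient`, `InMemoryStore`, `title_processor`) is hypothetical and stands in for the real classes; it is not the d1lod API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]


@dataclass
class InMemoryStore:
    """Hypothetical stand-in for a SPARQL-backed triple store."""
    triples: List[Triple] = field(default_factory=list)

    def insert(self, new_triples: List[Triple]) -> None:
        self.triples.extend(new_triples)

    def count(self) -> int:
        return len(self.triples)


@dataclass
class StaticNodeClient:
    """Hypothetical stand-in for a filtered Coordinating Node client."""
    documents: Dict[str, dict]

    def get_document(self, pid: str) -> dict:
        return self.documents[pid]


def title_processor(pid: str, document: dict) -> List[Triple]:
    """Toy processor: converts a document into RDF-like triples."""
    return [(pid, "schema:name", document["title"])]


@dataclass
class SketchClient:
    """Single entry point that wires the node client and the store together."""
    node: StaticNodeClient
    store: InMemoryStore

    def process_dataset(self, pid: str) -> List[Triple]:
        document = self.node.get_document(pid)
        triples = title_processor(pid, document)
        self.store.insert(triples)
        return triples


client = SketchClient(StaticNodeClient({"doi:1": {"title": "A"}}), InMemoryStore())
client.process_dataset("doi:1")
```

The caller only ever touches `client`; the store and node client are implementation details it can swap out.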
Commit 478b62f
-
Commit 46009a7
-
Convert web front-end to use a SlinkyClient
This is pretty basic, but you can run this container and hit /get?id=foo to get the Slinky RDF for a given DataONE PID.
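Since DataONE PIDs often contain characters like `:` and `/`, the `id` parameter should be URL-encoded. A small sketch of building the request URL follows; the host and port are assumptions, and only the `/get?id=` path comes from the commit message.

```python
from urllib.parse import urlencode


def build_get_url(base_url: str, pid: str) -> str:
    """Build the /get URL for a PID, percent-encoding the identifier."""
    return f"{base_url.rstrip('/')}/get?{urlencode({'id': pid})}"


# Hypothetical local container address
url = build_get_url("http://localhost:8080/", "doi:10.5063/F1")
```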
Commit 062c562
-
Commit 2fca953
-
Commit a782ff9
Commits on May 1, 2021
-
Commit e1dd51f
Commits on May 4, 2021
-
Commit 85163f6
-
Commit fd6f7cb
Commits on May 12, 2021
-
Commit ac73868
-
Commit 104e190
-
Commit 777cf1f
-
Commit 04f9407
Commits on May 15, 2021
-
Commit be4d94a
Commits on May 18, 2021
-
Create new Virtuoso-specific store model
Closes #30.
I couldn't find a way to send very large SPARQL queries to Virtuoso, but Virtuoso does have an HTTP API that takes Turtle/NTriples/etc. Since this is specific to Virtuoso, I've made a separate model from SparqlTripleStore.
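As a rough illustration of uploading serialized RDF over HTTP instead of embedding it in a SPARQL query, the sketch below prepares (but does not send) a POST request. It assumes an endpoint that follows the SPARQL 1.1 Graph Store HTTP Protocol, where the target graph is passed as the `graph` query parameter. The endpoint URL, graph URI, and function name are assumptions, not what this commit actually uses.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_turtle_upload(endpoint: str, graph_uri: str, turtle: str) -> Request:
    """Prepare a POST that adds Turtle data to a named graph (not sent here)."""
    url = f"{endpoint}?{urlencode({'graph': graph_uri})}"
    return Request(
        url,
        data=turtle.encode("utf-8"),
        headers={"Content-Type": "text/turtle"},
        method="POST",
    )


# Hypothetical endpoint and graph URI
req = build_turtle_upload(
    "http://localhost:8890/sparql-graph-crud",
    "http://example.org/g",
    "<http://example.org/s> <http://example.org/p> <http://example.org/o> .",
)
```

Passing `req` to `urllib.request.urlopen` would send it. The payload travels as the request body, so it is not subject to whatever limit applies to query text.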
Commit 9783fb6
-
Change BlazegraphStore's default port
I don't know why I had tweaked this.
Commit 08cbd7a
-
Commit 025933b
-
Commit cb2ff80
-
Adjust logic for when update_job runs or doesn't
We don't want to run multiple update jobs at once and we also don't want to run update_job when the dataset queue is saturated. This change controls both of those scenarios.
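The two conditions can be sketched as a single guard. The function name, its parameters, and the saturation threshold are hypothetical; the real code presumably reads these values from its job queue.

```python
def should_run_update_job(
    active_update_jobs: int,
    dataset_queue_length: int,
    max_queue_length: int = 100,  # hypothetical saturation threshold
) -> bool:
    """Return True only if no update job is running and the queue has room."""
    if active_update_jobs > 0:
        return False  # never run two update jobs at once
    if dataset_queue_length >= max_queue_length:
        return False  # dataset queue is saturated; let workers catch up
    return True
```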
Commit 414fce3
-
Commit 9daeef9
-
Commit 65879b9
-
Make get_new_datasets_since query range-exclusive
This prevents repeated calls to get_new_datasets_since from inserting the most recent dataset over and over again
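In Solr range syntax, a curly brace marks an exclusive bound and a square bracket an inclusive one, so an exclusive lower bound skips the dataset whose timestamp equals the cursor. A minimal sketch, assuming a `dateModified` field and this helper name (neither is confirmed by the commit):

```python
def new_datasets_range(field: str, since: str) -> str:
    """Build a Solr range clause that excludes `since` itself."""
    # "{" = exclusive lower bound, "]" = inclusive upper bound
    return f"{field}:{{{since} TO NOW]"


clause = new_datasets_range("dateModified", "2021-05-18T00:00:00Z")
```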
Commit 729d615
Commits on May 19, 2021
-
Use response.content instead of response.text
Turns out you _really_ need to pass binary data to ElementTree because it'll treat your XML content as ASCII if you have requests/httpx decode it first.
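A small offline demonstration of why the bytes matter. It simulates an HTTP library decoding a UTF-8 body with the wrong charset (ISO-8859-1 is used here as an assumed wrong guess; the commit does not say exactly which decoding was applied) and compares that with handing the raw bytes to ElementTree.

```python
import xml.etree.ElementTree as ET

# What would arrive in response.content: raw UTF-8 bytes
raw = '<?xml version="1.0" encoding="UTF-8"?><title>Café</title>'.encode("utf-8")

# What response.text could look like if the charset is guessed wrongly
badly_decoded = raw.decode("iso-8859-1")

# Parsing bytes lets the parser honor the XML encoding declaration
from_bytes = ET.fromstring(raw).text

# Parsing the pre-decoded string bakes in the wrong decoding
from_text = ET.fromstring(badly_decoded).text
```

Here `from_bytes` keeps the accented character intact, while `from_text` does not.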
Commit 776f842
-
Commit dd233d9
-
Begin work refactoring setups/environments
We need a way to override default behavior depending on context. Development, production, etc. I'm not sure if I want to do this via configuration or via environment variables just yet. This'll all probably change once I start building Docker images.
Commit 21e4323
Commits on May 26, 2021
-
Commit d1af30a
-
Commit aab1c30
-
Prevent EMLProcessor from re-inserting identifier blank nodes
We use blank nodes for schema:identifier statements. If we reinsert the same dataset, we would get what are effectively duplicate blank nodes for things like identifier, i.e., if we inserted _twice_ a dataset where PersonA is a creator with ORCID O, we'd have two blank nodes for PersonA, both with the value O. This isn't wrong but it'll make queries harder. So now we query first and don't re-insert the schema:identifier blank node (and its triples) if an equivalent set of triples already exists.
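The query-first idea can be illustrated with an in-memory set of triples. This is a simplified sketch, not the EMLProcessor code: the graph is a plain Python set, the predicate names are shorthand strings, and both function names are hypothetical.

```python
import itertools

_bnode_ids = itertools.count()


def find_identifier_node(graph, subject, value):
    """Return the blank node already linking `subject` to `value`, if any."""
    for s, p, o in graph:
        if s == subject and p == "schema:identifier" and (o, "schema:value", value) in graph:
            return o
    return None


def add_identifier(graph, subject, value):
    """Insert an identifier blank node unless an equivalent one exists."""
    node = find_identifier_node(graph, subject, value)
    if node is None:
        node = f"_:b{next(_bnode_ids)}"
        graph.add((subject, "schema:identifier", node))
        graph.add((node, "rdf:type", "schema:PropertyValue"))
        graph.add((node, "schema:value", value))
    return node


graph = set()
first = add_identifier(graph, "PersonA", "O")
second = add_identifier(graph, "PersonA", "O")  # second insert is a no-op
```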
Commit a6ef121
-
Commit ae07392
-
Commit 1c315ed
-
Commit 9b35c2a
-
Commit 9958cf3
-
Commit 49d78bd
-
Commit 7eee7bd
-
Commit 2a8586b
-
Commit c3eadc4
-
Commit 7f64dbe
-
Commit c954903
-
Commit 1d43a69
Commits on May 28, 2021
-
Commit ee1907c
-
Commit 9eaec1e
-
Commit 6c60a8f
Commits on Jun 3, 2021
-
Commit da28ab4
-
The old code was just placeholder and the new code should be logically correct
Commit dffb273
-
Commit ee4e1a6
Commits on Jun 4, 2021
-
Commit fa74fa5
-
Commit c154540
-
Add in support for SOSO PropertyValue model for attributes
This commit also brings in a helper in processor_util, model_has_statement, to fill a gap in Redland.
Commit 7482b19
-
Commit 6d15143
-
Add count and format options to cli's get method
Get can now serialize turtle, ntriples, rdfxml, and jsonld. It can also now just return the number of triples in the graph instead of a serialization of the graph. This introduces rdflib and rdflib's jsonld plugin as dependencies because redland doesn't support jsonld.
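A sketch of how such options might be parsed and mapped to rdflib serializer names. The flag names (`--format`, `--count`), the program name, and the use of argparse are assumptions for illustration; only the four format names and the count behavior come from the commit message. The right-hand values are the format strings rdflib accepts in `Graph.serialize`.

```python
import argparse

# CLI format name -> rdflib serializer name
RDFLIB_FORMATS = {
    "turtle": "turtle",
    "ntriples": "nt",
    "rdfxml": "xml",
    "jsonld": "json-ld",
}


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="get-sketch")
    parser.add_argument("pid", help="DataONE PID to fetch")
    parser.add_argument("--format", choices=sorted(RDFLIB_FORMATS), default="turtle")
    parser.add_argument("--count", action="store_true", help="print the triple count only")
    return parser


def describe_output(argv):
    """Return 'count' or the rdflib format name the CLI would serialize with."""
    args = build_parser().parse_args(argv)
    if args.count:
        return "count"
    return RDFLIB_FORMATS[args.format]
```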
Commit d4ac71c
Commits on Jun 5, 2021
-
Commit 30e06a6
-
Commit aeec699
-
Remove RQ Dashboard from compose file
This is easy enough to run standalone
Commit 471dda8
-
Commit 8f1217d
-
Remove test for double-processing
I'm not really sure what to do about this yet. It's a bit painful to write code to handle every blank node pattern SOSO is going to throw at us. I might re-introduce this at some later point.
Commit 59ca2cb
-
Commit 91aa958
-
Commit d89cf23
-
Commit aa24cf2
-
Fix bug in FilteredCoordinatingNodeClient logic
I had originally designed the filtered client to take a base filter + an extra filter. I didn't account for the fact that the filtered client needs to manage three filters: (1) the base, (2) the actual filter of interest (e.g., SASAP-only, ARCTICA-only) and (3) the filter we use as the cursor to determine when there are new datasets. This makes the class fully aware of all of that.
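One way to picture the three filters is as Solr clauses that get ANDed together. The helper name and the example field names below are hypothetical, and the actual class may combine its filters differently.

```python
def combine_filters(base: str, interest: str, cursor: str) -> str:
    """AND together the base, interest, and cursor filters, skipping empty ones."""
    parts = [f for f in (base, interest, cursor) if f]
    return " AND ".join(f"({part})" for part in parts)


query = combine_filters(
    "formatType:METADATA",                         # (1) base filter (assumed)
    "projectText:SASAP",                           # (2) filter of interest (assumed field)
    "dateModified:{2021-06-01T00:00:00Z TO NOW]",  # (3) cursor filter (assumed field)
)
```

Keeping the three parts separate until the last moment means the cursor can advance without touching the base or interest filters.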
Commit 3928b6d
Commits on Jun 23, 2021
-
Commit 81f81c3
-
Commit 72b00fa
Commits on Jul 8, 2021
-
Commit 7ba9b9c
Commits on Aug 19, 2021
-
Commit 8d664dc
Commits on Nov 5, 2021
-
Commit 4b78c5c
-
Commit 16c9632
-
Refactor the Scheduler and SlinkyClient interactions to support service-based network addresses
Commit 50aa722
-
Add __init__.py to the iso folder to let the python packager know we need its contents
Commit 8e03b92
-
Refactor the scheduler to always pull an image to avoid using old cached images. Change the cli parameters to use the slinky cli. Use the slinky base image, which has the d1lod library installed on it
Commit 3051b99
-
Commit 832c7a2
-
Remove the 'docker' folder since the d1lod image is now being used by the scheduler and worker
Commit ae271ea
-
Commit b938f59
-
Commit 913791c
-
Commit 49e9c72