Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Curate Wikidata entries about components of the open science ecosystem #7

Closed
Daniel-Mietchen opened this issue May 1, 2018 · 15 comments
Labels
Curate Update or otherwise improve some piece of information help wanted Extra attention is needed mozsprint To remain compatible with other Mozsprint repos Review Share your thoughts on something existing

Comments

@Daniel-Mietchen
Copy link
Member

Some guidance is available at
https://www.wikidata.org/wiki/Wikidata:WikiProject_Open .

The focus here is on improving existing items. If you're not sure whether something you're interested in has or, or should have one, please leave a comment here or on the Discussion page of the above WikiProject.

@Daniel-Mietchen Daniel-Mietchen added mozsprint To remain compatible with other Mozsprint repos Review Share your thoughts on something existing Curate Update or otherwise improve some piece of information labels May 1, 2018
@Daniel-Mietchen
Copy link
Member Author

There is a dedicated Mozsprint project for mapping the open ecosystem (so not limited to science) via Wikidata: mozilla/global-sprint#237 .

@HeidiSeibold
Copy link

HeidiSeibold commented May 10, 2018

How do I make the connection to open science? Can I say "part of" "Open Science Tools"? How do we connect all the tools? I am quite new to wikidata...

@HeidiSeibold
Copy link

HeidiSeibold commented May 10, 2018

I created entries for open science movement and open science tool. Maybe this will help 🤓

Now I can add a tool via: "instance of" "open science tool" (see for example https://www.wikidata.org/wiki/Q52988856)

And then we can visualise all tools we collect during this sprint, e.g. with the wikidata-graph-builder

@Daniel-Mietchen
Copy link
Member Author

Thanks, @HeidiSeibold . I added a bit to these items. For pieces of software and file formats known to Wikidata,
http://wikidp.org/
provides a nice overview, e.g. see
http://wikidp.org/Q52988856 for OpenML and the discussion at
bioSyntax/bioSyntax#1 (pinging @emulatingkat ).

Yes, the graph builder and the Wikidata SPARQL endpoint could be used for some visualizations, so it would make sense to start collecting ideas for questions/ queries.

An example query not yet directly related to open science is Software applications ranked in descending order by the number of writable file formats .

The WikiDigi account on Twitter is tweeting lots of queries like these.

@bluerasberry
Copy link

@HeidiSeibold I tried to develop these also. Yes you did something clever by making these and yes you got the idea of Wikidata.

It is still easy to find the limitations of Wikidata's data models. I was looking at your "open science movement" item then cross checked it with other social movements, including women's rights movements, black rights movements, LGBT rights movements, etc. It seems that no one has yet modeled how to manage social movements in Wikidata.

Thanks for raising the issue and yes it would be nice at the end to come up with a data visualization. I will try more tomorrow.

@Daniel-Mietchen
Copy link
Member Author

Here's a graph builder query that @HeidiSeibold had shared this morning for open science tools indexed in Wikidata: https://angryloki.github.io/wikidata-graph-builder/?property=P31&item=Q52990223&mode=reverse .

Setting up items for these tools and services is one thing, annotating them with useful properties another. To get an idea of the kind of properties that are being used on items that are instances or subclasses of open science tools, queries like this one could be used:

SELECT DISTINCT ?property ?propertyLabel ?count
WITH {
  SELECT DISTINCT ?property (COUNT(*) AS ?count) WHERE {
    ?item wdt:P31/wdt:P279* wd:Q52990223 ;
    ?p [ ] .
    ?property a wikibase:Property;
                wikibase:claim ?p.
  }
  GROUP BY ?property 
  } AS %results WHERE {
  INCLUDE %results.
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
ORDER BY DESC(?count)
LIMIT 100

You can try it out here.

The more comprehensive the data, the more useful this query should become.

@bluerasberry
Copy link

I was looking at the Wikidata items for social movements and thinking about how to make sure that anyone who participates in the "open science movement" can tag their work and get due credit for whatever they do, however great or modest.

I was looking at citation structures and considering what kinds of work get less credit. Using academic conference presentations as a start, I wrote out a Wikidata data model for data of events.
https://www.wikidata.org/wiki/Wikidata:WikiProject_Event_Metadata

There is more to be developed but this is a start to collecting information about all sorts of events which broadcast information, and might be recorded and published media, but which are currently uncommon to cite. My goal here was to be able to take in conference programs.

I am not there but I made some movement on the issue.

@Daniel-Mietchen
Copy link
Member Author

I guess @fnielsen would like to take a look at that.

@Daniel-Mietchen
Copy link
Member Author

Having thought a bit more about this, I think instead of classifying tools or services as instances or subclasses of open science tool, it might be better to create a catalog of which the tools or services are a part.

@HeidiSeibold
Copy link

@Daniel-Mietchen I am not sure I understand how to do this. Can you clarify? Has there been any work in this direction so far?

@Daniel-Mietchen
Copy link
Member Author

@HeidiSeibold Catalogs are in use in various contexts, from artworks and astronomy to zoology, and there is a dedicated property for that: https://www.wikidata.org/wiki/Property:P972 . There is also
https://www.wikidata.org/wiki/Property:P360 for lists but I haven't looked much into this.

@Daniel-Mietchen
Copy link
Member Author

For some catalogue examples, see https://tools.wmflabs.org/scholia/catalogue/ .

@Daniel-Mietchen
Copy link
Member Author

We just had a similar discussion here at the Wikimedia Hackathon on how to index such lists of things, and on focus list of Wikimedia project (P5008) came up as an additional and possibly better option than catalogues.

@xolotl
Copy link
Member

xolotl commented May 18, 2018

Linking this issue to the one focused on open science tool mappings and visualizations #11

@xolotl
Copy link
Member

xolotl commented Sep 18, 2018

This issue was moved to OpenScienceRoadmap/roadmap#7

@xolotl xolotl closed this as completed Sep 18, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Curate Update or otherwise improve some piece of information help wanted Extra attention is needed mozsprint To remain compatible with other Mozsprint repos Review Share your thoughts on something existing
Projects
None yet
Development

No branches or pull requests

4 participants