This repository has been archived by the owner on May 8, 2024. It is now read-only.

Fix missing roles in wikidata/missing in members of parliament file #252

Closed
MansMeg opened this issue Mar 20, 2023 · 57 comments

Comments

@MansMeg
Collaborator

MansMeg commented Mar 20, 2023

Some MPs are not included in the member_of_parliament.csv file. I guess they are missing the role in Wikidata.

Hence we should probably try to systematically identify the people who lack it. An example:

Gustaf Johnsson, Q111804160,
https://www.wikidata.org/wiki/Q111804160
You can also find him in the "Biografibanden", Band 2, p. 285

@fredrik1984
Collaborator

Maybe this is something to prioritize once all MPs have a wiki-id? Did Emil finish going through the MP list for the bicameral period and checking which ones have a wiki-id?

@MansMeg
Collaborator Author

MansMeg commented Mar 20, 2023

Yes. Let's wait for Emil's work. Then we can start this.

@fredrik1984

This comment was marked as resolved.

@salgo60
Contributor

salgo60 commented Apr 18, 2023

@fredrik1984

As you understand, Wikidata is not rocket science, and people like me are rather bold, so we get some edit wars.

  • I have made the decisions below:
    • Try to use the book Tvåkammarriksdagen as the bible
    • Occupation is sadly not well structured in WD. We have objects for occupations, but as I have shown in salgo60/HISCOKoder it is a small chaos out there
    • Party
      • We get a small war in sv:Wikipedia if something is displayed with the wrong label (most people in sv:Wikipedia don't understand WD or just hate it ;-()
      • For Swedish Church parishes, the user Larske has implemented a check so that when we pick a parish from Wikidata, the name the object had at that time is looked up, e.g. "Gävle församling" has its names stored with start/end dates in Q10512441#P2561
        • This is not implemented for parties, and we are waiting for maybe you to be an authority that tells us which party names refer to the same party, when they changed name, and what should be split into more parties
        • So I check in the book Tvåkammarriksdagen, and if it says (bf) I have created a new object Q110472693 that has P460 "said to be the same as". The SPARQL gets more difficult, BUT what is displayed in sv:Wikipedia (Axel Löfvander) and in Wikidata is "correct", and hopefully we get a better solution
        • My opinion is that Wikidata core lacks a function "give me this object with the name that was correct in 1934" (WD backlog)
      • Pictures are stored in Wikicommons, which has its own "Wikidata" (a Wikibase) implemented in the project Structured data on Commons (Commons:Structured_data, video). In Wikicommons we therefore have semantic interoperability with Wikidata and can set depicts (P180) to Q5969615, which is searchable with haswbstatement:P180=Q5969615
        "statements": {
            "P180": [
                {
                    "mainsnak": {
                        "snaktype": "value",
                        "property": "P180",
                        "hash": "95268a1b7ec81a7709cbf48f610db4ecb23067bb",
                        "datavalue": {
                            "value": {
                                "entity-type": "item",
                                "numeric-id": 5969615,
                                "id": "Q5969615"
                            },
                            "type": "wikibase-entityid"
                        }
                    },
                    "type": "statement",
                    "id": "M130845966$F092D723-3261-46BD-9CDA-FC50AC340F69",
                    "rank": "preferred"
                }
            ]
        }
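
A minimal sketch of how the depicted item could be read out of a statements fragment like the one above. The function name `depicted_items` and the trimmed `example` dictionary are assumptions made for illustration; only the key names are taken from the fragment itself.

```python
from typing import Any, Dict, List


def depicted_items(statements: Dict[str, Any], prop: str = "P180") -> List[str]:
    """Return the item ids (e.g. "Q5969615") stored under `prop`."""
    ids = []
    for statement in statements.get(prop, []):
        mainsnak = statement.get("mainsnak", {})
        # Only snaks of type "value" carry a datavalue.
        if mainsnak.get("snaktype") != "value":
            continue
        value = mainsnak.get("datavalue", {}).get("value", {})
        if value.get("entity-type") == "item" and "id" in value:
            ids.append(value["id"])
    return ids


# Trimmed copy of the fragment above (hash and statement id left out).
example = {
    "P180": [
        {
            "mainsnak": {
                "snaktype": "value",
                "property": "P180",
                "datavalue": {
                    "value": {
                        "entity-type": "item",
                        "numeric-id": 5969615,
                        "id": "Q5969615",
                    },
                    "type": "wikibase-entityid",
                },
            },
            "type": "statement",
            "rank": "preferred",
        }
    ]
}

print(depicted_items(example))
```

The function only walks the dictionary; fetching the JSON from Commons is left out on purpose.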
        

@fredrik1984
Collaborator

Thanks @salgo60 ! I appreciate your commitment!

@salgo60
Contributor

salgo60 commented Apr 18, 2023

It's @Ainali @miroli @tmtmtmtm @belteshassar @SchermanJ who started it, and as said we don't speak much with each other. The interesting queries that @Miroll (WD user Popperipop) wrote down, see #112, can give an indication of what he plans.

  • Also, I am inspired by what you do. It's the first time I see humanities-related metadata management that is in line with how we worked when I was a scrum master at SEB handling international transactions, "with 0 tolerance for mistakes"

FYI @fredrik1984

FYI:

image

image

@fredrik1984
Collaborator

This person https://www.wikidata.org/wiki/Q5804642 is missing i-ort (Heüman i Jönköping). Bio book 2, p. 159.

@fredrik1984

This comment was marked as resolved.

@salgo60
Contributor

salgo60 commented Apr 19, 2023

@fredrik1984 thanks

FYI:

@fredrik1984
Collaborator

@salgo60 yes indeed, the hierarchical tree of "party" formations and splits in the bio book is fascinating. What constitutes a party before 1900 is not always clear. But in the Swerik project, we have also decided to use the bio books as our bible. However, in the end, we might merge some of the party names that represent the same party but with different names.

We are currently working on a gold standard for name introduction for the whole period 1867–today, and that is why I post some of the issues we come across in doing that work. I appreciate that you help out correcting things on Wikidata!

@salgo60
Contributor

salgo60 commented Apr 20, 2023

@fredrik1984 you also have "vilde"

  • in Wikidata they suggest that "vilde" is an empty value for party, but I have started to create WD objects for different types of "vilde"

image

image

image

image

image

@Ainali

Ainali commented Apr 20, 2023

  • in Wikidata they suggest that "vilde" is an empty value for party, but I have started to create WD objects for different types of "vilde"

The items may be fine, but they should not be used to populate member of political party (P102) or parliamentary group (P4100).

@salgo60
Contributor

salgo60 commented Apr 20, 2023

The items may be fine, but they should not be used to populate member of political party (P102) or parliamentary group (P4100).

@Ainali
Right now they are listed as exceptions to the constraint P102#P2302, which is a "good solution" until someone decides. As @fredrik1984 states, "What constitutes a party before 1900 is not always clear", and until we get a "gold standard" maybe we need new properties.

image

Olof Karswall wrote a paper, "Historical Settlement Units as Linked Open Data". I guess we need something like that for political parties, parliamentary groups, active politicians who have left the party they were elected for, and independent politicians, perhaps from a project such as ParlaMint that includes more countries?

image

@fredrik1984
Collaborator

We will discuss the party issue before 1900 in the Swerik project. Back in that time "parties" were more lists or groups that voted for the same thing in the parliament. Although they were not parties in our modern definition I think it is reasonable to still tag them as parties.

"Vilde" sounds like a good category to be included, especially since our "bible" (the bio books) uses that term.

@salgo60
Contributor

salgo60 commented Apr 21, 2023

My understanding is that @Ainali and @miroli are doing it right now as the rest of the Wikidata community e.g.

image

  • In Wikidata we also have a construct called end cause (P1534)
    • If we look at Kakabaveh, we should in my opinion have a WD object as end cause (P1534). Swedish sv:Wikipedia says: "On 28 August 2019 the executive committee of Vänsterpartiet decided to open expulsion proceedings against Kakabaveh. This was because she, according to party secretary Aron Etzler, "often and seriously damages trust in the party among voters and among her colleagues". Kakabaveh was accused, among other things, of not having been present at the 2018 vote on the prime minister, a vote in which Vänsterpartiet abstained. She was also accused of having failed to pay the party tax.

Kakabaveh claimed that she was not informed of the expulsion in advance but learned of it through the media. She left the party the same day at her own request"

  • Emma Carlsson Löfdahl WD Q4941925

    image
  • My understanding is that end cause is mostly used for position held (P39), but as I say above maybe it should be used for parties too. Structure is always good ;-)

    image

    • My vote is also that a project like ParlaMint should have a controlled "vocabulary" for end causes, so we get some kind of consistency. If ParlaMint is not possible, a start would be to do it in a project like "Välfärden analyseras". When I look at ELI, they suggest using controlled vocabularies, which I think is a good pattern.

@salgo60
Contributor

salgo60 commented Apr 21, 2023

@fredrik1984 another research "wet dream", I guess, is the connection between Valmyndigheten and the Swedish MP data; see my attempt from 2019, #85 "Valmyndigheten koppling Riksdagens öppna data". Maybe you can get that data as a researcher. I hope they have the Swedish "personnummer" in both systems, but I didn't get a good license.

It looks like the UK has coordinates for electoral districts in Wikidata (map below); see the example district Q3238840.

image

OT: in Wikidata we support areas (shapes), which is even better than a single coordinate, so it would be nice to get Swedish electoral districts as shapes.

image

@salgo60
Contributor

salgo60 commented Apr 23, 2023

@fredrik1984 Another related discussion on sv:Wikipedia concerns the names of parties and the need to use the name that was correct at the time something happened: "Wikipedia:Bybrunnen#Inkonsekventa_partinamn"

  1. Wikipedia lacks an authority
  2. Easy get this data as CC-0 and machine readable

@fredrik1984
Collaborator

https://www.wikidata.org/wiki/Q6178299 is missing i-ort (Sjödahl i Göteborg) and specific start dates in FK (20 March 1931). Source: bio book 4, p. 150.

He has not got a link to the Swedish portrait archive either.

@fredrik1984
Collaborator

https://www.wikidata.org/wiki/Q6012010 is missing i-ort (Nilsson i Mölndal). Source: bio book 4, p. 128.

He has not got a link to the Swedish portrait archive.

@BobBorges
Collaborator

Re the original issue, is this (image) what's missing that causes the people/wiki_ids to not get scraped into our metadata?

image

It's related to issue #265 and I would / could just run through our known_mps/catalog and add this attribute to the ones who don't have it.

@MansMeg
Collaborator Author

MansMeg commented Apr 24, 2023

@BobBorges I'm not sure, but I don't think so. @ninpnin knows the script that updates the MP database based on Wikidata. I would check how they are selected in the API query to Wikidata. I think they are selected based on role.

@BobBorges
Collaborator

So, it's these:

image

wd:Q10655178
wd:Q33071890
wd:Q81531912
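
A minimal sketch of how the role-based selection could be checked, assuming (as guessed earlier in the thread, not confirmed) that MPs are picked up through position held (P39) with one of the three role items above. The function names `build_role_query` and `missing_role` are hypothetical, the query is not the project's actual scraping query, and the id lists at the bottom are made-up input for illustration.

```python
from typing import Iterable, List

# Role items listed in the comment above.
ROLE_QIDS = ["Q10655178", "Q33071890", "Q81531912"]


def build_role_query(role_qids: Iterable[str]) -> str:
    """Build a SPARQL query for people holding any of the given roles."""
    values = " ".join(f"wd:{qid}" for qid in role_qids)
    return (
        "SELECT DISTINCT ?person WHERE {\n"
        f"  VALUES ?role {{ {values} }}\n"
        # p:/ps: is used so that statements of any rank are matched.
        "  ?person p:P39/ps:P39 ?role .\n"
        "}"
    )


def missing_role(known_ids: Iterable[str], ids_with_role: Iterable[str]) -> List[str]:
    """Return the known wiki ids that did not come back from the role query."""
    return sorted(set(known_ids) - set(ids_with_role))


print(build_role_query(ROLE_QIDS))

# Illustrative input only: which ids really carry the role is not known here.
known = ["Q111804160", "Q6012010"]
returned_by_query = ["Q6012010"]
print(missing_role(known, returned_by_query))
```

The query string can be pasted into the Wikidata Query Service; comparing its result with the known MP catalog gives the list of people whose role statement needs to be added.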

@MansMeg
Collaborator Author

MansMeg commented Apr 24, 2023

I guess so, but @ninpnin should confirm. Are the MPs missing in our database missing these attributes?

@BobBorges
Collaborator

We talked about it last week -- if I understand what's going on correctly -- this causes missing_member_of_parliament, and also some of the other missing_s are because without this attr the wiki_id doesn't make it into the query results in the first place.

@MansMeg
Collaborator Author

MansMeg commented Apr 24, 2023

Great! @salgo60 Do you have time to look at these MPs?

@salgo60
Contributor

salgo60 commented Apr 24, 2023

@BobBorges I can't check your list (I guess I lack permissions), so I'll do it in #121

@fredrik1984
Collaborator

https://www.wikidata.org/wiki/Q55955 is missing AK start date (16 March 1897). Source: https://portrattarkiv.se/details/sj9PGLAlnmUAAAAAABfN4g

@salgo60
Contributor

salgo60 commented Apr 24, 2023

@fredrik1984 FYI there is now a discussion on Wikipedia Bybrunnen about us not being consistent in how we use the names of Swedish parties. I suggest that instead of having just tables in articles we make this available as data in Wikidata or some other database.

"In our articles about older Swedish cities, market towns (köpingar) and municipalities (for example the article Strömstads stad) we are not consistent regarding party names"

  1. best would be if you had this data... and we could quote you 🚀 I guess

image

@fredrik1984
Collaborator

https://www.wikidata.org/wiki/Q6209073 has a slightly incorrect i-ort (correct should be Thorell i Stolp-Ekeby). Also, he is missing party for https://www.wikidata.org/wiki/Q10554125. Source: bio book 1, p. 271. https://portrattarkiv.se/details/sj9PGLAlnmUAAAAAABHgKQ

@fredrik1984
Collaborator

https://www.wikidata.org/wiki/Q6078273 is missing i-ort (Roos i Malmö). Source: bio book 3, p. 266.

@fredrik1984
Collaborator

@salgo60 from Swerik's perspective, if a party just changes its name (e.g. from bondeförbundet to centerpartiet) then it should be regarded as the same party. But bondeförbundet is not the same as jordbrukarnas riksförbund (the latter merged into the former in 1921).

@MansMeg
Collaborator Author

MansMeg commented Apr 24, 2023

Are you sure about this Fredrik? I think we might want an instance per name so folkpartiet becomes liberalerna. Even though the party as such has not changed, the data has. So we need to keep track of all the names (and their abbreviations) to map to the protocols, is my guess.

@fredrik1984
Collaborator

I guess my comment was more from a historical perspective. From a technical/curational point of view I would say you are right @MansMeg

@MansMeg
Collaborator Author

MansMeg commented Apr 24, 2023

Yes. When I think about it, it is not very clear how to define this. We could also say that we have multiple instances of a name for the same party.
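
One way to picture "multiple instances of a name for the same party" is a small table of names with validity periods, all pointing at one party identifier. This is only a sketch: the `PartyName` class, the `party_id` value "liberal" and the boundary year are placeholders for illustration and would have to be checked against the bio books before use.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PartyName:
    party_id: str         # hypothetical stable id shared by all names of a party
    name: str             # name as it appears in the protocols
    start: Optional[int]  # first year the name is valid (None = open)
    end: Optional[int]    # last year the name is valid (None = open)

    def covers(self, year: int) -> bool:
        after_start = self.start is None or year >= self.start
        before_end = self.end is None or year <= self.end
        return after_start and before_end


# Placeholder data: same party, two name instances.
NAMES = [
    PartyName("liberal", "folkpartiet", None, 2015),
    PartyName("liberal", "liberalerna", 2015, None),
]


def names_in_year(names: List[PartyName], party_id: str, year: int) -> List[str]:
    """All names a party could appear under in a given year."""
    return [n.name for n in names if n.party_id == party_id and n.covers(year)]


def party_for_name(names: List[PartyName], name: str, year: int) -> Optional[str]:
    """Map a name found in a protocol from `year` back to its party id."""
    for n in names:
        if n.name.lower() == name.lower() and n.covers(year):
            return n.party_id
    return None


print(names_in_year(NAMES, "liberal", 1990))
print(party_for_name(NAMES, "Liberalerna", 2020))
```

Keeping the historical question (is it the same party?) in `party_id` and the curation question (which string appears in the protocols?) in `name` lets both views coexist.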

@BobBorges
Collaborator

@salgo60 how confident are you in the iorts from the list you and @emil produced? If you're relatively confident they're mostly OK, I will add them all to Wikidata programmatically, which would sort out @fredrik1984 's comment about Roos i Malmö and the other 500ish MPs missing an iort.

@MansMeg
Collaborator Author

MansMeg commented Apr 24, 2023

Should we wait to update all these iorts? I'm a little hesitant to bulk upload before we have done any quality control of the files.

@BobBorges
Collaborator

This is why I was asking about it. I was under the impression that the list I got from Emil (why can't I tag him btw?) was hand curated from the bio books -- if that's the case it should be pretty good. If it's not the case then ¯\_(ツ)_/¯ we should quality control it.

@salgo60
Contributor

salgo60 commented Apr 24, 2023

https://www.wikidata.org/wiki/Q6012010

@fredrik1984 done

@salgo60
Contributor

salgo60 commented Apr 24, 2023

@salgo60 how confident are you in the iorts from the list you and @emil produced? If you're relatively confident they're mostly OK, I will add them all to Wikidata programmatically, which would sort out @fredrik1984 's comment about Roos i Malmö and the other 500ish MPs missing an iort.

@BobBorges
The only error so far is

If you do the update as a transaction, we could roll back if it looks too crazy.
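
A minimal local sketch of the "update with rollback" idea, assuming the iort values are held in a plain dictionary keyed by wiki id before anything is written to Wikidata. The function names `plan_updates` and `apply_changes` are hypothetical, and nothing here calls a Wikidata API; it only shows how an undo log could be kept next to the planned changes.

```python
from typing import Dict, Optional, Tuple

Values = Dict[str, Optional[str]]


def plan_updates(current: Values, proposed: Values) -> Tuple[Values, Values]:
    """Return (changes, undo_log) for ids whose value would actually change."""
    changes: Values = {}
    undo_log: Values = {}
    for wiki_id, new_value in proposed.items():
        old_value = current.get(wiki_id)
        if old_value != new_value:
            changes[wiki_id] = new_value
            undo_log[wiki_id] = old_value  # None means "had no value before"
    return changes, undo_log


def apply_changes(state: Values, changes: Values) -> Values:
    """Apply changes to a copy of state; a None value removes the entry."""
    new_state = dict(state)
    for wiki_id, value in changes.items():
        if value is None:
            new_state.pop(wiki_id, None)
        else:
            new_state[wiki_id] = value
    return new_state


current: Values = {}                          # no iort recorded yet
proposed: Values = {"Q6078273": "Roos i Malmö"}  # value reported in this thread

changes, undo_log = plan_updates(current, proposed)
updated = apply_changes(current, changes)
rolled_back = apply_changes(updated, undo_log)
print(updated)
print(rolled_back == current)
```

Saving `undo_log` to a file before a bulk upload would give a record of exactly which ids were touched and what they held before.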

  • I plan to add more things later from Emil's files for the Enkammar...book if no one else does it.

Question: What does iort stand for?

@fredrik1984
Collaborator

@salgo60 iort/i-ort stands for "i riksdagen kallad", used by the speaker of the house to address MPs, often the place where an MP lived. I just found this wiki page: https://sv.wikipedia.org/wiki/I_riksdagen_kallad

Good thing to know that after 1976/77 they stopped addressing MPs with herr/fru/fröken. "Since 1977/1978 members are instead designated by both first name and surname, but where disambiguation is needed a place or farm name is added"

@fredrik1984
Collaborator

https://www.wikidata.org/wiki/Q6012769 is missing i-ort (Nisser i Grycksbo) and specific start date in FK (11 January 1938). He is also missing a reference to the portrait archive. Source: bio book 5, p. 80. ping @salgo60 (do you want me to ping you for each post of a missing MP metadata in Wikidata?)

A question to @MansMeg @BobBorges @ninpnin – regarding Bob's comment above, should I not continue reporting these missing MP metadata on Wikidata? I thought that Emil had already added all missing MPs on Wikidata, but that some metadata is still missing, hence why I add these comments.

@MansMeg
Collaborator Author

MansMeg commented Apr 25, 2023

Please keep adding these, Fredrik. Emil and @salgo60 added all the people who were missing on Wikidata, although not all metadata has been added. We do this iteratively. So far we have found people who are missing information on role, which needs to be fixed so that we get all MPs in. That's the first step. Then we are going to add more and more metadata on a need-to-have basis.

@fredrik1984
Collaborator

Ok, good, I thought so! Going through Väinö's CSV file with MP introduction is very good to see what the quality of the MP metadata in Wikidata looks like, and it is often very good.

@Lottabrorsson – would be great if you and Mattias also report missing MP metadata in Wikidata here when you go through your share of MPs in the CSV file!

@fredrik1984
Collaborator

fredrik1984 commented Apr 25, 2023

@salgo60
Contributor

salgo60 commented Apr 25, 2023

@fredrik1984 WD is behind. My understanding was that Emil often added the alias and some data if it was empty.

I try to:

  1. Add the name in the form "I riksdagen kallad"
  2. Add the link to SPA (OT: I checked the number of views we have in Wikidata on pictures from SPA, and it's > 200 000 views per month, with pictures used on 74 different wikis... nice scaling link)
  3. Set first names
  4. Set party: I try to follow the Bible, but I am not skilled and make mistakes
  5. Add birth/death location and date, check whether we have other sources, and sometimes add the church book
  6. Do a search on Svenska Gravar and add Gravstensinventeringen
  7. Check with the gadget Authority Control whether the person is in VIAF/LIBRIS XL
  8. Update the sv:Wikipedia article with the name "I riksdagen kallad", and if no template is used add Faktamall biografi WD and Auktoritetsdata (see the change for Axel Roos)
  9. Add pictures from SPA and the metadata depicts WD object
  10. Add "Described by source P1343": Tvåkammar-riksdagen 1867–1970 / Q110346241
    --> We now have > 3400 First/Second Chamber people with this source / as a table

image

@salgo60
Contributor

salgo60 commented Apr 25, 2023

ping @salgo60 (do you want me to ping you for each post of a missing MP metadata in Wikidata?)

@fredrik1984
Maybe better add a new issue in salgo60/Wikidata_riksdagen-corpus and assign me or add to one of my todo lists like #121

@salgo60
Contributor

salgo60 commented Apr 25, 2023

@salgo60 iort/i-ort stands for "i riksdagen kallad", used by the speaker of the house to address MPs, often the place where an MP lived. I just found this wiki page: https://sv.wikipedia.org/wiki/I_riksdagen_kallad

Good thing to know that after 1976/77 they stopped addressing MPs with herr/fru/fröken. "Since 1977/1978 members are instead designated by both first name and surname, but where disambiguation is needed a place or farm name is added"

Thanks
😃 I was part of creating that Wikipage

  • I added iort and i-ort as alias for the WD object Q110382440
    • but I still don't understand how iort is related to "i riksdagen kallad" 😕

image

@fredrik1984
Collaborator

Ok! I will start a new issue there and move the missing MP metadata in Wikidata things. Would it be possible for you @salgo60 to fix the rest of my MP posts in this thread?

@salgo60
Contributor

salgo60 commented Apr 25, 2023

@fredrik1984
Ping me when something is done. Right now I am confused about where the latest version of your files is.

In my version I found an error.... correct is

Q6157121,Nisser i Falun, Ernst M W, 5:79

@salgo60
Contributor

salgo60 commented Apr 26, 2023

@salgo60 from Swerik's perspective, if a party just changes its name (e.g. from bondeförbundet to centerpartiet) then it should be regarded as the same party. But bondeförbundet is not the same as jordbrukarnas riksförbund (the latter merged into the former in 1921).

@fredrik1984 @MansMeg
I suggest that you start looking into SKOS; see my attempt to explain it to the people at Skansen

image

I think that the more mature you get in using linked data, and the more people get involved, the more you will split things into separate objects

image

@salgo60
Contributor

salgo60 commented May 11, 2023

@fredrik1984 you also have "vilde"

  • in Wikidata they suggest that "vilde" is an empty value for party, but I have started to create WD objects for different types of "vilde"

@fredrik1984 as WD now has more scanned pictures of Swedish MPs, I did a test to filter out people described as "vilde"; see #139
