To investigate and extract all the photographers who contributed to the Zeri Photo Archive, we started with a simple query.
It selects and groups the photographers and the associations that carried out the photographs. The output already includes some photographers we know took part in the archive, which suggests the query is valid.
However, we may want to clean this output:
 - First, we want only the major contributors
 - Second, we may want to exclude associations, academies and universities.

In [8]:
import rdflib
import pprint
from rdflib import Namespace
from rdflib.namespace import DCTERMS
from rdflib.namespace import RDFS
from rdflib import URIRef, Literal
from rdflib.namespace import XSD
import numpy as np 
import matplotlib.pyplot as plt 
from SPARQLWrapper import SPARQLWrapper, JSON
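import ssl
# workaround: skip HTTPS certificate verification for this session (not advisable outside exploratory work)
ssl._create_default_https_context = ssl._create_unverified_context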

# get the endpoint API
fototeca_endpoint = "http://data.fondazionezeri.unibo.it/sparql"

# prepare the query: every distinct agent that carried out the creation of a photograph
my_SPARQL_query = """
PREFIX crm: <http://www.cidoc-crm.org/cidoc-crm/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?photographer
WHERE {
    ?x rdf:type <http://www.essepuntato.it/2014/03/fentry/Photograph> ;
       crm:P94i_was_created_by ?creation .
    ?creation crm:P14_carried_out_by ?photographer .
}
GROUP BY ?photographer
"""

# set the endpoint 
sparql_ft = SPARQLWrapper(fototeca_endpoint)
# set the query
sparql_ft.setQuery(my_SPARQL_query)
# set the returned format
sparql_ft.setReturnFormat(JSON)
# get the results
results = sparql_ft.query().convert()

# print the URI of each photographer or organization
for result in results["results"]["bindings"]:
    print(result["photographer"]["value"])



https://w3id.org/zericatalog/organization/2204/fototecnica-fortunati
https://w3id.org/zericatalog/organization/2816/studio-guy
https://w3id.org/zericatalog/person/926/agosto-mario
https://w3id.org/zericatalog/person/205/liverani-giorgio
https://w3id.org/zericatalog/person/900/foto-wells
https://w3id.org/zericatalog/organization/3124/british-museum
https://w3id.org/zericatalog/organization/2821/christies
https://w3id.org/zericatalog/person/3384/poree-monique
https://w3id.org/zericatalog/organization/3445/guidotti-fratelli
https://w3id.org/zericatalog/organization/2905/immagine-fotografica
https://w3id.org/zericatalog/person/2478/guidotti-bartolomeo
https://w3id.org/zericatalog/organization/2489/publifoto
https://w3id.org/zericatalog/organization/2744/ciurnelli
https://w3id.org/zericatalog/organization/2550/dalessandri-fratelli
https://w3id.org/zericatalog/person/994/chiodi-walter
https://w3id.org/zericatalog/person/832/gasparini-e
https://w3id.org/zericatalog/organization/2847/cnb
https
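
The output above mixes `person` and `organization` URIs, so one possible way to exclude associations is to look at the URI path. The following is a minimal sketch, assuming every agent URI follows the `zericatalog/<kind>/<id>/<slug>` shape seen in the output; the helper names `agent_kind` and `split_agents` are hypothetical and not part of the original notebook.

```python
from urllib.parse import urlparse


def agent_kind(uri):
    """Return the segment after 'zericatalog' in the URI path, or 'unknown'."""
    parts = urlparse(uri).path.strip("/").split("/")
    # Assumed path shape: zericatalog/<kind>/<id>/<slug>
    if len(parts) > 1 and parts[0] == "zericatalog":
        return parts[1]
    return "unknown"


def split_agents(uris):
    """Split URIs into (people, others) according to their path segment."""
    people, others = [], []
    for uri in uris:
        if agent_kind(uri) == "person":
            people.append(uri)
        else:
            others.append(uri)
    return people, others


# A few URIs copied from the output above
sample = [
    "https://w3id.org/zericatalog/organization/2204/fototecnica-fortunati",
    "https://w3id.org/zericatalog/person/926/agosto-mario",
    "https://w3id.org/zericatalog/organization/3124/british-museum",
    "https://w3id.org/zericatalog/person/205/liverani-giorgio",
]

people, others = split_agents(sample)
for uri in people:
    print(uri)
```

This relies only on how the URIs are written, so it may misclassify some agents: an entry such as `person/900/foto-wells` suggests that the `person` segment does not always denote an individual.
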

We want to refine this query so that the results:
- Are photographers
- Hold a certain role in time
- Can be mapped by role, so that we can look at the distribution and select the major contributors
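
As a first step towards the distribution, the query can count photographs per agent. The sketch below reuses the classes and properties of the query above and adds standard SPARQL 1.1 aggregation; the `/person/` filter, the variable `?photos`, the helper `major_contributors` and the threshold value are assumptions made for illustration. It does not address the role-in-time requirement, and the bindings at the end are made-up placeholders that only show the expected shape, not archive data.

```python
# Count photographs per agent, keeping only URIs that contain "/person/".
# Assumption: the URI segment is a usable proxy for "is a photographer".
COUNT_QUERY = """
PREFIX crm: <http://www.cidoc-crm.org/cidoc-crm/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
SELECT ?photographer (COUNT(DISTINCT ?x) AS ?photos)
WHERE {
    ?x rdf:type <http://www.essepuntato.it/2014/03/fentry/Photograph> ;
       crm:P94i_was_created_by ?creation .
    ?creation crm:P14_carried_out_by ?photographer .
    FILTER(CONTAINS(STR(?photographer), "/person/"))
}
GROUP BY ?photographer
ORDER BY DESC(?photos)
"""


def major_contributors(bindings, min_photos):
    """Return (uri, count) pairs with at least min_photos, largest first."""
    counts = [
        (row["photographer"]["value"], int(row["photos"]["value"]))
        for row in bindings
    ]
    selected = [pair for pair in counts if pair[1] >= min_photos]
    return sorted(selected, key=lambda pair: pair[1], reverse=True)


# Placeholder bindings in the SPARQL JSON result shape (not real counts)
example_bindings = [
    {"photographer": {"value": "urn:example:a"}, "photos": {"value": "3"}},
    {"photographer": {"value": "urn:example:b"}, "photos": {"value": "40"}},
    {"photographer": {"value": "urn:example:c"}, "photos": {"value": "12"}},
]

for uri, count in major_contributors(example_bindings, min_photos=10):
    print(uri, count)
```

With the live endpoint, the same helper could be applied to `results["results"]["bindings"]` after passing `COUNT_QUERY` to `sparql_ft.setQuery`; a sensible threshold would have to be chosen after looking at the actual distribution.
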