# Cuneiform Geography

Each cuneiform tablet has a provenience, or place of discovery. This information is sometimes lost through illicit digging or missing documentation, but when it is known it has been published in the open-source databases of cuneiform texts, most notably [CDLI](https://cdli.mpiwg-berlin.mpg.de/), [ORACC](http://oracc.museum.upenn.edu), and [BDTNS](http://bdtns.filol.csic.es/).
In 2021 a team of specialists published a dataset of proveniences for cuneiform sources on Zenodo, entitled [Cuneiform Inscriptions Geographical Site Index](https://zenodo.org/record/5642899#.YxeO1uzMLy9). Their data curation extended beyond that of the existing databases. The dataset has become the dataset of record for the geographical metadata of these artifacts and has since been incorporated into the CDLI.

The goals of this initial notebook are (1) to access this dataset using reproducible methods and compare its previous and current versions, and (2) to prepare the data for Linked Open Data (LOD) triple statements in the [FactGrid Cuneiform Project](https://database.factgrid.de/wiki/FactGrid:Cuneiform_Project).

------------

**License:**

Attribution 4.0 International [CC-BY](https://github.com/santisoler/cc-licenses/blob/main/LICENSE-CC-BY)

----------
> __Citation references:__
* License: [CC-BY 4.0](https://github.com/santisoler/cc-licenses/blob/main/LICENSE-CC-BY)
* Title: Cuneiform Geography
* Date: Spring 2023
* Authors:
  1. Tina Chen (czz129@berkeley.edu). UC Berkeley Data Science major (2024)
  2. Dr. Adam Anderson (adamganderson@gmail.com). UC Berkeley Data Science Discovery Partner, FactGrid Cuneiform project director.

* Project: [FactGrid Cuneiform](https://database.factgrid.de/wiki/FactGrid:Cuneiform_Project)
* Program: [UC Berkeley Data Science Discovery](https://data.berkeley.edu/project/factgrid-cuneiform)


In [None]:
!pip3 install geojson
# `shapely.constructive` is a submodule of shapely, not a separate PyPI package
!pip3 install shapely
!pip3 install --upgrade geopandas
import pandas as pd
import numpy as np
import csv
import plotly.express as px
import geopandas as gpd
import json
import requests
%matplotlib inline
from shapely.geometry import Point
# `datasets` is unused in this notebook and was removed in geopandas 1.0
from geopandas import GeoDataFrame, read_file

Collecting geojson
  Downloading geojson-3.0.1-py3-none-any.whl (15 kB)
Installing collected packages: geojson
Successfully installed geojson-3.0.1


In [None]:
from google.colab import drive
drive.mount('/content/drive')
workdir = '/content/drive/MyDrive/Sumerian Network' # for Tina


Mounted at /content/drive


## 1 Published Data for Cuneiform Tablet Proveniences

1.1 The first step is to import the CIGS dataset (geojson) [Rattenborg, Rune, Johansson, Carolin, Nett, Seraina, Smidt, Gustav Ryberg, & Andersson, Jakob. (2021). Cuneiform Inscriptions Geographical Site Index (CIGS) (1.4) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.5642899](https://zenodo.org/record/5642899#.YxeO1uzMLy9)

1.2 Then we will compare this with the updated version of the CIGS dataset (CSV) [Rattenborg, Rune, Johansson, Carolin, Melin-Kronsell, Nils, Nett, Seraina, Smidt, Gustav Ryberg, & Andersson, Jakob. (2023). Cuneiform Inscriptions Geographical Site Index (CIGS) (v1.6). Zenodo. https://doi.org/10.5281/zenodo.8126955](https://zenodo.org/record/8126955)
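The version comparison described in 1.1 and 1.2 can be sketched on toy data as follows. This is only an illustration, not the comparison the notebook actually runs: it assumes both releases share a `site_id` column, the two small tables stand in for the real files, and `XXX` / `Hypothetical Site` is an invented row.

```python
import pandas as pd

# Toy stand-ins for the two CIGS releases; the real tables have many more columns.
cigs_v14 = pd.DataFrame({
    "site_id": ["ADB", "ZTP", "XXX"],  # "XXX" is a made-up row for illustration
    "transc_name": ["Bismāyā", "Ziyaret Tepe", "Hypothetical Site"],
})
cigs_v16 = pd.DataFrame({
    "site_id": ["ADA", "ADB", "ZTP"],
    "transc_name": ["Adalar", "Bismāyā", "Ziyaret Tepe"],
})

def compare_versions(old, new, key="site_id"):
    """Return the key values that were added, removed, and kept between two releases."""
    old_ids, new_ids = set(old[key]), set(new[key])
    return new_ids - old_ids, old_ids - new_ids, old_ids & new_ids

added, removed, shared = compare_versions(cigs_v14, cigs_v16)
print("added:", sorted(added))
print("removed:", sorted(removed))
print("shared:", sorted(shared))
```

Comparing on a stable identifier rather than on a name column avoids false differences when a transliteration is respelled between releases.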


In [None]:
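# Earlier release (v1.4, GeoJSON)
CIGS_old = gpd.read_file('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/CIGS_v1_4_20211101.geojson')
# Current release (v1.6, CSV); the `geometry` column stays empty when a plain CSV is read this way
CIGS = gpd.read_file('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/CIGS_v1_6_20230613.csv')
CIGS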

Unnamed: 0,site_id,cdli_provenience_id,accuracy,anc_name,transc_name,ara_name,arm_name,fas_name,geo_name,gre_name,...,wik_ara,wik_fas,wik_gre,wik_heb,wik_tr,rla_url,wikidata_url,lon_wgs1984,lat_wgs1984,geometry
0,ADA,0,2,,Adalar,,,,,,...,,,,,,,,42.5142,39.1240,
1,ADB,252,3,Adab,Bismāyā,بسمايا,,,,,...,https://ar.wikipedia.org/wiki/أداب_(مدينة),,,,,,https://wikidata.org/wiki/Q346445,45.6233,31.9509,
2,ADH,0,3,,Tall Abū al-Dhahab,تل أبو الذهب,,,,,...,,,,,,,,46.6926,30.7382,
3,ADI,0,1,,Adilcevaz,,,,,,...,,,,,https://tr.wikipedia.org/wiki/Adilcevaz,,https://wikidata.org/wiki/Q357251,42.7291,38.8014,
4,AFA,0,3,,Tulūl al-Fāj,تلول الفاج,,,,,...,,,,,,,,45.0956,32.5741,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
592,ZLB,58,1,,Zālūl Āb,زالوآب,,,,,...,,https://fa.wikipedia.org/wiki/زالوآب_(روانسر),,,,,,46.5799,34.6309,
593,ZOH,116,1,,Sar-i Pol-i Zohāb,,,سر پل زهاب,,,...,,,,,,,,45.8692,34.4633,
594,ZTP,244,3,Tushhan,Ziyaret Tepe,,,,,,...,,,,,https://tr.wikipedia.org/wiki/Ziyaret_Tepe_Höyüğü,,https://wikidata.org/wiki/Q4819172,40.7929,37.7936,
595,ZVA,0,1,,Zvartʻnotsʻ,,Զվարթնոց,,,,...,,https://fa.wikipedia.org/wiki/سنگ%E2%80%8Cنگار...,,,,,https://wikidata.org/wiki/Q505175,44.3366,40.1596,


In [None]:
CIGS.iloc[407]

site_id                                                        QIT
cdli_provenience_id                                              0
accuracy                                                         3
anc_name                                                          
transc_name                                          Tall al-Qiṭar
ara_name                                                 تل القطار
arm_name                                                          
fas_name                                                          
geo_name                                                          
gre_name                                                          
heb_name                                                          
rus_name                                                          
legacy_name                                                       
pleiades_url                                                      
osm_url                https://www.openstreetmap.org/way/76603

In [None]:
CIGS.columns

Index(['site_id', 'cdli_provenience_id', 'accuracy', 'anc_name', 'transc_name',
       'ara_name', 'arm_name', 'fas_name', 'geo_name', 'gre_name', 'heb_name',
       'rus_name', 'legacy_name', 'pleiades_url', 'osm_url', 'geonames_url',
       'wik_en', 'wik_ara', 'wik_fas', 'wik_gre', 'wik_heb', 'wik_tr',
       'rla_url', 'wikidata_url', 'lon_wgs1984', 'lat_wgs1984', 'geometry'],
      dtype='object')

In [None]:
# CIGS["pleiades_id"]

## 2 LOD format for FactGrid
Before the CIGS dataset can be imported into FactGrid, it needs to be formatted as Linked Open Data (LOD). This requires:
1. Property IDs for the corresponding items in the header field
2. Quotation marks around labels for each item
3. Additional formatting tags for special language fields, etc.

FactGrid uses QuickStatements for importing data in CSV format. The formatting rules for QuickStatements are documented for Wikidata (https://www.wikidata.org/wiki/Help:QuickStatements). There is also a video description [here](https://blog.factgrid.de/archives/811).
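The three formatting steps listed above can be sketched as follows. This is a minimal illustration under assumptions, not the project's import script: the helper names `quoted` and `monolingual` are hypothetical, and the header (`qid`, `Len`, `P34`) and the sample values are borrowed from the FactGrid table shown in this notebook.

```python
import csv
import io

def quoted(value):
    """Wrap a plain string value in double quotes."""
    return f'"{value}"'

def monolingual(lang, text):
    """Format a language-tagged string, e.g. ar:"..."."""
    return f"{lang}:{quoted(text)}"

# Hypothetical one-row table; the header cells are property IDs or label codes.
rows = [
    {
        "qid": "Q389858",
        "Len": quoted("Tall al-Daiyr"),
        "P34": monolingual("ar", "تل الدير"),
    },
]

buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["qid", "Len", "P34"])
writer.writeheader()
writer.writerows(rows)

lines = buffer.getvalue().splitlines()
for line in lines:
    print(line)
```

Python's `csv` writer doubles any quotation mark embedded in a field, so the quotes survive a round trip through a CSV reader. Whether a given FactGrid property expects a quoted string, a language-tagged string, or a bare item ID should be checked against the QuickStatements documentation before import.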


In [None]:
path = '/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/LOD Tablet Dictionary - CIGS_FG.csv'
CIGS_FG = pd.read_csv(path, header=[0])
CIGS_FG

Unnamed: 0,Double record?,qid,Sarwiki,P418,P34,Unnamed: 5,Unnamed: 6,Senwiki,P671,P34.1,...,wik_fas,wik_gre,wik_heb,wik_en.1,lan,qid.2,"qid,Len,Lfr,Lde,Den,P2,P2,P131,P131,P48,P670,qal18",place,link,placeLabel
0,,Q389858,"""سيبار_أمنانوم""","""6927582""",,,,"""Sippar-Amnanum ""","""392373882""","ar:""تل الدير""",...,,,,https://en.wikipedia.org/wiki/Sippar-Amnanum,,,",""Tall al-Daiyr"",""Tall al-Daiyr"",""Tall al-Daiy...",Q390078,Q756957,Hasankeyf
1,,Q389791,,,,,,,,"ar:""أم الجير""",...,,,,,,,",""Umm al-Jīr"",""Umm al-Jīr"",""Umm al-Jīr"",""ancie...",,,
2,,Q389689,,"""7108432""","fa:""تپه حصار""",,,"""Tepe Hissar ""","""942329""",,...,https://fa.wikipedia.org/wiki/تپه%E2%80%8Cحصار,,,https://en.wikipedia.org/wiki/Tepe_Hissar,,,",""Tapah Ḥiṣār"",""Tapah Ḥiṣār"",""Tapah Ḥiṣār"",""an...",,,
3,,Q390037,,,,Strwiki,"""Maşat Höyük""","""Maşat Höyük ""","""133411593""",,...,,,,https://en.wikipedia.org/wiki/Maşat_Höyük,,,",""Maşat Höyük"",""Maşat Höyük"",""Maşat Höyük"",""an...",Q389991,Q82070,Ṣūr
4,,Q389935,,,,,,,"""249999668""","ar:""تل محمد""",...,,,,,,,",""Tall Muḥamad"",""Tall Muḥamad"",""Tall Muḥamad"",...",,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
576,,Q390166,,"""6268312""",,,,,,,...,,,,,,,",""Новый Кумак"",""Новый Кумак"",""Новый Кумак"",""an...",,,
577,,Q390170,,,,,,P671,"""804216207""",,...,,,,,,,",""Acemhöyük"",""Acemhöyük"",""Acemhöyük"",""ancient ...",,,
578,,Q389671,,"""281184""",,,,P671,"""687928""","ar:""القدس""",...,,,,,,,",""Yrṿshlym"",""Yrṿshlym"",""Yrṿshlym"",""ancient loc...",,,
579,,Q389718,,,"fa:""سقندل""",,,P671,"""884203""",,...,,,,,,,",""Saqandal"",""Saqandal"",""Saqandal"",""ancient loc...",,,


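The next cell strips the double quotation marks from the columns. It is left commented out, and the cleaned table is loaded from `CIGS_FG_without_quote.csv` in the cell after it.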

In [None]:
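# Work on a copy and remove quotes inside string values;
# Series.replace('"', '') would only match cells that are exactly '"'.
# CIGS_FG_without_quote = CIGS_FG.copy()
# for col in CIGS_FG_without_quote.select_dtypes(include="object").columns:
#     CIGS_FG_without_quote[col] = CIGS_FG_without_quote[col].str.replace('"', '', regex=False)
# CIGS_FG_without_quote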

In [None]:
CIGS_FG_without_quote = pd.read_csv("/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/CIGS_FG_without_quote.csv" )
CIGS_FG_without_quote

Unnamed: 0.1,Unnamed: 0,Double record?,qid,Sarwiki,P418,P34,Unnamed: 5,Unnamed: 6,Senwiki,P671,...,wik_fas,wik_gre,wik_heb,wik_en.1,lan,qid.2,"qid,Len,Lfr,Lde,Den,P2,P2,P131,P131,P48,P670,qal18",place,link,placeLabel
0,0,,Q389858,سيبار_أمنانوم,6927582.0,,,,Sippar-Amnanum,392373882.0,...,,,,https://en.wikipedia.org/wiki/Sippar-Amnanum,,,",""Tall al-Daiyr"",""Tall al-Daiyr"",""Tall al-Daiy...",Q390078,Q756957,Hasankeyf
1,1,,Q389791,,,,,,,,...,,,,,,,",""Umm al-Jīr"",""Umm al-Jīr"",""Umm al-Jīr"",""ancie...",,,
2,2,,Q389689,,7108432.0,fa:تپه حصار,,,Tepe Hissar,942329.0,...,https://fa.wikipedia.org/wiki/تپه%E2%80%8Cحصار,,,https://en.wikipedia.org/wiki/Tepe_Hissar,,,",""Tapah Ḥiṣār"",""Tapah Ḥiṣār"",""Tapah Ḥiṣār"",""an...",,,
3,3,,Q390037,,,,Strwiki,Maşat Höyük,Maşat Höyük,133411593.0,...,,,,https://en.wikipedia.org/wiki/Maşat_Höyük,,,",""Maşat Höyük"",""Maşat Höyük"",""Maşat Höyük"",""an...",Q389991,Q82070,Ṣūr
4,4,,Q389935,,,,,,,249999668.0,...,,,,,,,",""Tall Muḥamad"",""Tall Muḥamad"",""Tall Muḥamad"",...",,,
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
576,576,,Q390166,,6268312.0,,,,,,...,,,,,,,",""Новый Кумак"",""Новый Кумак"",""Новый Кумак"",""an...",,,
577,577,,Q390170,,,,,,P671,804216207.0,...,,,,,,,",""Acemhöyük"",""Acemhöyük"",""Acemhöyük"",""ancient ...",,,
578,578,,Q389671,,281184.0,,,,P671,687928.0,...,,,,,,,",""Yrṿshlym"",""Yrṿshlym"",""Yrṿshlym"",""ancient loc...",,,
579,579,,Q389718,,,fa:سقندل,,,P671,884203.0,...,,,,,,,",""Saqandal"",""Saqandal"",""Saqandal"",""ancient loc...",,,


In [None]:
CIGS_FG_without_quote["Len"]

0      Tall al-Daiyr
1         Umm al-Jīr
2        Tapah Ḥiṣār
3        Maşat Höyük
4       Tall Muḥamad
           ...      
576      Новый Кумак
577        Acemhöyük
578         Yrṿshlym
579         Saqandal
580       Al-Rajībah
Name: Len, Length: 581, dtype: object

In [None]:
CIGS_FG_without_quote.columns

Index(['Unnamed: 0', 'Double record?', 'qid', 'Sarwiki', 'P418', 'P34',
       'Unnamed: 5', 'Unnamed: 6', 'Senwiki', 'P671', 'P34.1', 'Wikidata_qid',
       'Wikidata2', 'Len', 'Den', 'P2', 'P2.1', 'P131', 'P131.1',
       'lat_lon_P48', 'P670', 'qal18', 'wik_en', 'sv.wiki', 'wik_tr',
       'Description', 'anc_name', 'lat_wgs1984', 'lon_wgs1984', 'accuracy',
       'qid.1', 'cdli_legacy_"Len"', 'cdli_provenience_id_P694',
       'transciption_name', 'language', 'tr_name', 'ara_name', 'arm_name',
       'fas_name', 'geo_name', 'gre_name', 'heb_name', 'rus_name',
       'pleiades_id_P671', 'osm_id', 'osm_type', 'site_id', 'geonames_id',
       'Unnamed: 47', 'wik_ara', 'wik_fas', 'wik_gre', 'wik_heb', 'wik_en.1',
       'lan', 'qid.2', 'qid,Len,Lfr,Lde,Den,P2,P2,P131,P131,P48,P670,qal18',
       'place', 'link', 'placeLabel'],
      dtype='object')

Next we merge the two data tables on the transcribed site name to see which fields are already in FactGrid and which still need to be added. For some of the missing fields, the corresponding Property may first have to be created in FactGrid before the data can be imported directly.
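As a small illustration of how such a merge exposes missing records, the sketch below runs a right join with `indicator=True` on toy tables. The column names follow the ones used in this notebook, but the rows are a hand-picked sample and the variable names `fg`, `cigs`, and `missing` are only for this example.

```python
import pandas as pd

# Toy FactGrid-side table (left) and CIGS-side table (right).
fg = pd.DataFrame({
    "transciption_name": ["Ziyaret Tepe", "Kuşaklı"],
    "qid": ["Q390073", "Q390044"],
})
cigs = pd.DataFrame({
    "transc_name": ["Ziyaret Tepe", "Adilcevaz", "Adalar"],
    "site_id": ["ZTP", "ADI", "ADA"],
})

# A right join keeps every CIGS row; `_merge` records whether a FactGrid match was found.
merged = fg.merge(
    cigs,
    how="right",
    left_on="transciption_name",
    right_on="transc_name",
    indicator=True,
)

# Rows flagged "right_only" exist in CIGS but have no FactGrid counterpart yet.
missing = merged.loc[merged["_merge"] == "right_only", ["site_id", "transc_name"]]
print(missing)
```

Because the join key is a name string, any difference in spelling or diacritics between the two tables produces a `right_only` row, and duplicated names on either side multiply rows in the result.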

In [None]:
CIGS_MERGE = CIGS_FG_without_quote.merge(CIGS, how = 'right', left_on = 'transciption_name', right_on = "transc_name",indicator= True)
CIGS_MERGE.columns

Index(['Unnamed: 0', 'Double record?', 'qid', 'Sarwiki', 'P418', 'P34',
       'Unnamed: 5', 'Unnamed: 6', 'Senwiki', 'P671', 'P34.1', 'Wikidata_qid',
       'Wikidata2', 'Len', 'Den', 'P2', 'P2.1', 'P131', 'P131.1',
       'lat_lon_P48', 'P670', 'qal18', 'wik_en_x', 'sv.wiki', 'wik_tr_x',
       'Description', 'anc_name_x', 'lat_wgs1984_x', 'lon_wgs1984_x',
       'accuracy_x', 'qid.1', 'cdli_legacy_"Len"', 'cdli_provenience_id_P694',
       'transciption_name', 'language', 'tr_name', 'ara_name_x', 'arm_name_x',
       'fas_name_x', 'geo_name_x', 'gre_name_x', 'heb_name_x', 'rus_name_x',
       'pleiades_id_P671', 'osm_id', 'osm_type', 'site_id_x', 'geonames_id',
       'Unnamed: 47', 'wik_ara_x', 'wik_fas_x', 'wik_gre_x', 'wik_heb_x',
       'wik_en.1', 'lan', 'qid.2',
       'qid,Len,Lfr,Lde,Den,P2,P2,P131,P131,P48,P670,qal18', 'place', 'link',
       'placeLabel', 'site_id_y', 'cdli_provenience_id', 'accuracy_y',
       'anc_name_y', 'transc_name', 'ara_name_y', 'arm_name_y', 'fas_

In [None]:
CIGS_MERGE_old = CIGS_FG_without_quote.merge(CIGS_old, how = 'right', left_on = 'transciption_name', right_on = "transc_name",indicator= True)
CIGS_MERGE_old.columns

Index(['Unnamed: 0', 'Double record?', 'qid', 'Sarwiki', 'P418', 'P34',
       'Unnamed: 5', 'Unnamed: 6', 'Senwiki', 'P671', 'P34.1', 'Wikidata_qid',
       'Wikidata2', 'Len', 'Den', 'P2', 'P2.1', 'P131', 'P131.1',
       'lat_lon_P48', 'P670', 'qal18', 'wik_en_x', 'sv.wiki', 'wik_tr_x',
       'Description', 'anc_name_x', 'lat_wgs1984', 'lon_wgs1984', 'accuracy_x',
       'qid.1', 'cdli_legacy_"Len"', 'cdli_provenience_id_P694',
       'transciption_name', 'language', 'tr_name', 'ara_name_x', 'arm_name_x',
       'fas_name_x', 'geo_name_x', 'gre_name_x', 'heb_name_x', 'rus_name_x',
       'pleiades_id_P671', 'osm_id_x', 'osm_type_x', 'site_id_x',
       'geonames_id_x', 'Unnamed: 47', 'wik_ara_x', 'wik_fas_x', 'wik_gre_x',
       'wik_heb_x', 'wik_en.1', 'lan', 'qid.2',
       'qid,Len,Lfr,Lde,Den,P2,P2,P131,P131,P48,P670,qal18', 'place', 'link',
       'placeLabel', 'site_id_y', 'cdli_provenience_id', 'accuracy_y',
       'anc_name_y', 'transc_name', 'ara_name_y', 'arm_name_y', 'fa

In [None]:
CIGS_MERGE.to_csv('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/CIGS_FG_loss_v6.csv')

In [None]:
CIGS_FG_loss = CIGS_MERGE.loc[CIGS_MERGE['_merge'] == 'right_only']

CIGS_FG_loss

Unnamed: 0.1,Unnamed: 0,Double record?,qid,Sarwiki,P418,P34,Unnamed: 5,Unnamed: 6,Senwiki,P671,...,wik_fas_y,wik_gre_y,wik_heb_y,wik_tr_y,rla_url,wikidata_url,lon_wgs1984_y,lat_wgs1984_y,geometry,_merge
2,,,,,,,,,,,...,,,,,,,46.6926,30.7382,,right_only
3,,,,,,,,,,,...,,,,https://tr.wikipedia.org/wiki/Adilcevaz,,https://wikidata.org/wiki/Q357251,42.7291,38.8014,,right_only
4,,,,,,,,,,,...,,,,,,,45.0956,32.5741,,right_only
5,,,,,,,,,,,...,,,,,,https://wikidata.org/wiki/Q7697374,36.7988,35.9039,,right_only
9,,,,,,,,,,,...,,,,,,,44.8225,32.4605,,right_only
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
560,,,,,,,,,,,...,,,,,,,45.0983,32.5612,,right_only
561,,,,,,,,,,,...,,,,,,https://wikidata.org/wiki/Q27365,38.5499,27.6219,,right_only
565,,,,,,,,,,,...,,,,,,,46.5784,31.3901,,right_only
584,,,,,,,,,,,...,,,,,,,45.3421,32.1834,,right_only


In [None]:
# Columns present in the v1.6 merge but absent from the v1.4 merge
is_in_old = CIGS_MERGE.columns.isin(CIGS_MERGE_old.columns)
new_col = CIGS_MERGE.columns.to_series()
new_col.loc[~is_in_old]

lat_wgs1984_x    lat_wgs1984_x
lon_wgs1984_x    lon_wgs1984_x
osm_id                  osm_id
osm_type              osm_type
geonames_id        geonames_id
legacy_name        legacy_name
pleiades_url      pleiades_url
osm_url                osm_url
geonames_url      geonames_url
rla_url                rla_url
wikidata_url      wikidata_url
lon_wgs1984_y    lon_wgs1984_y
lat_wgs1984_y    lat_wgs1984_y
dtype: object

In [None]:
# CIGS_FG_loss.to_csv('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/CIGS_FG_loss.csv')

In [None]:
factgrid = pd.read_json('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/query.json')
factgrid

Unnamed: 0,ancientplace,coord,romanscript,namehistory,cdli2,pleiades
0,https://database.factgrid.de/entity/Q389901,Point(43.2304 35.5931),Tall Ḥuwaysh,تل حويش,95.0,
1,https://database.factgrid.de/entity/Q389900,Point(41.166 36.816),Tall Ḥamīdī,تل حميدي,,874740.0
2,https://database.factgrid.de/entity/Q389898,Point(40.0399 36.8268),Tall Ḥalaf,تل حلف,,874739.0
3,https://database.factgrid.de/entity/Q389892,Point(45.7032 31.8254),Tall Jidar,تل جدر,318.0,912957.0
4,https://database.factgrid.de/entity/Q389891,Point(40.5872 36.7381),Tall Baiydar,تل بيدر,260.0,423885388.0
...,...,...,...,...,...,...
581,https://database.factgrid.de/entity/Q390009,Point(45.1764 35.5579),Kānī Shāyah,كاني شاية,,
582,https://database.factgrid.de/entity/Q390019,Point(43.4762 36.0446),Nigūb,نگوب,273.0,413309737.0
583,https://database.factgrid.de/entity/Q390073,Point(40.7929 37.7936),Ziyaret Tepe,,244.0,
584,https://database.factgrid.de/entity/Q390044,Point(36.9096 39.3084),Kuşaklı,,190.0,117143016.0


In [None]:
CIGS_MERGE = CIGS_MERGE.drop("_merge", axis = 1)
factgrid_merge = factgrid.merge(CIGS_MERGE, how = 'left', left_on = 'romanscript', right_on = 'transc_name', indicator= True)
factgrid_merge

Unnamed: 0.1,ancientplace,coord,romanscript,namehistory,cdli2,pleiades,Unnamed: 0,Double record?,qid,Sarwiki,...,wik_fas_y,wik_gre_y,wik_heb_y,wik_tr_y,rla_url,wikidata_url,lon_wgs1984_y,lat_wgs1984_y,geometry,_merge
0,https://database.factgrid.de/entity/Q389901,Point(43.2304 35.5931),Tall Ḥuwaysh,تل حويش,95.0,,58.0,,Q389901,,...,,,,,,,43.2304,35.5931,,both
1,https://database.factgrid.de/entity/Q389900,Point(41.166 36.816),Tall Ḥamīdī,تل حميدي,,874740.0,303.0,,Q389900,,...,,,,,,,41.1660,36.8160,,both
2,https://database.factgrid.de/entity/Q389898,Point(40.0399 36.8268),Tall Ḥalaf,تل حلف,,874739.0,289.0,,Q389898,,...,,,,,,,40.0399,36.8268,,both
3,https://database.factgrid.de/entity/Q389892,Point(45.7032 31.8254),Tall Jidar,تل جدر,318.0,912957.0,195.0,,Q389892,,...,,,,,,,45.7032,31.8254,,both
4,https://database.factgrid.de/entity/Q389891,Point(40.5872 36.7381),Tall Baiydar,تل بيدر,260.0,423885388.0,155.0,,Q389891,تل_بيدر,...,,,,,,https://wikidata.org/wiki/Q1658146,40.5872,36.7381,,both
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
603,https://database.factgrid.de/entity/Q390009,Point(45.1764 35.5579),Kānī Shāyah,كاني شاية,,,486.0,,Q390009,,...,,,,,,,45.1764,35.5579,,both
604,https://database.factgrid.de/entity/Q390019,Point(43.4762 36.0446),Nigūb,نگوب,273.0,413309737.0,164.0,,Q390019,,...,,,,,,,43.4762,36.0446,,both
605,https://database.factgrid.de/entity/Q390073,Point(40.7929 37.7936),Ziyaret Tepe,,244.0,,143.0,,Q390073,,...,,,,https://tr.wikipedia.org/wiki/Ziyaret_Tepe_Höyüğü,,https://wikidata.org/wiki/Q4819172,40.7929,37.7936,,both
606,https://database.factgrid.de/entity/Q390044,Point(36.9096 39.3084),Kuşaklı,,190.0,117143016.0,115.0,,Q390044,,...,,,,https://tr.wikipedia.org/wiki/Sarissa,,,36.9096,39.3084,,both


In [3]:
# factgrid_merge["P402_(Wikidata)"] = factgrid_merge['osm_id_x']
# factgrid_merge["P10689_(Wikidata)"] = factgrid_merge["osm_type_x"]

In [None]:
# factgrid_merge = factgrid_merge.drop("romanscript", axis =1 )

In [2]:
# factgrid_merge.columns

In [None]:
# factgrid_merge.to_csv('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/factgrid_merge.csv')

In [1]:
# id_columns = factgrid_merge[['qid','P402_(Wikidata)', 'P10689_(Wikidata)']]
# id_columns

In [None]:
# id_columns.to_csv('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/id_columns.csv')

In [None]:
# factgrid_loss = factgrid_merge.loc[factgrid_merge['_merge'] == 'left_only']
# factgrid_loss

In [None]:
# factgrid_loss.to_csv('/content/drive/MyDrive/FactGrid Cuneiform (AWCA)/geography/factgrid_loss.csv')

Next steps (10-26-22):

i. *Check / match with this dataset:* https://docs.google.com/spreadsheets/d/1P9OQSvQ18EkFg2AiI9QZpF5eQbvahy2wOZQqzKvOorI/edit?usp=sharing

ii. *Link `cdli_legacy` (with `romanscript`, `ancientplace`) to Chronology:*