# Advanced Topics in Data Engineering - Assignment 2

>Stamatis Sideris, f2822113 <br/>
>MSc in Business Analytics <br/>
>Department of Management Science and Technology <br/>
>Athens University of Economics and Business <br/>

## Import Libraries

In [8]:
import pandas as pd

## Import Data, Clean Data and Concatenate the Columns into One Token List per Row

In [9]:
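data = pd.read_csv('ER-Data.csv', sep=';')
#convert year to string (the years are floats, so they become e.g. '1996.0' and a missing year becomes 'nan')
data.year = data.year.astype(str)
#replace the remaining NAs with the string value 'None'
data = data.fillna('None')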

In [10]:
#convert every attribute to lowercase and split it on whitespace to create tokens
data.authors = data.authors.str.casefold().str.split()
data.year = data.year.str.casefold().str.split()
data.venue = data.venue.str.casefold().str.split()
data.title = data.title.str.casefold().str.split()

In [11]:
#concatenate the attributes' columns
data["tokens"] = data.authors+data.venue+data.year+data.title 

In [12]:
#some tokens still carry commas after the split, which we do not want in our tokens,
#so remove the commas from every token in every row
data["tokens"] = data.tokens.apply(lambda tokens: [t.replace(',', '') for t in tokens])
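
The cleaning steps above (lowercase, split on whitespace, strip commas) can also be written as one small helper. The sketch below is only an illustration in plain Python, not the notebook's own code: the function name `tokenize_record`, the `MISSING` set and the sample values are assumptions made for the example. Unlike the cells above, it drops missing-value placeholders during tokenization instead of removing them from the index later.

```python
import math

# Assumed placeholder strings that stand for a missing value once converted to text.
MISSING = {"", "nan", "none"}

def tokenize_record(*fields):
    """Lowercase each field, split on whitespace, strip commas, drop missing-value tokens."""
    tokens = []
    for field in fields:
        # Skip real missing values (None or float NaN) before converting to text.
        if field is None or (isinstance(field, float) and math.isnan(field)):
            continue
        for raw in str(field).casefold().split():
            token = raw.replace(",", "")
            if token not in MISSING:
                tokens.append(token)
    return tokens

# Hypothetical record: authors, venue, year, and a missing title.
example = tokenize_record("R Ramakrishnan, D Ram", "VLDB", 1996.0, None)
print(example)
```

Filtering placeholders at this stage means keys such as `nan` or `none` never reach the blocking index, so no clean-up step is needed afterwards.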

In [13]:
#drop the original attribute columns so that only id and tokens remain
data1 = data.drop(['authors','venue','title','year'], axis=1)

In [14]:
#display data final format
data1

| | id | tokens |
|---|---|---|
| 0 | 1 | [qd, inc, san, diego, nan, 11578, sorrento, va... |
| 1 | 2 | [as, argon, jg, hannoosh, phil., mag, nan, ini... |
| 2 | 3 | [gh, hansen, ll, wetterberg, h, sjã¶strã¶m, o,... |
| 3 | 4 | [tm, hammett, p, harmon, w, rhodes, see, nan, ... |
| 4 | 5 | [jr, cogdell, new, directions, for, teaching, ... |
| ... | ... | ... |
| 66874 | 66875 | [a, shukla, p, deshpande, j, naughton, k, rama... |
| 66875 | 66876 | [none, none, 2003.0, call, for, book, reviews] |
| 66876 | 66877 | [r, ramakrishnan, d, ram, vldb, 1996.0, modeli... |
| 66877 | 66878 | [j, shafer, r, agrawal, m, mehta, vldb, 1996.0... |


# Question A
Use the Token Blocking (not to be confused with Standard Blocking) method to create blocks in the form of K-V (Key-value) pairs. The key for every entry will be each distinct Blocking Key (BK) derived from the entities’ attribute values and the values for each BK will be the entities’ ids. Please note that the id column in the data can be used only as reference for the blocking index and it will NOT be used in the blocking process (block index creation). Please also note that you are advised to transform every string to lower case during the tokens’ creation (before you insert it in the index) to avoid mismatches. At the end of the creation use a function to pretty-print the index.

## Each token becomes a key in our dictionary. For every row, we go through its token list and append the row's id to the list stored under each token. A key is only useful as a block if it holds at least 2 ids.

In [15]:
#initialize the dictionary
kv_pairs = {}

In [16]:
#iterate over the rows
for i in range(len(data1)):
    #take every token in the row's token list
    for key in data1.tokens[i]:
        #create an empty list the first time a token is seen, then append the id of the row to it
        kv_pairs.setdefault(key, []).append(data1.id.iloc[i])
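
The index construction above can be illustrated on a tiny hand-made input. The following is a minimal sketch under stated assumptions, not the notebook's implementation: the function name `token_blocking` and the three toy records are made up for the example. It differs from the loop above in two ways: it collects ids in sets, so an entity that repeats a token is stored only once, and it drops blocks with fewer than two entities, since such blocks yield no candidate pairs.

```python
from collections import defaultdict

def token_blocking(records):
    """Map each token to the sorted ids of the entities that contain it."""
    index = defaultdict(set)
    for entity_id, tokens in records:
        for token in tokens:
            # A set ignores repeated tokens within the same entity.
            index[token].add(entity_id)
    # Keep only blocks that contain at least two distinct entities.
    return {token: sorted(ids) for token, ids in index.items() if len(ids) >= 2}

# Hypothetical (id, tokens) pairs standing in for the rows of data1.
toy_records = [
    (1, ["r", "agrawal", "vldb", "mining"]),
    (2, ["j", "shafer", "r", "agrawal", "vldb"]),
    (3, ["fish", "fish", "ecology"]),
]
blocks = token_blocking(toy_records)
print(blocks)
```

Only `r`, `agrawal` and `vldb` survive here, because every other token occurs in a single entity.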

In [17]:
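#delete the placeholder keys 'nan', 'none' and the empty string;
#the default value None avoids a KeyError if a key is not in the dictionary
kv_pairs.pop('nan', None)
kv_pairs.pop('none', None)
kv_pairs.pop('', None)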

[101, 135, 642, 776, 1555, 1853, 2908, 2915, 3538, 3538, 3538, 3538, 3538, 3538,
 3876, 4133, 4159, 5884, 6170, 6954, 7171, 7287, 7429, 8038, 8208, 8298, 8710,
 8973, 9010, 9099, 10178, 10443, 10584, 11833, 12044, 12197, 12365, 12814, 13601,
 13658, 13673, 13782, 13882, 14013, 14521, 14878, 14963, 15088, 15247, 15311,
 15375, 15442, 15462, 15699, 15845, 16010, 16120, 16620, 16746, 17246, 17755,
 17885, 18760, 19827, 19989, 20295, 20348, 20445, 20486, 20714, 20930, 21313,
 22114, 22325, 22852, 23341, 24191, 24332, 24471, 24961, 25097, 25495, 25528,
 25728, 26255, 26335, 26518, 26575, 27381, 27459, 28593, 28863, 28866, 29068,
 29297, 29302, 29517, 29574, 29633, 29967, 29977, 30222, 30225, 30252, 30266,
 30699, 30810, 31073, 31536, 31861, 31911, 31952, 32134, 32680, 32718, 32769,
 33033, 33078, 33719, 33889, 34623, 34964, 35851, 36722, 36919, 37737, 38133,
 38375, 38376,
 3
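
Printing every block of a large index in one go can overwhelm a notebook front end, so a pretty-printing helper that caps the output may be preferable. The sketch below is one possible way to do this, not the notebook's method: the name `pretty_print_index`, its `max_blocks` and `max_ids` parameters and the small `sample_index` are all assumptions made for illustration.

```python
def pretty_print_index(index, max_blocks=5, max_ids=10):
    """Print at most max_blocks blocks and at most max_ids ids per block."""
    lines = []
    for shown, (token, ids) in enumerate(index.items()):
        if shown == max_blocks:
            # Summarize the remaining blocks instead of printing them.
            lines.append(f"... {len(index) - max_blocks} more block(s) not shown")
            break
        head = ", ".join(str(i) for i in ids[:max_ids])
        tail = ", ..." if len(ids) > max_ids else ""
        lines.append(f"{token!r}: [{head}{tail}] ({len(ids)} ids)")
    text = "\n".join(lines)
    print(text)
    return text

# Hypothetical miniature index; the real kv_pairs dictionary has the same shape.
sample_index = {"qd": [1, 55360], "inc": [1, 852, 923], "san": [1, 8, 11, 65]}
summary = pretty_print_index(sample_index, max_blocks=2, max_ids=3)
```

Calling `pretty_print_index(kv_pairs)` would then show only the first few blocks, and the limits can be raised when more detail is needed.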

In [18]:
#print the key value pairs
for key,values in kv_pairs.items():
    print('Token:',key,'\n' 'Entities including it:',values,'\n')

Token: qd 
Entities including it: [1, 55360] 

Token: inc 
Entities including it: [1, 852, 923, 2857, 3057, 3486, 4378, 4854, 5589, 6339, 7038, 8574, 9368, 10500, 11596, 15004, 16358, 17005, 18337, 21912, 22216, 22275, 23987, 24308, 26475, 27244, 29327, 30987, 32596, 36256, 36411, 38590, 39000, 40028, 41393, 42111, 42685, 43073, 43918, 43918, 44647, 44908, 45497, 46763, 47124, 49406, 50987, 51988, 52505, 53149, 56545, 56805, 57323, 58857, 59169, 60213, 61288, 61397, 62978, 65238] 

Token: san 
Entities including it: [1, 8, 11, 65, 215, 219, 352, 382, 496, 574, 607, 698, 732, 743, 780, 826, 852, 923, 1184, 1290, 1375, 1603, 1673, 1805, 1907, 1958, 1980, 2001, 2004, 2076, 2147, 2263, 2340, 2415, 2658, 2816, 2857, 2897, 2922, 2944, 3057, 3104, 3117, 3204, 3342, 3434, 3463, 3468, 3601, 3654, 3655, 3715, 3718, 3759, 3812, 3892, 3918, 3962, 4031, 4200, 4203, 4266, 4378, 4384, 4449, 4581, 4664, 4766, 5087, 5424, 5830, 5937, 5997, 6065, 6109, 6194, 6433, 6529, 6589, 6600, 6650, 6749, 6802, 681

IOPub data rate exceeded.
The notebook server will temporarily stop sending output
to the client in order to avoid crashing it.

 marquet 
Entities including it: [1772] 

Token: mary 
Entities including it: [1772, 3511, 4006, 9657, 40375, 40968, 46200, 48989, 55219, 61826] 

Token: indo-pacific 
Entities including it: [1772, 31567] 

Token: fish 
Entities including it: [1772, 1834, 2018, 2698, 3599, 4592, 7645, 10021, 10030, 10311, 10806, 11426, 11469, 14249, 14361, 15730, 17412, 19281, 19281, 20384, 20973, 21333, 21393, 21756, 23614, 23950, 24928, 25755, 26260, 27714, 27818, 31567, 33142, 34693, 36453, 37015, 38950, 40307, 40693, 41875, 41942, 41942, 42806, 42862, 45496, 45553, 46134, 46699, 48909, 49119, 49272, 49402, 50481, 52160, 52584, 53214, 53531, 54091, 54215, 54505, 55216, 55977, 57412, 57574, 58096, 59031, 60442, 60442, 60442, 61007, 61007, 61770, 62940] 

Token: conference(noumea 
Entities including it: [1772] 

Token: caledonian 
Entities including it: [1772, 40301, 58280] 

Token: freshwater 
Entities including it: [1772, 2922, 4154, 6065, 6360, 6826, 7274, 16184, 23140, 26781, 31774, 38298, 41196, 


Token: gionis 
Entities including it: [1955, 6039, 6778, 7051, 10095, 19761, 26044, 42940, 57170, 60174, 64146, 64935, 65127, 65681] 

Token: seshadri 
Entities including it: [1955, 2477, 2793, 3210, 3399, 6081, 7051, 7079, 7359, 10095, 10103, 11554, 13463, 13765, 14201, 14252, 17080, 17419, 17649, 17857, 18253, 19683, 19693, 20303, 20999, 21889, 22146, 25877, 26322, 27107, 27523, 27947, 28780, 29145, 30225, 30446, 30711, 30827, 31035, 33370, 35410, 35530, 35980, 37217, 38396, 39941, 40119, 47874, 48646, 50869, 51076, 53492, 55794, 55905, 59572, 60854, 60918, 62260, 62985, 63220, 63670, 64284, 64320, 64419, 64589, 64877, 64939, 64943, 65052, 65077, 65127, 65202, 65245, 65279, 65315, 65333, 65432, 65615, 65725, 65958, 66046, 66066, 66088, 66137, 66140, 66144, 66192, 66692, 66695, 66817] 

Token: xtract: 
Entities including it: [1955, 7051, 10095, 65127] 

Token: descriptors 
Entities including it: [1955, 6755, 7051, 10095, 19243, 27313, 29526, 65127] 

Token: eliassi-rad 
Entities incl

Token: warmuth 
Entities including it: [2112, 6805, 11033, 12244, 25392, 38172, 59598] 

Token: christel 
Entities including it: [2113, 26235, 26786, 35545, 44003, 45528, 46460, 47754, 48847, 51191] 

Token: winkler 
Entities including it: [2113, 7047, 8668, 8927, 14075, 17700, 24419, 25289, 27214, 28891, 35545, 37215, 41195, 41501, 47778, 48432, 51191, 51872, 56019, 60258, 61806, 62817] 

Token: abstractions 
Entities including it: [2113, 3692, 14374, 14674, 17787, 20044, 24274, 25074, 28337, 28951, 35545, 37550, 41416, 44506, 51191, 51509, 59121, 63876] 

Token: heimbigner 
Entities including it: [2114, 45864] 

Token: process-centered 
Entities including it: [2114, 19141, 24029, 29674, 60149] 

Token: negri 
Entities including it: [2115, 16463, 32083, 32388, 48563, 62122] 

Token: pelagatti 
Entities including it: [2115, 18678, 32083, 32388, 41350, 43112, 56475, 62122, 62588] 

Token: mamoulis 
Entities including it: [2116, 2428, 4478, 6503, 10971, 18034, 18429, 28561, 32925, 43290,

Entities including it: [2289] 

Token: souvenirs 
Entities including it: [2289] 

Token: olston 
Entities including it: [2290, 2309, 2598, 18593, 20957, 21537, 28454, 30278, 30330, 36513, 40244, 46743, 50304, 50593, 54301, 63587, 64459, 64554, 64661, 64897, 65448, 65460, 65648] 

Token: best-effort 
Entities including it: [2290, 13837, 18899, 24168, 31452, 47963, 50245, 59660, 64661] 

Token: lamersdorf 
Entities including it: [2291, 35268, 36410, 58290] 

Token: bartelt 
Entities including it: [2291, 61300] 

Token: fahrenholtz 
Entities including it: [2291] 

Token: tu 
Entities including it: [2291, 4668, 5415, 16893, 19968, 23667, 27263, 30970, 31780, 36410, 37483, 38886, 52322, 60149, 66445] 

Token: key 
Entities including it: [2292, 3059, 4480, 4510, 4563, 4741, 5087, 5160, 5440, 7101, 7156, 7495, 7769, 9350, 10338, 11512, 11768, 12328, 12817, 13036, 13237, 13516, 14380, 14724, 15069, 15877, 16442, 17032, 17064, 17697, 18543, 19877, 20056, 20129, 22089, 23926, 24780, 24850, 25373

Token: ecology: 
Entities including it: [2459, 2893, 3345, 6065, 27162, 35395, 36294, 43653, 62325] 

Token: insect-plant 
Entities including it: [2460] 

Token: oviposition 
Entities including it: [2460, 40606] 

Token: plant-feeding 
Entities including it: [2460] 

Token: insects 
Entities including it: [2460, 5742, 10234, 11815, 13099, 17801, 32380, 36105, 38429, 40704, 47050, 51067, 51067, 51969, 58950, 58953] 

Token: messinger 
Entities including it: [2461] 

Token: rjt 
Entities including it: [2461] 

Token: barrer 
Entities including it: [2462] 

Token: spherical 
Entities including it: [2462, 6213, 16430, 17474, 20073, 27129, 27129, 30052, 30612, 32002, 32600, 34567, 34776, 35007, 35388, 38080, 38175, 46738, 49660, 49772, 53823, 54760, 59483, 63639, 63984, 64080] 

Token: shells 
Entities including it: [2462, 17855, 25784, 26574, 49914, 53823, 55499, 56831, 60678] 

Token: thermal 
Entities including it: [2462, 3613, 3867, 4360, 5309, 5452, 6593, 7158, 8047, 8261, 8332, 8951, 


Token: crawford 
Entities including it: [2625, 3789, 7072, 11899, 16133, 16204, 17979, 18675, 21759, 24510, 28631, 31270, 31598, 40301, 41796, 45851, 50189, 51665, 52340, 56136, 56188, 56233, 57007, 61209] 

Token: touman 
Entities including it: [2625] 

Token: middle? 
Entities including it: [2625] 

Token: aboul-magd 
Entities including it: [2626] 

Token: incorporating 
Entities including it: [2626, 2977, 4135, 5115, 5991, 6350, 11457, 13901, 22044, 23001, 23356, 24093, 24636, 25031, 27091, 28242, 28301, 28324, 28776, 29052, 33416, 35683, 37040, 38715, 39337, 40096, 43954, 45424, 45541, 46822, 47091, 47593, 50352, 52952, 54087, 54492, 55019, 55137, 55422, 61656, 64965, 66468] 

Token: b-isdn 
Entities including it: [2626, 3915, 12811, 48169] 

Token: paulk 
Entities including it: [2627, 51920, 54507] 

Token: konrad 
Entities including it: [2627, 22631, 22675, 44433] 

Token: iso 
Entities including it: [2627, 4342, 18480, 27407, 28160, 30940, 33339, 39666, 40462, 45790, 50883, 519

Token: cody 
Entities including it: [2816, 3785, 11024, 17498, 34807, 41172, 43977, 46368, 47683, 53131, 58121, 61260, 65744] 

Token: smallwood 
Entities including it: [2816] 

Token: vertebrate 
Entities including it: [2816, 19106, 24054, 27569, 30380, 33584, 36537, 40111, 46696, 48783, 50069, 57619] 

Token: râ??multlprocessor 
Entities including it: [2817] 

Token: jom 
Entities including it: [2817, 15283, 34151, 61444] 

Token: algorithmsâ?? 
Entities including it: [2817, 15918] 

Token: koster 
Entities including it: [2818, 4477, 17861, 31465, 58387] 

Token: tholen 
Entities including it: [2818, 46281] 

Token: howie 
Entities including it: [2818, 3352, 5473, 49940, 45084, 46859, 59993, 60642] 

Token: stacking-fault 
Entities including it: [2818, 23263, 59676] 

Token: ni--co-cr 
Entities including it: [2818] 

Token: flatt 
Entities including it: [2819, 14077, 37771] 

Token: ability: 
Entities including it: [2820, 6439, 40444] 

Token: ability 
Entities including it: [2820, 2


Token: multifaceted 
Entities including it: [3008, 8406, 52631] 

Token: tested? 
Entities including it: [3008] 

Token: illustrated 
Entities including it: [3008, 25842, 40559, 41849, 43285, 57786] 

Token: macrae 
Entities including it: [3009, 4876, 28029] 

Token: bodenhausen 
Entities including it: [3009, 4876, 20365, 28029, 32635, 42509, 61557] 

Token: stereotypes 
Entities including it: [3009, 4876, 12017, 13247, 19790, 19790, 24647, 39199, 39731, 42509, 46451, 49025, 56324, 59084, 61557, 62937] 

Token: energy-saving 
Entities including it: [3009, 55112] 

Token: devices: 
Entities including it: [3009, 52702] 

Token: peek 
Entities including it: [3009] 

Token: toolbox 
Entities including it: [3009, 5958, 7594, 14512, 15380, 16291, 21352, 23989, 25015, 25189, 26186, 26587, 27289, 36678, 61765, 62892, 66065] 

Token: mankato 
Entities including it: [3011, 3011, 47775] 

Token: guidelines. 
Entities including it: [3011, 17357, 18095] 

Token: nahar 
Entities including it: [3012


Token: encounters 
Entities including it: [3217, 13740, 19638, 31274, 44624] 

Token: milner 
Entities including it: [3218, 6134, 34009, 35111, 36170, 42822, 47048, 57507, 58710, 60683, 61038] 

Token: minicon: 
Entities including it: [3219, 66762] 

Token: perez 
Entities including it: [3220, 3696, 3757, 6049, 10986, 11649, 12861, 14973, 17491, 17921, 21332, 25336, 27513, 29235, 30498, 34122, 34506, 35104, 36563, 46124, 51054, 56265, 58499] 

Token: gartner: 
Entities including it: [3220] 

Token: self-inflicted 
Entities including it: [3220] 

Token: kinny 
Entities including it: [3221, 38966] 

Token: georgeff 
Entities including it: [3221, 25371] 

Token: multi-agent 
Entities including it: [3221, 3619, 5085, 6419, 7525, 10172, 14477, 14905, 17259, 17460, 18663, 20008, 22886, 23826, 24654, 28130, 28635, 31720, 31999, 32814, 33101, 35473, 36147, 41496, 43219, 48667, 49214, 50194, 51588, 54736, 54910, 55477, 57083, 60002, 62206, 62870, 64050, 64291] 

Token: storb 
Entities includin


Token: produce 
Entities including it: [3411, 12776, 36754, 43755, 44165, 56405] 

Token: ensembles 
Entities including it: [3411, 18586, 32164, 33967, 34214, 39539, 62406, 66427] 

Token: frajzyngier 
Entities including it: [3412] 

Token: dicto 
Entities including it: [3412] 

Token: language. 
Entities including it: [3412, 5325, 11546, 30602, 31649, 32436, 37251, 61004] 

Token: lakshminarayanan 
Entities including it: [3413, 6517, 51982, 53949] 

Token: wehrle 
Entities including it: [3413, 26998] 

Token: i3 
Entities including it: [3413] 

Token: tietze 
Entities including it: [3414, 38502, 51568] 

Token: yyj 
Entities including it: [3414] 

Token: tompkins 
Entities including it: [3415, 20510, 31345, 61881] 

Token: lewellen 
Entities including it: [3415] 

Token: tobin's 
Entities including it: [3415, 63624] 

Token: charity: 
Entities including it: [3568] 

Token: hutson 
Entities including it: [3416, 20322] 

Token: anglin 
Entities including it: [3416, 26062] 

Token: mall

Entities including it: [3616, 3616] 

Token: 1980-81 
Entities including it: [3616] 

Token: counselors 
Entities including it: [3616, 17780, 27160, 28857, 35240, 57270] 

Token: oe 
Entities including it: [3617, 5993, 6906, 9942, 11686, 12636, 22902, 23291, 26766, 30775, 32817, 33107, 43044, 50148, 54563] 

Token: klapp 
Entities including it: [3617] 

Token: ethos; 
Entities including it: [3617] 

Token: collected 
Entities including it: [3617, 11884, 22495, 25963, 31594, 32805, 40577, 43483, 47953, 50537] 

Token: hassenpflug 
Entities including it: [3618] 

Token: vetter 
Entities including it: [3618, 23047, 23348, 36493, 47714, 50831, 59006] 

Token: cã¡rdenas 
Entities including it: [3618, 25088] 

Token: thorn 
Entities including it: [3618, 33244] 

Token: surgeryâ??results 
Entities including it: [3618] 

Token: lomuscio 
Entities including it: [3619] 

Token: sergot 
Entities including it: [3619] 

Token: deontic 
Entities including it: [3619, 36669, 62107] 

Token: veitch 
En


Token: sukezane 
Entities including it: [3825] 

Token: ras-mediated 
Entities including it: [3825] 

Token: rala/phospholipase 
Entities including it: [3825] 

Token: growth-promoting 
Entities including it: [3825] 

Token: grhh 
Entities including it: [3826] 

Token: snoeren 
Entities including it: [3827] 

Token: tracback 
Entities including it: [3827] 

Token: ars 
Entities including it: [3828, 29726, 51296] 

Token: usda 
Entities including it: [3828, 22255, 28399, 47417] 

Token: germplasm 
Entities including it: [3828, 47198] 

Token: network-(grin).[online 
Entities including it: [3828] 

Token: database] 
Entities including it: [3828] 

Token: swedish 
Entities including it: [3829, 14535, 14535, 17674, 18176, 19564, 36332, 39053, 49464, 50482] 

Token: mercury 
Entities including it: [3829, 7980, 15492, 21947, 23853, 25818, 30218, 32584, 36041, 39735, 52410, 52468, 54053] 

Token: sweden 
Entities including it: [3829, 4647, 5084, 6042, 10582, 11464, 12010, 15283, 19863, 30938


Token: magee 
Entities including it: [4026, 12382, 30779, 37045, 46691, 49726] 

Token: shanks 
Entities including it: [4026, 6795] 

Token: federations 
Entities including it: [4026, 8998, 10755, 22646, 36016, 36507, 41510, 43388, 45826, 56892, 60468, 63118, 64329, 65598] 

Token: werahera 
Entities including it: [4027] 

Token: jayasumana 
Entities including it: [4027] 

Token: fddi 
Entities including it: [4027, 15540, 18538] 

Token: tt 
Entities including it: [4028, 4651, 5417, 6301, 6329, 7116, 7569, 8734, 9233, 11064, 14100, 16343, 16763, 21269, 22119, 22428, 23459, 24392, 26837, 28074, 28259, 30806, 33750, 33902, 34050, 34461, 35987, 37672, 37823, 47523, 50074, 52973, 53056, 53749, 61397] 

Token: chimpanzees 
Entities including it: [4028, 46526] 

Token: å? 
Entities including it: [4029, 20059, 20740, 22865, 28648, 39104, 39903] 

Token: gasification 
Entities including it: [4029] 

Token: selden 
Entities including it: [4030] 

Token: furbee 
Entities including it: [4030] 




Token: guzzo 
Entities including it: [4233] 

Token: decision-making 
Entities including it: [4233, 7358, 10276, 14654, 16236, 17689, 25652, 27278, 30160, 35238, 37964, 42990, 49157, 50307, 61189, 64696] 

Token: wilsonâ?¦ 
Entities including it: [4234, 19277, 20254, 46209] 

Token: peeling-ballooning 
Entities including it: [4234] 

Token: modes: 
Entities including it: [4234] 

Token: elms 
Entities including it: [4234] 

Token: pedestal? 
Entities including it: [4234] 

Token: koutsoupias 
Entities including it: [4235, 8283, 15147] 

Token: fixed 
Entities including it: [4235, 10765, 11878, 17388, 17789, 18548, 20642, 21188, 23231, 23919, 23985, 24082, 31611, 32091, 34787, 36091, 38059, 38442, 39355, 39861, 40156, 44036, 44721, 44870, 48124, 51440, 54719, 56129, 56391, 57950, 58540, 61069] 

Token: korpeoglu 
Entities including it: [4236, 35876, 66120] 

Token: rouchon 
Entities including it: [4721] 

Token: space: 
Entities including it: [4237, 4653, 14550, 16143, 18912, 20889, 28

Token: faut-tolerant 
Entities including it: [4440] 

Token: (ftcs 
Entities including it: [4440] 

Token: rauschmayer 
Entities including it: [4441] 

Token: renner 
Entities including it: [4441, 29786, 47347, 48594, 49310, 54362, 55394, 64839] 

Token: tube: 
Entities including it: [4441, 21256] 

Token: model-integrated 
Entities including it: [4441, 58281] 

Token: ferguson 
Entities including it: [4442, 8989, 16764, 17542, 18400, 21016, 26061, 28521, 30765, 39218, 42452, 47550, 53335, 54331, 56369, 60938, 60960] 

Token: khajenoori 
Entities including it: [4442, 46391] 

Token: macke 
Entities including it: [4442] 

Token: vandekerckhove 
Entities including it: [4443, 8264, 9246, 10548, 15511, 24439, 26748, 32102, 33186, 35317, 36314, 51419, 58394] 

Token: lilford 
Entities including it: [4443, 8264, 15511, 19203, 33186, 35317, 36314, 46854] 

Token: subfertility 
Entities including it: [4443, 9853, 10548, 15511, 24439, 26748, 58394] 

Token: szeliski 
Entities including it: [444

Entities including it: [4655] 

Token: corsi 
Entities including it: [4656, 28448] 

Token: dacorogna 
Entities including it: [4656, 13263, 14464, 16925, 24853, 62494] 

Token: zumbach 
Entities including it: [4656] 

Token: thurner 
Entities including it: [4657, 25473] 

Token: heiner 
Entities including it: [4657, 17564] 

Token: time-triggered 
Entities including it: [4657] 

Token: safety-related 
Entities including it: [4657] 

Token: gergen 
Entities including it: [4659, 19420, 26764, 58488] 

Token: ellsworth 
Entities including it: [4659, 50356, 51030, 52981] 

Token: maslach 
Entities including it: [4659, 18243] 

Token: seipel 
Entities including it: [4659, 53036] 

Token: obligation 
Entities including it: [4659, 29732, 35015, 54201] 

Token: donorresourcesandreactions 
Entities including it: [4659] 

Token: peixoto 
Entities including it: [4660] 

Token: oort 
Entities including it: [4660] 

Token: 520 
Entities including it: [4660, 7495, 49929] 

Token: birbal: 
Entities i

Token: outer 
Entities including it: [4868, 6336, 10123, 12174, 17003, 19928, 23345, 29831, 34550, 35918, 58621, 59692, 65875] 

Token: hair 
Entities including it: [4868, 10832, 37646, 49883, 54729] 

Token: admittance 
Entities including it: [4868] 

Token: strategizing 
Entities including it: [4870] 

Token: ginis 
Entities including it: [4871, 65981] 

Token: krupp 
Entities including it: [4871, 17411, 61846, 65981] 

Token: evolvable 
Entities including it: [4871, 7518, 11279, 17719, 36290, 55707, 65228, 65981] 

Token: minker 
Entities including it: [4872, 13158, 25467, 26813, 38364, 42371, 43047, 55755] 

Token: rights: 
Entities including it: [4872, 6147] 

Token: mode-an 
Entities including it: [4873] 

Token: truffet 
Entities including it: [4874] 

Token: polygon 
Entities including it: [4874, 12830, 55299] 

Token: amalgamation 
Entities including it: [4874, 12031] 

Token: vempala 
Entities including it: [4875, 6769, 25927, 33622, 55144] 

Token: 4/3 
Entities including it

Token: talbot 
Entities including it: [5103, 29087, 36249, 58942, 62023] 

Token: poly-adenylated 
Entities including it: [5103] 

Token: lauder 
Entities including it: [5103, 18916, 59791, 63033] 

Token: whips 
Entities including it: [5105, 9146, 11080, 11629, 45783, 65637] 

Token: maintenance. 
Entities including it: [5105, 28913] 

Token: asnapshot 
Entities including it: [5106] 

Token: refresh 
Entities including it: [5106, 6338, 9864, 28293, 29061, 30276, 39226, 45390, 61176, 65169] 

Token: sperber 
Entities including it: [5107, 25517, 62941] 

Token: thiemann 
Entities including it: [5107, 8804, 21401, 24250, 25517, 39447, 62817] 

Token: craft: 
Entities including it: [5107] 

Token: bindingtime 
Entities including it: [5107] 

Token: scahill 
Entities including it: [5108, 10386, 19199] 

Token: lear 
Entities including it: [5108] 

Token: prspeech 
Entities including it: [5108] 

Token: recognition-making 
Entities including it: [5108] 

Token: petrik 
Entities including it


Token: moninger 
Entities including it: [5347, 27524, 55786] 

Token: detman 
Entities including it: [5347, 27524] 

Token: bringi 
Entities including it: [5347, 11687, 12370, 13747, 27524, 42921, 55786] 

Token: melting 
Entities including it: [5347, 7677, 18146, 24216, 27524, 38532, 42921, 45973, 56759] 

Token: maypole((for 
Entities including it: [5347] 

Token: hydrometeors)) 
Entities including it: [5347] 

Token: mallach 
Entities including it: [5348] 

Token: aren 
Entities including it: [5348] 

Token: â??t 
Entities including it: [5348] 

Token: mawhin 
Entities including it: [5349] 

Token: moreno 
Entities including it: [5350, 6097, 10520, 13425, 16353, 29170, 29209, 32149, 34252, 43077, 56988, 63581] 

Token: tressaud 
Entities including it: [5350] 

Token: chaminade 
Entities including it: [5350] 

Token: cryst. 
Entities including it: [5350, 30115, 35328] 

Token: amorph. 
Entities including it: [5350] 

Token: jeffries 
Entities including it: [5351, 33699, 37195, 51144

Entities including it: [5571, 57746] 

Token: mccandless 
Entities including it: [5571, 24133, 34327, 57746, 61202] 

Token: wear 
Entities including it: [5571, 5571, 20581, 28487, 54246, 61202, 61202] 

Token: francken 
Entities including it: [5572, 5827] 

Token: vancorenland 
Entities including it: [5572] 

Token: gielen 
Entities including it: [5572, 8731, 59476, 62843] 

Token: computerâ?? 
Entities including it: [5572, 43868] 

Token: daisy: 
Entities including it: [5572, 36958] 

Token: simulationâ??based 
Entities including it: [5572] 

Token: highâ??level 
Entities including it: [5572] 

Token: krauthgamer 
Entities including it: [5573] 

Token: berkmann 
Entities including it: [5574] 

Token: b. 
Entities including it: [5574, 5956, 6520, 10053, 10118, 11412, 11825, 12153, 15049, 16069, 16121, 16500, 16665, 16824, 17054, 17368, 17437, 17962, 20275, 20670, 21103, 21287, 21623, 22822, 24961, 25853, 26088, 26785, 27605, 28212, 29009, 29705, 30952, 31738, 31803, 31880, 32537, 3387

Entities including it: [5787, 48755] 

Token: website 
Entities including it: [5787, 36394, 37004, 48633, 48755, 49949, 54553] 

Token: (h 
Entities including it: [5787, 30807, 62262] 

Token: ttp://www. 
Entities including it: [5787] 

Token: arl. 
Entities including it: [5787, 48755] 

Token: noaa. 
Entities including it: [5787, 48755] 

Token: gov 
Entities including it: [5787] 

Token: riccio 
Entities including it: [5789, 6961, 16326, 19736, 20809, 22504, 22581, 27578, 35594, 36289, 37552, 43887, 48170, 48405, 49577, 51048, 54370, 59604, 60569] 

Token: employment: 
Entities including it: [5789] 

Token: knapper 
Entities including it: [5790, 14080, 20759, 28756, 30683, 35497, 43208, 53309, 55844] 

Token: aplin 
Entities including it: [5791] 

Token: yã¡ã±ez 
Entities including it: [5792] 

Token: adrio 
Entities including it: [5792] 

Token: saccus 
Entities including it: [5792] 

Token: vasculosus 
Entities including it: [5792] 

Token: (salmo 
Entities including it: [5792, 389


Token: ofcurrentworkowmanagement 
Entities including it: [6007] 

Token: pink. 
Entities including it: [6008] 

Token: smallforwardingtablesforfastrouting 
Entities including it: [6008] 

Token: hammel 
Entities including it: [6009, 66508] 

Token: focardi 
Entities including it: [6010, 63720] 

Token: martinelli 
Entities including it: [6010] 

Token: gone?. 
Entities including it: [6011] 

Token: menelaos 
Entities including it: [6012] 

Token: fotis 
Entities including it: [6012] 

Token: iakovos 
Entities including it: [6012] 

Token: gennaro 
Entities including it: [6012, 19684] 

Token: mistakes. 
Entities including it: [6013] 

Token: galabov 
Entities including it: [6015] 

Token: imdct 
Entities including it: [6015] 

Token: mp3 
Entities including it: [6015, 26237, 48864] 

Token: dct 
Entities including it: [6015, 17558, 32890, 42613, 49299, 51677, 61412] 

Token: instances 
Entities including it: [6016, 9140, 19437, 21353, 24283, 24778, 49395, 59534, 66577] 

Token: simã©o

Entities including it: [6280] 

Token: closing 
Entities including it: [6280, 12304, 19968, 28765, 39950, 48148, 48804, 61082, 63813, 65272, 66187] 

Token: mancini 
Entities including it: [6281, 44732, 45118, 55606] 

Token: webâ» 
Entities including it: [6282, 50018] 

Token: rill 
Entities including it: [6283] 

Token: biochem. 
Entities including it: [6283, 52032] 

Token: (1989) 
Entities including it: [6283, 24328, 28494, 31300, 58695, 63298] 

Token: 3243.(b) 
Entities including it: [6283] 

Token: bruice 
Entities including it: [6283] 

Token: mazumder 
Entities including it: [6283, 17122, 30151] 

Token: navroji 
Entities including it: [6284] 

Token: 1928.0 
Entities including it: [6284, 31589, 49286] 

Token: teachings 
Entities including it: [6284, 56168] 

Token: zarathushtra 
Entities including it: [6284] 

Token: hines 
Entities including it: [6285, 12369, 12924, 20607, 29123, 30249, 44859, 63070] 

Token: de-optimization 
Entities including it: [6285] 

Token: re-optimi


Token: sigeomm 
Entities including it: [6520] 

Token: bhattacharjee. 
Entities including it: [6520] 

Token: kommareddy. 
Entities including it: [6520] 

Token: bassani 
Entities including it: [6521] 

Token: ciulli 
Entities including it: [6521] 

Token: leeds-lyon 
Entities including it: [6521, 10513, 33783] 

Token: symposiumon 
Entities including it: [6521] 

Token: tribology. 
Entities including it: [6521] 

Token: lubricant 
Entities including it: [6521, 35498] 

Token: interferometry 
Entities including it: [6521, 9453, 10984, 28017, 32050, 35990, 42903] 

Token: crocker 
Entities including it: [6522, 6682, 11953, 20157, 20754, 20790, 44525, 45744, 47508, 54965, 58327, 63966] 

Token: seeking 
Entities including it: [6522, 13871, 15111, 17969, 18897, 22004, 23722, 25188, 28408, 28861, 30596, 31733, 35671, 40939, 43767, 50708, 51754, 66672] 

Token: hiemstra 
Entities including it: [6523, 7414, 14922, 22910, 22942, 26402, 33146, 36999, 59436, 60147] 

Token: whence 
Entities in

Token: sager 
Entities including it: [6759, 7063, 44570, 47156, 48938] 

Token: firewalling 
Entities including it: [6759] 

Token: yoshizawa 
Entities including it: [6760, 13511, 26416] 

Token: helmke 
Entities including it: [6760] 

Token: starkov 
Entities including it: [6760] 

Token: ungson 
Entities including it: [6762, 42605] 

Token: complementarity 
Entities including it: [6762, 10354, 22993, 23907, 26915, 28999, 32874, 49944, 50860, 51046, 53697, 59582] 

Token: boos 
Entities including it: [6763, 48354] 

Token: muller-deile 
Entities including it: [6763] 

Token: ohrdorf 
Entities including it: [6763] 

Token: flutter 
Entities including it: [6763, 8213, 11391, 17402] 

Token: neonate-severe 
Entities including it: [6763] 

Token: electrolyte 
Entities including it: [6763, 16311, 47913] 

Token: imbalance 
Entities including it: [6763, 8086, 63298] 

Token: tract 
Entities including it: [6763, 12299, 15786, 20910, 25794, 26009, 28131, 30166, 30830, 34644, 38816, 40401, 521

Entities including it: [7018, 23459, 44133, 47746, 56943, 66398] 

Token: cube 
Entities including it: [7018, 8751, 13821, 14588, 17989, 19070, 20200, 32890, 34914, 35552, 37623, 41332, 44114, 47490, 47746, 49674, 49773, 50396, 51698, 53887, 54225, 61157, 64710, 65360, 65471, 65507, 66319, 66398, 66873] 

Token: qc-tree: 
Entities including it: [7018] 

Token: summarizations 
Entities including it: [7018] 

Token: linux 
Entities including it: [7019, 28414, 30250, 48188, 55877, 59077] 

Token: deepening 
Entities including it: [7019, 49793] 

Token: foothold 
Entities including it: [7019] 

Token: schoonbaert 
Entities including it: [7020] 

Token: jozsa 
Entities including it: [7021] 

Token: makai 
Entities including it: [7021] 

Token: reroute 
Entities including it: [7021] 

Token: mpls 
Entities including it: [7021, 25332, 26538, 34879, 38688] 

Token: armon 
Entities including it: [7022] 

Token: thrombolysis 
Entities including it: [7022, 19083, 28218, 45435, 62300] 

Token: hon

Token: enteral 
Entities including it: [7284, 15801, 41346, 47391, 52649, 53369] 

Token: diuretics 
Entities including it: [7284, 32510, 54162] 

Token: (or 
Entities including it: [7284, 16213, 17003, 21403, 29875, 32510, 48965, 54162, 58916] 

Token: developing) 
Entities including it: [7284, 32510, 54162] 

Token: cremer 
Entities including it: [7285] 

Token: gore 
Entities including it: [7285, 39467, 45584, 56000] 

Token: kirkup 
Entities including it: [7285] 

Token: minn 
Entities including it: [7285] 

Token: melles 
Entities including it: [7285] 

Token: quaternary 
Entities including it: [7285, 15427, 32101, 36009, 49900, 49972, 50920, 59963, 61706] 

Token: windmill 
Entities including it: [7285] 

Token: antarctica-initial 
Entities including it: [7285] 

Token: klausner 
Entities including it: [7286, 19863, 34625] 

Token: multirelations 
Entities including it: [7286] 

Token: semantice 
Entities including it: [7286] 

Token: zolan 
Entities including it: [7288] 

Token: gajski 
Entities including it: [7520, 20676, 28082, 32389, 48217] 

Token: kuck 
Entities including it: [7520, 20676, 29796, 32389, 56854, 61404] 

Token: lawrie 
Entities including it: [7520, 13073, 54390, 55261] 

Token: sameh 
Entities including it: [7520] 

Token: sigarch 
Entities including it: [7520] 

Token: cedar: 
Entities including it: [7520] 

Token: enc 
Entities including it: [7522, 12232, 27323] 

Token: andrade 
Entities including it: [7522, 12232, 14533, 27323, 30969, 33177, 39172, 50928, 60475] 

Token: liquids-part 
Entities including it: [7522] 

Token: saraglar 
Entities including it: [7523] 

Token: khudanpur 
Entities including it: [7523] 

Token: pronunciation 
Entities including it: [7523, 7523, 16556, 23101, 32583, 53256] 

Token: ravichandran 
Entities including it: [7524, 41932] 

Token: lertwongsatien 
Entities including it: [7524] 

Token: resource-based 
Entities including it: [7524, 27655, 31636, 36481, 40210, 44030, 51712, 55140, 58109, 58235, 58

Entities including it: [7774] 

Token: sambin 
Entities including it: [7775, 40733] 

Token: valentini 
Entities including it: [7775] 

Token: tool-box 
Entities including it: [7775] 

Token: martin-lof 
Entities including it: [7775, 50292] 

Token: intuitionistic 
Entities including it: [7775, 21502] 

Token: â??active 
Entities including it: [7776] 

Token: programmierung 
Entities including it: [7776, 28659, 28941, 33559, 48921, 51832, 56485, 60431] 

Token: comprehending 
Entities including it: [7777, 9457] 

Token: decompositions: 
Entities including it: [7777] 

Token: opearators 
Entities including it: [7778] 

Token: efrat 
Entities including it: [7779, 9652, 40280, 54589] 

Token: guibas 
Entities including it: [7779, 59147, 62959] 

Token: hall-holt 
Entities including it: [7779] 

Token: polyhedral 
Entities including it: [7779, 11758, 14119, 16603, 34180, 34232, 34641, 55077, 60773, 61761] 

Token: polyzotis 
Entities including it: [7780, 37955, 42689, 43865, 46469, 64804, 

Token: lebling 
Entities including it: [8040] 

Token: zork: 
Entities including it: [8040] 

Token: fantasy 
Entities including it: [8040, 16265, 18424] 

Token: warch 
Entities including it: [8041] 

Token: illnesses 
Entities including it: [8041, 34126, 40360, 43449] 

Token: pitassi 
Entities including it: [8042, 14350, 28668] 

Token: impagliazzo 
Entities including it: [8042, 14350, 23682, 28668, 30748, 35158, 50771, 53202] 

Token: conformance 
Entities including it: [8043, 28394, 43935, 47518] 

Token: agreements 
Entities including it: [8043, 11895, 19778, 29654, 33163, 33872, 38825, 54980, 58677, 60230] 

Token: navas 
Entities including it: [8044, 10953, 64971] 

Token: gps-based 
Entities including it: [8044, 18606, 36875] 

Token: curtis-prior 
Entities including it: [8045] 

Token: obesity 
Entities including it: [8045, 29166, 29563, 30397, 31429, 38251, 43655, 46815, 51061] 

Token: ormoneit 
Entities including it: [8046, 42725] 

Token: glynn 
Entities including it: [80

Entities including it: [8298, 13379, 14678, 36004, 40839, 55671, 57265, 57486, 60261] 

Token: kataoka 
Entities including it: [8299, 9961, 22496, 27103, 65295] 

Token: ohmura 
Entities including it: [8299, 17393, 59065] 

Token: hamano 
Entities including it: [8299] 

Token: augmentation 
Entities including it: [8299, 21000, 26066, 39036, 42464, 52744] 

Token: wake 
Entities including it: [8299, 10986, 14171, 16276, 26084, 40044, 43853, 52070, 60498] 

Token: propertius 
Entities including it: [8300] 

Token: 4.9 
Entities including it: [8300] 

Token: toils 
Entities including it: [8300] 

Token: historicism. 
Entities including it: [8300] 

Token: downey 
Entities including it: [8301, 10258, 12883, 13209, 27300, 28212, 28288, 35807, 36129, 36949, 44432, 51977, 52539, 58173, 60083] 

Token: skeptic 
Entities including it: [8301] 

Token: doctor-patient 
Entities including it: [8302, 53005] 

Token: abuknesha 
Entities including it: [8303, 37671] 

Token: al-mazeedi 
Entities includ

Entities including it: [8554] 

Token: oakman 
Entities including it: [8554] 

Token: nonhypnotic 
Entities including it: [8554] 

Token: suggestibility 
Entities including it: [8554] 

Token: hypnotic 
Entities including it: [8554, 30675] 

Token: databases:[research 
Entities including it: [8555] 

Token: report] 
Entities including it: [8555] 

Token: cubbage 
Entities including it: [8556] 

Token: polycyclic 
Entities including it: [8556, 25829, 59696] 

Token: hydrocarbons 
Entities including it: [8556, 19662, 24309, 26983, 30468, 33397, 43535, 57309, 58967] 

Token: wyckoff 
Entities including it: [8556, 26220, 63816] 

Token: tocqueville 
Entities including it: [8557] 

Token: weis69 
Entities including it: [8558] 

Token: giuliano 
Entities including it: [8559] 

Token: sending 
Entities including it: [8559, 33070, 56381] 

Token: artifact 
Entities including it: [8559, 18378] 

Token: artifact: 
Entities including it: [8559] 

Token: stiglitz 
Entities including it: [8560] 

Token: sharman 
Entities including it: [8833, 16043, 18615, 39803, 41507, 43089] 

Token: winterbottom 
Entities including it: [8833, 19775, 24332, 30755, 47553, 66251] 

Token: ndb. 
Entities including it: [8833] 

Token: pã¶tke 
Entities including it: [8834] 

Token: 26thint. 
Entities including it: [8834] 

Token: inobject-relational 
Entities including it: [8834] 

Token: behaviour: 
Entities including it: [8835, 28270, 47624, 62702, 63464] 

Token: kardaras 
Entities including it: [8836] 

Token: simulate 
Entities including it: [8836, 9552, 9656, 28635, 30583, 51854] 

Token: curler: 
Entities including it: [8838] 

Token: macro-actions 
Entities including it: [8839] 

Token: arrays- 
Entities including it: [8841] 

Token: ukpokoduâ?¦ 
Entities including it: [8843] 

Token: pegagogy: 
Entities including it: [8843] 

Token: empowerment. 
Entities including it: [8843, 21503] 

Token: pearl: 
Entities including it: [8844] 

Token: rohall 
Entities including it: [8845, 41558] 

Token: langholz 
Entities including it: [9069, 58847] 

Token: demestichas 
Entities including it: [9070] 

Token: papadopoulou 
Entities including it: [9070, 19155] 

Token: stavroulaki 
Entities including it: [9070, 24993] 

Token: 3g: 
Entities including it: [9070, 14111] 

Token: (january 
Entities including it: [9071, 21940] 

Token: imieliå?ski 
Entities including it: [9072, 27570, 46704, 60784] 

Token: soundalgekar 
Entities including it: [9073, 66455, 66547] 

Token: curbera 
Entities including it: [9074] 

Token: goland 
Entities including it: [9074, 9925, 13770] 

Token: (bpel4ws 
Entities including it: [9074] 

Token: 1.0) 
Entities including it: [9074] 

Token: ankney 
Entities including it: [9075] 

Token: alisauskas 
Entities including it: [9075] 

Token: waterfowl 
Entities including it: [9075, 30150] 

Token: iwasa 
Entities including it: [9076] 

Token: mitogen-activated 
Entities including it: [9076] 

Token: p38 
Entities including it: [9076, 46913] 

Token: defines

Token: kruglyakov 
Entities including it: [9355, 63961] 

Token: tyurin 
Entities including it: [9355] 

Token: conf.) 
Entities including it: [9355, 22340] 

Token: penza: 
Entities including it: [9355] 

Token: penza 
Entities including it: [9355, 46254] 

Token: materialy 
Entities including it: [9355] 

Token: nauchno-tekhnicheskoi 
Entities including it: [9355] 

Token: konf 
Entities including it: [9355] 

Token: sanwal 
Entities including it: [9356] 

Token: continuous-time 
Entities including it: [9356, 20008, 24826, 27619, 35705, 37560, 42606, 44559, 46929, 54018, 57411, 61231] 

Token: sigmodâ??sigact 
Entities including it: [9357, 51559] 

Token: krishnamurthyâ?? 
Entities including it: [9357] 

Token: existential 
Entities including it: [9357, 23660, 29219, 31533, 39523, 46236, 53283, 61935] 

Token: queriesâ?? 
Entities including it: [9357, 39598] 

Token: dresser 
Entities including it: [9358] 

Token: giannelli 
Entities including it: [9358] 

Token: szabo 
Entities incl

Entities including it: [9609, 29097] 

Token: pedagogy: 
Entities including it: [9609, 41038, 50436] 

Token: siblings 
Entities including it: [9609, 60584] 

Token: rarely 
Entities including it: [9609] 

Token: gomez-mejia 
Entities including it: [9610, 18390, 41793, 47405] 

Token: balkin 
Entities including it: [9610, 41793] 

Token: pay: 
Entities including it: [9610, 11643, 15223, 17041, 30926, 40150, 62734] 

Token: blose 
Entities including it: [9611] 

Token: graduation 
Entities including it: [9611, 9611, 38107, 45881] 

Token: rates: 
Entities including it: [9611] 

Token: wildfire 
Entities including it: [9614, 12947] 

Token: homeowners 
Entities including it: [9614] 

Token: bfls 
Entities including it: [9615] 

Token: fortnow 
Entities including it: [9615] 

Token: scandariato 
Entities including it: [9616] 

Token: worms 
Entities including it: [9616, 31443, 43879] 

Token: katsevich 
Entities including it: [9617] 

Token: zamyatin 
Entities including it: [9617] 

Entities including it: [9875] 

Token: weatherley 
Entities including it: [9878] 

Token: jcdl 
Entities including it: [9878, 17611] 

Token: khoo 
Entities including it: [9878, 17509, 57281] 

Token: m.: 
Entities including it: [9878, 53596, 63465] 

Token: reviewing 
Entities including it: [9878, 11160, 22954] 

Token: kadis 
Entities including it: [9880] 

Token: strictly 
Entities including it: [9880, 19298, 61136, 62416] 

Token: protected 
Entities including it: [9880, 23412, 26483, 33277, 34781, 39480, 43709, 43709, 44124] 

Token: cyprus 
Entities including it: [9880, 11074, 21716, 21716, 28251, 59128] 

Token: eady 
Entities including it: [9881, 58368] 

Token: minocycline 
Entities including it: [9881] 

Token: vulgaris: 
Entities including it: [9881] 

Token: attributes. 
Entities including it: [9882, 17000] 

Token: oudjit 
Entities including it: [9883] 

Token: k-median 
Entities including it: [9883, 45323, 56476] 

Token: md; 
Entities including it: [9884, 9884, 9884, 988

Token: kerschberg 
Entities including it: [10160, 18585, 25442, 29236, 38697, 38846, 56943, 64792] 

Token: query-initiated 
Entities including it: [10160] 

Token: sunley 
Entities including it: [10161, 14461] 

Token: krugman's 
Entities including it: [10161] 

Token: claims: 
Entities including it: [10162] 

Token: geography: 
Entities including it: [10163, 10304, 42756, 43440, 51348] 

Token: reason 
Entities including it: [10163, 21173, 64266] 

Token: caution? 
Entities including it: [10163] 

Token: l&amp;man 
Entities including it: [10164] 

Token: dlfferenhal 
Entities including it: [10164] 

Token: apphcatlon 
Entities including it: [10164] 

Token: mamtenance 
Entities including it: [10164] 

Token: bonewald 
Entities including it: [10165] 

Token: bilezikian 
Entities including it: [10165, 53661] 

Token: raisz 
Entities including it: [10165, 53661] 

Token: rodan 
Entities including it: [10165] 

Token: first: 
Entities including it: [10166, 27639, 34382, 37368, 56584, 581


Token: schegloff 
Entities including it: [10417, 37839] 

Token: schoene 
Entities including it: [10418, 43012] 

Token: prolotherapy 
Entities including it: [10418] 

Token: injections 
Entities including it: [10418, 59284] 

Token: bhargavan 
Entities including it: [10421, 17292, 36517, 39129, 46911] 

Token: holtzblatt 
Entities including it: [10422] 

Token: articulating 
Entities including it: [10422, 56982] 

Token: transparency: 
Entities including it: [10422] 

Token: kruglinskiâ?¦ 
Entities including it: [10423] 

Token: â??programming 
Entities including it: [10423] 

Token: conner 
Entities including it: [10424, 12442, 12625, 16281, 40033, 46944, 48154, 49117, 55378] 

Token: gerlaâ??impact 
Entities including it: [10425] 

Token: vance 
Entities including it: [10426, 12715, 12910, 21321, 33984, 35583, 35666, 36366, 54309, 56117, 62363, 65346, 65684, 65689] 

Token: worker: 
Entities including it: [10426] 

Token: dyck 
Entities including it: [10427, 15991, 16877, 40464, 45

Entities including it: [10714] 

Token: inter-operation 
Entities including it: [10714, 24661] 

Token: creatively. 
Entities including it: [10715, 35502] 

Token: gradshteyn 
Entities including it: [10717, 35450, 57081, 60011] 

Token: ryzhik 
Entities including it: [10717, 13954, 35450, 57081, 60011] 

Token: buhaug 
Entities including it: [10718] 

Token: lujala 
Entities including it: [10718] 

Token: scale: 
Entities including it: [10718, 17342, 26454, 34458, 35052, 43661, 47397, 51768, 57077, 60206] 

Token: annamalai 
Entities including it: [10719, 15921, 25656, 65120, 65154] 

Token: acceptance 
Entities including it: [10720, 11643, 15288, 17340, 27140, 32556, 33356, 34442, 37256, 43473, 44198, 44449, 54015, 55000, 55767, 60818, 61826] 

Token: macwilliams 
Entities including it: [10721] 

Token: sloane 
Entities including it: [10721, 48009] 

Token: pseudo-random 
Entities including it: [10721, 14800, 23896, 48848, 60690] 

Token: ariola 
Entities including it: [10722, 18529] 

Entities including it: [10973] 

Token: anethole 
Entities including it: [10973] 

Token: dithiolethione 
Entities including it: [10973] 

Token: n-acetylcysteine 
Entities including it: [10973] 

Token: stimulates 
Entities including it: [10973] 

Token: daskalakou 
Entities including it: [10974] 

Token: postfire 
Entities including it: [10974] 

Token: aleppo 
Entities including it: [10974] 

Token: (pinus 
Entities including it: [10974] 

Token: halepensis) 
Entities including it: [10974] 

Token: palaniswami 
Entities including it: [10975, 25919, 33486, 33502, 48743, 66847] 

Token: raigorodski 
Entities including it: [10977] 

Token: stavrinos 
Entities including it: [10977] 

Token: p57^ 
Entities including it: [10978] 

Token: k^ 
Entities including it: [10978] 

Token: i^ 
Entities including it: [10978] 

Token: p^ 
Entities including it: [10978] 

Token: islet 
Entities including it: [10978, 20645, 33936, 37349, 49192] 

Token: hyperinsulinism 
Entities including it: [10978] 

Token: schallehn 
Entities including it: [11249, 12736, 13398, 66316] 

Token: quiet: 
Entities including it: [11249, 66316] 

Token: query-driven 
Entities including it: [11249, 13398, 66316] 

Token: kirksey 
Entities including it: [11252, 34300] 

Token: holt-ashley 
Entities including it: [11252] 

Token: autoerotic 
Entities including it: [11252] 

Token: asphyxia 
Entities including it: [11252, 16952] 

Token: politicized 
Entities including it: [11255] 

Token: (vldbâ??98)( 
Entities including it: [11256] 

Token: afrigraph 
Entities including it: [11257] 

Token: impressions 
Entities including it: [11257, 30394, 49546, 53323] 

Token: crocca 
Entities including it: [11258] 

Token: codevelopment 
Entities including it: [11258] 

Token: kenaga 
Entities including it: [11260, 51232] 

Token: 75 
Entities including it: [11260] 

Token: pesticides 
Entities including it: [11260, 36754, 38234, 40628] 

Token: shay 
Entities including it: [11261, 57731] 

Token: brasiskyte 
Entities

Entities including it: [11564] 

Token: soto 
Entities including it: [11565, 29209, 30788] 

Token: medina 
Entities including it: [11565, 15940, 16990, 17270, 33217, 48033, 58841] 

Token: dember 
Entities including it: [11565] 

Token: permethrin-impregnated 
Entities including it: [11565] 

Token: uniforms 
Entities including it: [11565] 

Token: leishmaniasis 
Entities including it: [11565] 

Token: case: 
Entities including it: [11566] 

Token: taligent: 
Entities including it: [11567] 

Token: sfikant 
Entities including it: [11571] 

Token: ivancic 
Entities including it: [11572, 28841, 52003] 

Token: pragocrypt 
Entities including it: [11573] 

Token: eternity 
Entities including it: [11573] 

Token: ivlev 
Entities including it: [11574] 

Token: kopnin 
Entities including it: [11574, 12091, 13631, 14584, 54251] 

Token: lattice: 
Entities including it: [11574] 

Token: high-t 
Entities including it: [11574, 12171, 37987] 

Token: superconductors(abstract 
Entities including i

Entities including it: [11832, 31439] 

Token: ends 
Entities including it: [11832, 21109, 38019, 40438, 41039] 

Token: jaime 
Entities including it: [11833, 19851, 65178, 66696] 

Token: empirically-grounded 
Entities including it: [11834] 

Token: gianola 
Entities including it: [11835, 14202] 

Token: covariances 
Entities including it: [11835, 14202, 35368, 36506, 48269] 

Token: pu. 
Entities including it: [11836, 14658, 18436, 30694, 40669, 58230, 64197] 

Token: pdcat 
Entities including it: [11837] 

Token: mmun 
Entities including it: [11837] 

Token: ations 
Entities including it: [11837] 

Token: cutter 
Entities including it: [11838, 13863, 20116, 24741, 33632] 

Token: hodgson 
Entities including it: [11838, 17615, 26685, 28405, 35093, 37177, 43132, 45184, 46107, 50085, 57736] 

Token: subsidized 
Entities including it: [11838] 

Token: inequities: 
Entities including it: [11838] 

Token: patterning 
Entities including it: [11838, 14368, 18882, 35964, 46822, 57032, 63875]


Token: junkan 
Entities including it: [12168, 14462, 52897] 

Token: life-threatening 
Entities including it: [12168, 16364, 50991] 

Token: probably 
Entities including it: [12168, 44516] 

Token: psychotropic 
Entities including it: [12168, 44444] 

Token: hannaway 
Entities including it: [12169] 

Token: cardwell 
Entities including it: [12170, 12458, 15923] 

Token: environgenics 
Entities including it: [12170] 

Token: mizsei 
Entities including it: [12171] 

Token: uusimaki 
Entities including it: [12171, 33948] 

Token: superconductors 
Entities including it: [12171, 12805, 46766, 59513, 60248] 

Token: c)(abstract 
Entities including it: [12171] 

Token: nikias 
Entities including it: [12173] 

Token: raghuveer 
Entities including it: [12173, 40038] 

Token: bispectrum 
Entities including it: [12173] 

Token: estimation- 
Entities including it: [12173] 

Token: lukasiak 
Entities including it: [12174, 50741] 

Token: andtvonrosenvinge. 
Entities including it: [12174] 

Token: bosshard 
Entities including it: [12476] 

Token: rept. 
Entities including it: [12476, 52235] 

Token: xxiiird. 
Entities including it: [12476] 

Token: czechoslovakia 
Entities including it: [12476, 50861, 60976] 

Token: crustal 
Entities including it: [12476, 13130, 16323, 25331, 45569, 55914] 

Token: canary 
Entities including it: [12476, 37049, 45570, 62628] 

Token: ajk 
Entities including it: [12477] 

Token: lampeter 
Entities including it: [12477] 

Token: raised 
Entities including it: [12477, 35031, 47794, 50547] 

Token: nod 
Entities including it: [12477] 

Token: haase 
Entities including it: [12478, 20471, 39685, 41929, 51194] 

Token: sectorial 
Entities including it: [12478] 

Token: liskovâ?¦ 
Entities including it: [12479] 

Token: methodol. 
Entities including it: [12479] 

Token: 39 
Entities including it: [12479, 41431, 53923, 58015, 61173] 

Token: richters 
Entities including it: [12481, 59994] 

Token: heard: 
Entities including it: [12481, 25953, 5459


Token: paulik 
Entities including it: [12793] 

Token: gales 
Entities including it: [12793, 49756] 

Token: allometric 
Entities including it: [12793] 

Token: beverton 
Entities including it: [12793] 

Token: kaltofen 
Entities including it: [12795, 26968] 

Token: foxbox: 
Entities including it: [12795] 

Token: representation. 
Entities including it: [12795] 

Token: o. 
Entities including it: [12795, 16027, 16248, 28138, 30497, 31110, 32668, 32737, 40840, 42418, 45852, 46577, 47483, 56498, 62121, 63301, 63635] 

Token: gloor 
Entities including it: [12795, 14646, 28477, 40745] 

Token: lnside 
Entities including it: [12796] 

Token: page-answers 
Entities including it: [12798] 

Token: cortico-hippocampal 
Entities including it: [12799] 

Token: interplay 
Entities including it: [12799, 15626, 33188, 35948, 46330, 56955] 

Token: shawler 
Entities including it: [12800] 

Token: department's 
Entities including it: [12800, 53429] 

Token: suicidal 
Entities including it: [12800, 2

Token: graphs. 
Entities including it: [13124, 22088, 29001, 52129] 

Token: pub. 
Entities including it: [13124, 18711, 20522, 22088, 31061, 35960, 41284, 45899, 49901, 63874] 

Token: alm 
Entities including it: [13125, 18349, 27646, 46040] 

Token: (sac) 
Entities including it: [13125] 

Token: vahid 
Entities including it: [13126, 40364, 61282] 

Token: softening 
Entities including it: [13126] 

Token: culp 
Entities including it: [13127, 29826] 

Token: anarchy. 
Entities including it: [13127] 

Token: nitta 
Entities including it: [13128, 55838, 60108] 

Token: haradone 
Entities including it: [13128] 

Token: haradomeâ??temp 
Entities including it: [13128] 

Token: atur 
Entities including it: [13128] 

Token: depend 
Entities including it: [13128] 

Token: ece 
Entities including it: [13128, 19983, 48232, 49238, 55149] 

Token: resistivities 
Entities including it: [13128] 

Token: sn0-based 
Entities including it: [13128] 

Token: bailenson 
Entities including it: [13129] 

Token: valleau 
Entities including it: [13455] 

Token: ceperley 
Entities including it: [13455] 

Token: nrcc 
Entities including it: [13455] 

Token: coulombic 
Entities including it: [13455] 

Token: bhide 
Entities including it: [13456, 19223, 24049, 58107] 

Token: paddo 
Entities including it: [13457] 

Token: flavin 
Entities including it: [13458, 24503] 

Token: otn-based 
Entities including it: [13458] 

Token: minimalist 
Entities including it: [13459, 33702, 61840] 

Token: anaphora 
Entities including it: [13459, 30419, 35663, 48901] 

Token: holsapple 
Entities including it: [13460] 

Token: hendry 
Entities including it: [13461, 24589, 31846] 

Token: ymm 
Entities including it: [13461] 

Token: antar 
Entities including it: [13461] 

Token: cross-polarized 
Entities including it: [13461] 

Token: wordsworth-itp 
Entities including it: [13464] 

Token: janowski 
Entities including it: [13465] 

Token: kaven 
Entities including it: [13465, 51477] 

Token: postlethwaite 
En


Token: hpk 
Entities including it: [13800, 29177, 47353] 

Token: locak 
Entities including it: [13800] 

Token: behling 
Entities including it: [13801, 45937] 

Token: anastasi 
Entities including it: [13802] 

Token: conti 
Entities including it: [13802, 21214, 28146, 28365, 38138, 47987, 60611] 

Token: gregori 
Entities including it: [13802, 21214, 41899] 

Token: passarella 
Entities including it: [13802] 

Token: power-saving 
Entities including it: [13802] 

Token: polices 
Entities including it: [13802] 

Token: wi-fi 
Entities including it: [13802, 14062, 55567, 59217, 61958] 

Token: hotspots 
Entities including it: [13802, 60745] 

Token: tool-kit 
Entities including it: [13803, 16670, 61722, 63922] 

Token: anlauff 
Entities including it: [13806] 

Token: xasmâ??an 
Entities including it: [13806] 

Token: garcia-molina. 
Entities including it: [13807, 16402, 16737, 19788, 49027, 53265, 64053] 

Token: objectfusioninmediatorsystems 
Entities including it: [13807] 

Token: e

Entities including it: [14099, 36690] 

Token: georgiou 
Entities including it: [14100, 33798] 

Token: funakoshi 
Entities including it: [14101] 

Token: sikder 
Entities including it: [14101] 

Token: ebihara 
Entities including it: [14101] 

Token: xenopus 
Entities including it: [14101, 29512, 41243, 41674, 49508] 

Token: a1 
Entities including it: [14101, 32302] 

Token: cdc28 
Entities including it: [14101] 

Token: cell-cycle 
Entities including it: [14101] 

Token: kerker 
Entities including it: [14102, 42417] 

Token: kalaitzidis 
Entities including it: [14103] 

Token: papazisimou 
Entities including it: [14103] 

Token: christanis 
Entities including it: [14103] 

Token: thint. 
Entities including it: [14103, 14892, 15362, 24654, 28750, 42430, 46381, 53099] 

Token: xxxiv 
Entities including it: [14103] 

Token: graikas 
Entities including it: [14103] 

Token: lignite 
Entities including it: [14103] 

Token: peloponnese 
Entities including it: [14103] 

Token: hyperstormâ??

Token: multivibrator 
Entities including it: [14417] 

Token: 4(cd-rom 
Entities including it: [14418] 

Token: c2001 
Entities including it: [14418, 25027, 44569] 

Token: negated 
Entities including it: [14419] 

Token: novembre 
Entities including it: [14420] 

Token: eiffel: 
Entities including it: [14420, 59596] 

Token: feigned 
Entities including it: [14422] 

Token: unfeigned 
Entities including it: [14422] 

Token: lying 
Entities including it: [14422, 33779, 36589, 42097, 45989] 

Token: demaindreville 
Entities including it: [14423, 61902] 

Token: hajek 
Entities including it: [14424, 16021, 26657, 48678, 59885, 62271] 

Token: andt. 
Entities including it: [14427, 19267, 42827, 56620, 58375, 63871] 

Token: milo. 
Entities including it: [14427] 

Token: clarberg 
Entities including it: [14428] 

Token: jarosz 
Entities including it: [14428, 20669, 22360, 30004] 

Token: akenine-mã¶ller 
Entities including it: [14428] 

Token: sampling: 
Entities including it: [14428, 17021

Entities including it: [14751] 

Token: zielinski 
Entities including it: [14751, 21422] 

Token: ashmore 
Entities including it: [14751, 49804] 

Token: krewski 
Entities including it: [14751] 

Token: biologically 
Entities including it: [14751, 17512, 24333, 43306, 53067, 59170] 

Token: cohort 
Entities including it: [14751, 22468, 32592, 40865, 56322, 57692] 

Token: teufel 
Entities including it: [14752, 51695, 51695] 

Token: permeability 
Entities including it: [14752, 24866, 27406, 30455, 58945, 59353, 60590] 

Token: msr 
Entities including it: [14753, 18589] 

Token: tr-98-35 
Entities including it: [14753] 

Token: morishima 
Entities including it: [14754, 15589, 32849, 35685, 42545, 60338, 60340, 64881, 65197, 65495] 

Token: correlate 
Entities including it: [14754, 15463] 

Token: vimentin 
Entities including it: [14754, 39871, 44413] 

Token: craggs 
Entities including it: [14755, 23126] 

Token: isotopies 
Entities including it: [14755] 

Token: 3-manifold 
Entities in

Token: sears 
Entities including it: [15070, 28134, 35203, 37960, 44968, 47254, 49765, 51495, 57916, 63174] 

Token: opac 
Entities including it: [15071, 60855] 

Token: postula 
Entities including it: [15072] 

Token: 25&#39;h 
Entities including it: [15072] 

Token: lozwiak&quot; 
Entities including it: [15072] 

Token: xor 
Entities including it: [15072, 53202] 

Token: sig 
Entities including it: [15074, 26226, 26467, 34158, 51849] 

Token: institutionalization 
Entities including it: [15076, 27268, 44533] 

Token: imcss 
Entities including it: [15078] 

Token: unbalanced-magnetic-pull 
Entities including it: [15078] 

Token: machines-its 
Entities including it: [15078] 

Token: leelaprute 
Entities including it: [15079] 

Token: dreyer 
Entities including it: [15080, 22506, 31139, 37434, 42750, 50104, 65177, 65944] 

Token: well-founded 
Entities including it: [15080, 34024] 

Token: cornfeld 
Entities including it: [15130] 

Token: fomin 
Entities including it: [15130, 38320] 

Entities including it: [15386, 17528, 17953, 36131, 44419] 

Token: piecewise-affine 
Entities including it: [15386] 

Token: scot 
Entities including it: [15387] 

Token: shepherd. 
Entities including it: [15387] 

Token: maria 
Entities including it: [15388, 15800, 17966] 

Token: favrat 
Entities including it: [15390, 43354] 

Token: tomiyama 
Entities including it: [15390, 18426] 

Token: ishitani 
Entities including it: [15390] 

Token: gnedenko 
Entities including it: [15391, 30618] 

Token: korolev 
Entities including it: [15391, 29510] 

Token: summation: 
Entities including it: [15391] 

Token: lecullier 
Entities including it: [15393] 

Token: chanin 
Entities including it: [15393] 

Token: fabry-perot 
Entities including it: [15393] 

Token: 50-1000 
Entities including it: [15393] 

Token: communal 
Entities including it: [15394, 49003] 

Token: regis 
Entities including it: [15396] 

Token: 1996; 
Entities including it: [15396, 26794, 34677, 61789] 

Token: 325pp 
Entities 

Token: tactics: 
Entities including it: [15704] 

Token: florek 
Entities including it: [15705] 

Token: permittivity 
Entities including it: [15705, 45879, 62832] 

Token: drying 
Entities including it: [15705, 56511, 64127] 

Token: turkiewicz 
Entities including it: [15707, 21743] 

Token: schairer 
Entities including it: [15707, 21743] 

Token: all-optical 
Entities including it: [15707, 21743, 28334, 33064, 34456, 36632, 38160, 39098, 62097, 62218] 

Token: otdm 
Entities including it: [15707, 21743] 

Token: add-drop 
Entities including it: [15707, 21743] 

Token: gbit/s 
Entities including it: [15707, 21743, 26999, 29759, 29759, 29759, 56003] 

Token: agw 
Entities including it: [15708, 28237, 59060] 

Token: crystallographic 
Entities including it: [15708, 17241, 26873, 29427, 42616, 62775] 

Token: hubka 
Entities including it: [15710] 

Token: jost 
Entities including it: [15711, 25412, 25676, 27731] 

Token: benevolent 
Entities including it: [15711, 54960, 62162] 

Token: s

Token: half-site 
Entities including it: [16001] 

Token: peroxisome 
Entities including it: [16001, 29068, 31429] 

Token: proliferator- 
Entities including it: [16001, 29068] 

Token: baranovâ?¥ 
Entities including it: [16002] 

Token: vollkopf 
Entities including it: [16002] 

Token: emmons 
Entities including it: [16003, 22579, 25108, 32385, 33256, 33976, 42009, 54659, 54996] 

Token: 2003-04 
Entities including it: [16003, 41541] 

Token: prolapse 
Entities including it: [16004, 18116, 53290, 56354, 61008] 

Token: mdrc 
Entities including it: [16005] 

Token: brief. 
Entities including it: [16005] 

Token: in-home 
Entities including it: [16006] 

Token: voeckler 
Entities including it: [16007, 28740] 

Token: pralle 
Entities including it: [16007, 28740, 49249, 50435] 

Token: mikel 
Entities including it: [16008] 

Token: tompa. 
Entities including it: [16012, 21980, 36062] 

Token: unifypow: 
Entities including it: [16013] 

Token: sample-size 
Entities including it: [16013] 


Token: perils. 
Entities including it: [16294] 

Token: jaenker 
Entities including it: [16295, 43461] 

Token: kloeppel 
Entities including it: [16295] 

Token: hermle 
Entities including it: [16295, 28702, 43461, 58498] 

Token: steering 
Entities including it: [16297, 20479, 21817, 23348, 24705, 31597, 38036, 40651, 48156, 50831, 51789, 54334, 55645, 59758] 

Token: controllable 
Entities including it: [16297, 52743, 63932] 

Token: escrow&quot; 
Entities including it: [16298] 

Token: self-assembled 
Entities including it: [16299, 52873] 

Token: thiols 
Entities including it: [16299] 

Token: tsiatsis 
Entities including it: [16300] 

Token: finke 
Entities including it: [16301, 32583, 50322, 59011] 

Token: waibel 
Entities including it: [16301, 32583, 38582, 45015, 54860, 59011] 

Token: cdh 
Entities including it: [16301] 

Token: hme 
Entities including it: [16301] 

Token: icassp 
Entities including it: [16301, 24343, 55715] 

Token: polyphone 
Entities including it: [16301, 


Token: goâ??hare 
Entities including it: [16664, 39809] 

Token: bolivia 
Entities including it: [16664, 56783] 

Token: lyndon 
Entities including it: [16665, 36889] 

Token: strohbehn 
Entities including it: [16666, 31079] 

Token: henning 
Entities including it: [16667, 19434, 26448, 40647, 43852, 47096] 

Token: spec2000: 
Entities including it: [16667] 

Token: cpu 
Entities including it: [16667, 18959, 22600, 31040, 40858, 50265, 65623] 

Token: millenium 
Entities including it: [16667, 57904, 62533] 

Token: self-stabilizing 
Entities including it: [16669] 

Token: ops 
Entities including it: [16669, 46384] 

Token: dorne 
Entities including it: [16670, 33302] 

Token: voudouris 
Entities including it: [16670, 18249, 24667, 32456, 33009, 33302, 36980, 53702, 63007] 

Token: liret 
Entities including it: [16670] 

Token: ladde 
Entities including it: [16670, 33302] 

Token: ischeduleâ??an 
Entities including it: [16670] 

Token: capra 
Entities including it: [16672, 35031, 36172

Entities including it: [16976] 

Token: domain-independent 
Entities including it: [16977, 18155] 

Token: overview. 
Entities including it: [16978, 27817, 30778, 33367, 50668] 

Token: kolokol'tsov 
Entities including it: [16979, 43592] 

Token: separative 
Entities including it: [16979, 16979] 

Token: cascades 
Entities including it: [16979, 19344, 31810, 40942, 41213, 43592, 48222, 52463, 59441] 

Token: enrichments 
Entities including it: [16979] 

Token: srihari 
Entities including it: [16980, 27091, 29160, 51642, 57056, 65371] 

Token: macnaughton 
Entities including it: [16980, 63804, 65371, 65809, 66157] 

Token: fusion: 
Entities including it: [16980, 65371] 

Token: shared-disk 
Entities including it: [16980, 65371] 

Token: theâ?? 
Entities including it: [16981, 20785, 38513] 

Token: miniworkshop 
Entities including it: [16981] 

Token: mixingsâ?? 
Entities including it: [16981] 

Token: h&amp;tad 
Entities including it: [16982] 

Token: inapproximability 
Entities includi

Entities including it: [17279] 

Token: ninomiya 
Entities including it: [17280, 45138] 

Token: ipec-tokyo 
Entities including it: [17280] 

Token: switching-mode 
Entities including it: [17280] 

Token: random-switching 
Entities including it: [17280] 

Token: cozman 
Entities including it: [17281] 

Token: walley 
Entities including it: [17281, 33726, 61450] 

Token: graphoid 
Entities including it: [17281] 

Token: irrelevance 
Entities including it: [17281] 

Token: 50456 
Entities including it: [17282] 

Token: 2004); 
Entities including it: [17282] 

Token: 69 
Entities including it: [17282, 44578, 45142, 54988] 

Token: 59285 
Entities including it: [17282] 

Token: 2004). 
Entities including it: [17282] 

Token: sec 
Entities including it: [17282, 17939, 27907, 34986, 45661, 50707] 

Token: approved 
Entities including it: [17282] 

Token: amendments 
Entities including it: [17282, 50707, 51057, 53543] 

Token: nyse 
Entities including it: [17282, 23247, 40497, 54410, 54611, 6

Entities including it: [17588] 

Token: arterioles 
Entities including it: [17588] 

Token: anastomoses 
Entities including it: [17588] 

Token: sponges 
Entities including it: [17588, 40898, 43025, 46452, 59671] 

Token: nosanchuk 
Entities including it: [17589] 

Token: up? 
Entities including it: [17589, 24025, 39528] 

Token: calibrating 
Entities including it: [17589, 17892, 32680, 39316, 45907, 51513, 53569, 58037, 62302, 66063] 

Token: peck 
Entities including it: [17590, 21686, 22891, 25606, 25781, 47397, 48857, 49583, 55236, 59756] 

Token: triplett 
Entities including it: [17590] 

Token: dinh-zarr 
Entities including it: [17591] 

Token: diguiseppi 
Entities including it: [17591] 

Token: heitman 
Entities including it: [17591, 44127] 

Token: drinkers 
Entities including it: [17591] 

Token: trepagnier 
Entities including it: [17593] 

Token: cottrell 
Entities including it: [17594, 39968, 53599] 

Token: portevin--le 
Entities including it: [17594] 

Token: chatelier 
Ent


Token: report-tr02-hpng-031001 
Entities including it: [17949] 

Token: buffers 
Entities including it: [17949, 28402, 36942, 40151, 40168, 45890, 58004, 65991, 66799] 

Token: kowal 
Entities including it: [17950, 47949] 

Token: vanpee 
Entities including it: [17951, 53447] 

Token: delgrange 
Entities including it: [17951] 

Token: gillet 
Entities including it: [17951, 53447] 

Token: donckier 
Entities including it: [17951] 

Token: antacid 
Entities including it: [17951] 

Token: tablets 
Entities including it: [17951] 

Token: (rennie) 
Entities including it: [17951] 

Token: supervisor. 
Entities including it: [17952] 

Token: dstc-tr-9840 
Entities including it: [17954] 

Token: dstc. 
Entities including it: [17954] 

Token: edu. 
Entities including it: [17954] 

Token: au/ 
Entities including it: [17954] 

Token: flowback: 
Entities including it: [17954, 25897, 36215, 37447, 65494] 

Token: braz. 
Entities including it: [17955, 18712, 32978, 41377, 54264] 

Token: woodsâ?¦ 


Token: chernett 
Entities including it: [18249] 

Token: montemayor 
Entities including it: [18250] 

Token: makowski 
Entities including it: [18251] 

Token: knickmeyer 
Entities including it: [18252] 

Token: sudarshan\\ 
Entities including it: [18253] 

Token: system&quot; 
Entities including it: [18253, 25547, 29009, 37996, 41905, 42133, 43197, 45174, 46005, 46770, 51415, 52990, 53900, 56109] 

Token: thetheoryofprobabilistic 
Entities including it: [18254] 

Token: aldezabal 
Entities including it: [18255] 

Token: aranzabe 
Entities including it: [18255] 

Token: gojenola 
Entities including it: [18255] 

Token: sarasola 
Entities including it: [18255] 

Token: argument/adjunct 
Entities including it: [18255] 

Token: basque 
Entities including it: [18255, 33368, 40009] 

Token: so'modhrain 
Entities including it: [18256] 

Token: goâ??designing 
Entities including it: [18256] 

Token: hand-held 
Entities including it: [18256, 41385, 54059] 

Token: sang 
Entities including it: [


Token: supersonic 
Entities including it: [18656, 22536, 32762, 39254, 43635, 50962, 55252, 61244, 63274] 

Token: student-t 
Entities including it: [18657, 57374] 

Token: sekiguichi 
Entities including it: [18658] 

Token: sifting 
Entities including it: [18658, 23365, 53391, 60114, 63596] 

Token: dusty 
Entities including it: [18658, 45238] 

Token: corridors 
Entities including it: [18658] 

Token: software-strange 
Entities including it: [18659] 

Token: bedfellows? 
Entities including it: [18659] 

Token: knaves 
Entities including it: [18660] 

Token: hgj 
Entities including it: [18661] 

Token: tikunov 
Entities including it: [18662] 

Token: averaged 
Entities including it: [18662] 

Token: density: 
Entities including it: [18662] 

Token: axtell 
Entities including it: [18663] 

Token: aamas2002 
Entities including it: [18663] 

Token: non-cooperative 
Entities including it: [18663, 30491] 

Token: latency-recency 
Entities including it: [18664, 66507] 

Token: rachel 
Enti

Token: zmmersive 
Entities including it: [18992] 

Token: driving-related 
Entities including it: [18992] 

Token: tendulkar 
Entities including it: [18994] 

Token: povertyâ?? 
Entities including it: [18994] 

Token: merchant 
Entities including it: [18995, 48655, 59132, 59699, 63800] 

Token: (&gt; 
Entities including it: [18995, 49589, 64089] 

Token: v) 
Entities including it: [18995, 56816, 64089] 

Token: soi 
Entities including it: [18995, 23983, 27965, 29109, 36810, 38947, 45118, 61575] 

Token: basolo 
Entities including it: [18997, 35167] 

Token: ibers 
Entities including it: [18997, 35167] 

Token: gallium-arsenide 
Entities including it: [18998] 

Token: eis 
Entities including it: [18999, 25478, 49419] 

Token: weighting: 
Entities including it: [19000] 

Token: suppression: 
Entities including it: [19001, 27096, 32029] 

Token: ofwireless 
Entities including it: [19005] 

Token: è?? 
Entities including it: [19006] 

Token: kallahalla 
Entities including it: [19008] 

Tok

Token: unit.â?? 
Entities including it: [19358] 

Token: grosswald 
Entities including it: [19359] 

Token: shelf 
Entities including it: [19359, 38859] 

Token: russo 
Entities including it: [19360, 21579, 31852, 40210, 44885, 51714, 55507, 62621] 

Token: kã¼spert 
Entities including it: [19363] 

Token: gã¼nauer 
Entities including it: [19363] 

Token: nik 
Entities including it: [19364, 60687] 

Token: czer 
Entities including it: [19364] 

Token: intersoc. 
Entities including it: [19364] 

Token: convers. 
Entities including it: [19364] 

Token: s.; 
Entities including it: [19364, 27364, 34246] 

Token: j.; 
Entities including it: [19364] 

Token: iebold 
Entities including it: [19364] 

Token: pyrolysis 
Entities including it: [19364] 

Token: stamper 
Entities including it: [19365] 

Token: turtles 
Entities including it: [19365] 

Token: winner 
Entities including it: [19366, 45554, 57205] 

Token: gifted 
Entities including it: [19366] 

Token: realities. 
Entities including i

Token: hubball 
Entities including it: [19719] 

Token: swinnerton 
Entities including it: [19720] 

Token: burdorf 
Entities including it: [19722] 

Token: kathy 
Entities including it: [19723, 27770] 

Token: zeidenstein 
Entities including it: [19723, 64629] 

Token: sarasvathy 
Entities including it: [19724] 

Token: causation 
Entities including it: [19724, 47898] 

Token: effectuation: 
Entities including it: [19724] 

Token: inevitability 
Entities including it: [19724] 

Token: gurrin 
Entities including it: [19725] 

Token: fã­schlã¡r@ 
Entities including it: [19725] 

Token: trecvid2003: 
Entities including it: [19725] 

Token: donnerstein 
Entities including it: [19726, 19726, 32092, 32092] 

Token: ditrichs 
Entities including it: [19726] 

Token: interracial 
Entities including it: [19726] 

Token: retaliation 
Entities including it: [19726, 23781] 

Token: riot 
Entities including it: [19726] 

Token: abatement 
Entities including it: [19729, 46875, 55562, 61007] 

Token:

Entities including it: [20044] 

Token: ramalingam 
Entities including it: [20044] 

Token: aamodt 
Entities including it: [20045] 

Token: case-specific 
Entities including it: [20045] 

Token: rapoport 
Entities including it: [20047, 45901, 50608] 

Token: synthesis: 
Entities including it: [20047, 29744, 61396] 

Token: orjundamental 
Entities including it: [20047] 

Token: pogge 
Entities including it: [20049] 

Token: developer's 
Entities including it: [20049, 24498, 28262, 43607, 47938, 51648, 55328] 

Token: garnet: 
Entities including it: [20050] 

Token: demonstrational 
Entities including it: [20050, 33551] 

Token: l'art 
Entities including it: [20051] 

Token: holzbecher 
Entities including it: [20052] 

Token: hf; 
Entities including it: [20052] 

Token: lituanas 
Entities including it: [20053] 

Token: renandya 
Entities including it: [20053] 

Token: extensive 
Entities including it: [20053, 30558, 32445, 35503, 49767, 51854] 

Token: philippines 
Entities including it:


Token: konigsfeld 
Entities including it: [20396] 

Token: milliron 
Entities including it: [20396] 

Token: floorplanning 
Entities including it: [20396, 53117] 

Token: smillie 
Entities including it: [20397, 51073] 

Token: hibernian 
Entities including it: [20397] 

Token: fever: 
Entities including it: [20397, 33440, 50619] 

Token: 14-year 
Entities including it: [20397] 

Token: waking 
Entities including it: [20398, 31413] 

Token: ashkenazi 
Entities including it: [20399] 

Token: idar 
Entities including it: [20399] 

Token: zt 
Entities including it: [20399, 31070] 

Token: handzel 
Entities including it: [20399] 

Token: ofarim 
Entities including it: [20399] 

Token: in-vitro 
Entities including it: [20399] 

Token: coeliac 
Entities including it: [20399, 47740] 

Token: skolmoski 
Entities including it: [20400] 

Token: clamping: 
Entities including it: [20400] 

Token: antialiasing 
Entities including it: [20400] 

Token: oller 
Entities including it: [20402] 

Token: m

Entities including it: [20739] 

Token: sakkoula 
Entities including it: [20740] 

Token: athens- 
Entities including it: [20740] 

Token: (university 
Entities including it: [20743, 24272, 31959, 36885, 61243] 

Token: technology? 
Entities including it: [20744] 

Token: case-bases 
Entities including it: [20745] 

Token: 588pp 
Entities including it: [20746] 

Token: paleoseismology: 
Entities including it: [20746] 

Token: hellums 
Entities including it: [20747] 

Token: asf: 
Entities including it: [20747] 

Token: svaerdson 
Entities including it: [20748] 

Token: salmonidae 
Entities including it: [20748] 

Token: obligations. 
Entities including it: [20750] 

Token: leveridge 
Entities including it: [20751] 

Token: campusworld 
Entities including it: [20751] 

Token: eisenberger 
Entities including it: [20753, 30041] 

Token: cooperative-versus-individual 
Entities including it: [20753] 

Token: luhtanen 
Entities including it: [20754, 54965] 

Token: ingroup 
Entities includin


Token: mudhar 
Entities including it: [21123] 

Token: allergy. 
Entities including it: [21124] 

Token: andv 
Entities including it: [21126, 49704, 62824] 

Token: leungâ??mobility-basedpredictive 
Entities including it: [21126] 

Token: andbandwidth 
Entities including it: [21126] 

Token: crook 
Entities including it: [21127, 24333, 44402] 

Token: nuseibeh 
Entities including it: [21127, 24019, 29896, 44402, 51518, 52324] 

Token: bairlein 
Entities including it: [21128] 

Token: nahrungswahl 
Entities including it: [21128] 

Token: gartengrasmã¼cke 
Entities including it: [21128] 

Token: sylvia 
Entities including it: [21128, 23743, 61391] 

Token: borin: 
Entities including it: [21128] 

Token: beitrag 
Entities including it: [21128, 27514] 

Token: bedeutung 
Entities including it: [21128, 21190] 

Token: frugivorie 
Entities including it: [21128] 

Token: &amp;deployment 
Entities including it: [21129] 

Token: percentile 
Entities including it: [21130, 23013, 33008] 

Token:

Entities including it: [21444] 

Token: burdelski 
Entities including it: [21444] 

Token: broelsch 
Entities including it: [21444] 

Token: tenure 
Entities including it: [21445, 23815, 45800, 51057, 53943, 62301] 

Token: conflict. 
Entities including it: [21445, 26921, 30251, 30332] 

Token: swiftly 
Entities including it: [21446] 

Token: akg 
Entities including it: [21447] 

Token: nearline 
Entities including it: [21447] 

Token: markov-chain 
Entities including it: [21447, 22611, 59196, 61313, 66178, 66693] 

Token: shouxiang 
Entities including it: [21448] 

Token: guoqing 
Entities including it: [21448] 

Token: wenzhong 
Entities including it: [21448] 

Token: hillman 
Entities including it: [21449, 23284] 

Token: dalziel 
Entities including it: [21449] 

Token: (4) 
Entities including it: [21450, 28168, 28813, 29142, 42410, 62273] 

Token: bruggeman 
Entities including it: [21452] 

Token: hiv-transgenic 
Entities including it: [21452] 

Token: kurzynski 
Entities including

Entities including it: [21819, 36525] 

Token: zollinger-ellison 
Entities including it: [21819] 

Token: servi 
Entities including it: [21820, 57918] 

Token: atzemi 
Entities including it: [21820] 

Token: mythology 
Entities including it: [21820] 

Token: pastine 
Entities including it: [21821, 21821] 

Token: externalities: 
Entities including it: [21821] 

Token: transmutation 
Entities including it: [21822, 27942, 50904, 59759] 

Token: parlin 
Entities including it: [21823] 

Token: butz 
Entities including it: [21825, 30412, 30515, 35309] 

Token: hã¶llerer 
Entities including it: [21825] 

Token: iwar 
Entities including it: [21825] 

Token: eco-activist 
Entities including it: [21826] 

Token: pã¸ibyl 
Entities including it: [21828] 

Token: poã¨ã­taã¨ovã© 
Entities including it: [21828] 

Token: viry 
Entities including it: [21828] 

Token: prahu 
Entities including it: [21828] 

Token: milã©nia 
Entities including it: [21828] 

Token: donadio 
Entities including it: [21830,


Token: mariotti 
Entities including it: [22189] 

Token: pme. 
Entities including it: [22189, 26081] 

Token: utrecht 
Entities including it: [22189, 26081, 26661] 

Token: break? 
Entities including it: [22189] 

Token: singularly 
Entities including it: [22190, 49209] 

Token: hyperbloic 
Entities including it: [22190] 

Token: jasanoff 
Entities including it: [22192, 39654] 

Token: precaution 
Entities including it: [22192] 

Token: poorly 
Entities including it: [22193] 

Token: state-dependent 
Entities including it: [22194, 32853, 36163] 

Token: tort 
Entities including it: [22194] 

Token: foucaut 
Entities including it: [22195, 36740] 

Token: remora 
Entities including it: [22195] 

Token: competitve 
Entities including it: [22196] 

Token: kyro 
Entities including it: [22197] 

Token: taalas 
Entities including it: [22197] 

Token: winter/early 
Entities including it: [22197] 

Token: stratosphere 
Entities including it: [22197] 

Token: alles 
Entities including it: [2219

Entities including it: [22607, 49207] 

Token: neibaur 
Entities including it: [22608] 

Token: kraiss 
Entities including it: [22611, 26022, 45046, 48914, 59196, 61313, 64273, 66178, 66693] 

Token: gllavata 
Entities including it: [22612] 

Token: ewerth 
Entities including it: [22612] 

Token: arm10 
Entities including it: [22613] 

Token: cores 
Entities including it: [22613, 24104, 30481, 35192, 51984] 

Token: turman 
Entities including it: [22614] 

Token: vercellotti 
Entities including it: [22614] 

Token: accommodation: 
Entities including it: [22614, 29732] 

Token: xenografting 
Entities including it: [22614] 

Token: galusinski 
Entities including it: [22615] 

Token: universitã© 
Entities including it: [22615, 43195, 53928, 55612] 

Token: bordeaux-i 
Entities including it: [22615] 

Token: stream-based 
Entities including it: [22616, 52008] 

Token: foâ« 
Entities including it: [22617] 

Token: brettschneider 
Entities including it: [22618] 

Token: lymphomas 
Entities i

Token: abolition 
Entities including it: [22967] 

Token: (1967; 
Entities including it: [22967] 

Token: newtonâ??s 
Entities including it: [22968, 31870] 

Token: birthday 
Entities including it: [22968] 

Token: binomial 
Entities including it: [22968, 28379, 35058, 54959] 

Token: meserole 
Entities including it: [22969] 

Token: hedges 
Entities including it: [22969, 51779, 55118, 60706, 62955] 

Token: â??comparison 
Entities including it: [22969] 

Token: carbon-carbon 
Entities including it: [22969] 

Token: jijkoun 
Entities including it: [22971] 

Token: frequently 
Entities including it: [22971, 28059, 43222, 53855, 55527, 62220, 66441] 

Token: asked 
Entities including it: [22971, 43222, 53855] 

Token: psees 
Entities including it: [22972] 

Token: slochower 
Entities including it: [22973] 

Token: externality 
Entities including it: [22973] 

Token: nonobese: 
Entities including it: [22973] 

Token: bating 
Entities including it: [22974] 

Token: swat&amp; 
Entities incl

Token: waterloo 
Entities including it: [23354, 48082] 

Token: mbb 
Entities including it: [23356, 39773, 61636] 

Token: magoldaâ?¦ 
Entities including it: [23356] 

Token: maturity: 
Entities including it: [23356, 31217] 

Token: worldviews 
Entities including it: [23356] 

Token: wsv 
Entities including it: [23358] 

Token: inter-switch 
Entities including it: [23358] 

Token: savaresi 
Entities including it: [23359] 

Token: bisecting 
Entities including it: [23359] 

Token: pddp: 
Entities including it: [23359] 

Token: mos-controlled 
Entities including it: [23360] 

Token: thyristor 
Entities including it: [23360, 33866] 

Token: furugren 
Entities including it: [23361] 

Token: smartbo 
Entities including it: [23361] 

Token: korten 
Entities including it: [23362] 

Token: posphieszczyk 
Entities including it: [23362] 

Token: rotationally 
Entities including it: [23363] 

Token: evermann 
Entities including it: [23364, 49756] 

Token: odell 
Entities including it: [23364, 336


Token: johannessen 
Entities including it: [23713, 29063] 

Token: shalina 
Entities including it: [23713] 

Token: kuzmina 
Entities including it: [23713] 

Token: steane 
Entities including it: [23714] 

Token: bits: 
Entities including it: [23714] 

Token: roussel 
Entities including it: [23715] 

Token: teleconviviality 
Entities including it: [23715] 

Token: kalish 
Entities including it: [23716] 

Token: gynn 
Entities including it: [23716] 

Token: holly 
Entities including it: [23716] 

Token: bjorklun 
Entities including it: [23717] 

Token: torts 
Entities including it: [23717] 

Token: fights 
Entities including it: [23719, 47107] 

Token: beat 
Entities including it: [23719, 49848, 60713] 

Token: â??all 
Entities including it: [23719] 

Token: actionâ??image. 
Entities including it: [23719] 

Token: wischik 
Entities including it: [23720] 

Token: scaled-down 
Entities including it: [23720] 

Token: rholes 
Entities including it: [23722] 

Token: nelligan 
Entities inclu

Entities including it: [24064] 

Token: peyret 
Entities including it: [24065] 

Token: viviand 
Entities including it: [24065] 

Token: compressible 
Entities including it: [24065, 46390] 

Token: schã¶lkopf 
Entities including it: [24068] 

Token: 320--3510 
Entities including it: [24069] 

Token: eecs. 
Entities including it: [24070] 

Token: umich. 
Entities including it: [24070] 

Token: yuqing 
Entities including it: [24070] 

Token: imber: 
Entities including it: [24070] 

Token: protopapas 
Entities including it: [24071, 37295, 66069] 

Token: seery 
Entities including it: [24072] 

Token: aoki 
Entities including it: [24073, 30348, 32024, 33008, 33352, 33812, 36004, 40403, 42778, 44801, 45716, 51316, 52740, 54243, 55240, 56101, 66792] 

Token: calderã³n 
Entities including it: [24074, 25771] 

Token: jimã©nez-capdeville 
Entities including it: [24074] 

Token: modify 
Entities including it: [24074, 44332, 65235] 

Token: vonderscher 
Entities including it: [24076] 

Token: mei

Token: scrollbar-based 
Entities including it: [24444] 

Token: learntec 
Entities including it: [24445] 

Token: projekt 
Entities including it: [24445, 27588, 54927] 

Token: universitã¤ten 
Entities including it: [24445] 

Token: mannheim 
Entities including it: [24445, 47714, 54605, 55945] 

Token: heidelberg. 
Entities including it: [24445] 

Token: tr-4l8 
Entities including it: [24446] 

Token: dataflow/von 
Entities including it: [24446] 

Token: ar-chitecture 
Entities including it: [24446] 

Token: fradkin 
Entities including it: [24447] 

Token: gitman 
Entities including it: [24447] 

Token: electrodynamics: 
Entities including it: [24447] 

Token: al-waili 
Entities including it: [24448] 

Token: electrolytes 
Entities including it: [24448, 54512] 

Token: osmolality 
Entities including it: [24448] 

Token: bogers 
Entities including it: [24453] 

Token: eurotransplant 
Entities including it: [24453] 

Token: procedureâ??master 
Entities including it: [24453] 

Token: draw

KeyboardInterrupt: 

# Question B
Compute all the possible comparisons that shall be made to resolve the duplicates within the blocks that were created in Step A. After the computation, please print the final calculated number of comparisons.

## As a token may appear more than once in a row, the same entity id may have been appended several times to some keys. These duplicates inflate the number of comparisons per block.

In [67]:
#empty list that will hold the number of comparisons per block
number_of_comp_per_token = []
#take each block
for values in kv_pairs.values():
    #count the number of entities it includes
    n = len(values)
    #count the pairwise comparisons for the n entities of the block: n*(n-1)/2
    #(integer division keeps the count an exact integer)
    comparisons = n*(n-1)//2
    #add the number to the list
    number_of_comp_per_token.append(comparisons)
#sum the list for the total number of comparisons
total_number_of_comp = sum(number_of_comp_per_token)

In [66]:
print('The number of total comparisons is:',int(total_number_of_comp))

The number of total comparisons is: 2644594399
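
As noted above, repeated ids inside a block inflate the count. The following is a minimal sketch of how that effect could be measured, assuming each block is de-duplicated with `set` before counting. `count_comparisons` and `sample_blocks` are hypothetical names; the three sample blocks are copied from the index printed in Question A.

```python
def count_comparisons(blocks, deduplicate=True):
    """Sum n*(n-1)/2 over all blocks, optionally ignoring repeated ids."""
    total = 0
    for ids in blocks.values():
        # a set keeps each entity id once per block
        n = len(set(ids)) if deduplicate else len(ids)
        total += n * (n - 1) // 2
    return total

# three blocks taken from the printed index; two of them contain repeated ids
sample_blocks = {
    "jost": [15711, 25412, 25676, 27731],
    "pastine": [21821, 21821],
    "donnerstein": [19726, 19726, 32092, 32092],
}

raw_count = count_comparisons(sample_blocks, deduplicate=False)
unique_count = count_comparisons(sample_blocks)
print("With repeated ids:", raw_count)
print("Without repeated ids:", unique_count)
```

On these three blocks the repeated ids account for 6 of the 13 counted comparisons, which suggests the total reported above is an upper bound on the distinct comparisons.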


# Question C
Create a Meta-Blocking graph of the block collection (created in step A) and using the CBS Weighting Scheme (i.e., Number of common blocks that entities in a specific comparison have in common) i) prune (delete) the edges that have weight < 2 ii) re-calculate the final number of comparisons (like in step B) of the new block collection that will be created after the edge pruning.

## We will use the Meta-Blocking method to reduce the number of repeated comparisons as well as the number of superfluous ones, and so optimize the blocking procedure.

## Firstly, we will create a dictionary where the keys will be the concatenated ids of the entities we compare and the values will be the number of common blocks each pair of entities has.

In [46]:
#create empty dictionary to store the entities - weights pairs
entities_weights_pairs = {}

## To do so, we iterate through each block, take the first entity in it and create all the concatenated pairs of it with the rest of the entities in the block. As we want the graph to be undirected, once all the pairs for that entity inside the block have been created, we delete the entity from the block and continue with the next entity (which now takes the first position in the block); this avoids creating reversed pairs within the same block. A pair may also already exist in the dictionary, in normal or reversed form, from an earlier block. We therefore also build the reversed key of each comparison and, if either form is already a key of the dictionary, we update that existing key instead of creating a new one.

## As the number of possible pairs is extremely high, we stop after processing 100 entities (counted across blocks, in block order) and use their pairs as a toy example.

In [47]:
total_entities = 0
stop = 0
#check all blocks
for block in kv_pairs.values():
    #take each entity in the block
    for entity in block:
        total_entities += 1
        if total_entities > 100:
            stop = 1
            break
        else:
            #check the length of the block as the following iteration would go out of index if it is <=1
            if len(block) == 1:
                break
            #iterate i times
            for i in range(0,len(block)-1):
                #create 1 key with the concatenation of the id of the entity we are checking 
                #and the rest of entities' ids in the block
                key1 = str(entity)+','+str(block[i+1])
                #create second key with the concatenation of the id of the entity we are checking 
                #and the rest of entities' ids in the block but in reverse
                key2 = str(block[i+1])+','+str(entity)
                #check if the key or the reverse key already exist in dictionary and create a key in it if not with value 1
                if key1 not in entities_weights_pairs.keys() and key2 not in entities_weights_pairs.keys():
                    entities_weights_pairs[key1] = [1,]
                #if the reverse key already exists then go and append 1 in the reverse key
                elif key2 in entities_weights_pairs.keys():
                    entities_weights_pairs[key2].append(1)
                #else append to the key the number 1
                else:
                    entities_weights_pairs[key1].append(1)
            #finally remove the entity from the block so that the next entity in line is chosen in the next iteration
            block.remove(entity)
        #stop checking further blocks once the limit of 100 entities has been reached
        if stop == 1:
            break
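
The pair-weighting step described above can also be sketched without string keys or reverse-key checks. This is an alternative under stated assumptions, not the cell above: each pair is stored as a sorted tuple so that `(a, b)` and `(b, a)` map to the same key, repeated ids inside a block are collapsed first, and the blocks are never modified while they are being iterated. `cbs_weights` and `toy_blocks` are hypothetical names with illustrative data.

```python
from collections import Counter
from itertools import combinations

def cbs_weights(blocks):
    """Count, for every pair of entities, the number of blocks they share."""
    weights = Counter()
    for ids in blocks.values():
        # sorted unique ids give one canonical (smaller, larger) tuple per pair
        unique_ids = sorted(set(ids))
        for pair in combinations(unique_ids, 2):
            weights[pair] += 1
    return weights

# illustrative blocks; entity 1 appears twice in block "c"
toy_blocks = {
    "a": [1, 2, 3],
    "b": [2, 3],
    "c": [3, 1, 1],
}

weights = cbs_weights(toy_blocks)
for pair, weight in sorted(weights.items()):
    print("Nodes:", pair, "Number of Common Blocks:", weight)
```

Using `itertools.combinations` on the sorted ids yields each unordered pair exactly once per block, so no entity has to be removed from the block to avoid reversed pairs.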

## We will then count the number of 1s for each key which will show us the weight of each pair.

In [48]:
#count number of 1s by summing them
for key,values in entities_weights_pairs.items():
    entities_weights_pairs[key] = sum(values) 

## Finally, we will prune the pairs that have weight less than 2.

In [56]:
entities_weights_pairs_final = {key: values for key, values in entities_weights_pairs.items() if values >= 2}

In [58]:
#print the key value pairs
for key,values in entities_weights_pairs_final.items():
    print('Nodes:',key,'\n' 'Number of Common Blocks:',values,'\n')

Nodes: 22275,15004 
Number of Common Blocks: 2 

Nodes: 43073,15004 
Number of Common Blocks: 2 

Nodes: 44908,15004 
Number of Common Blocks: 2 

Nodes: 49406,15004 
Number of Common Blocks: 2 

Nodes: 53149,15004 
Number of Common Blocks: 2 

Nodes: 58857,15004 
Number of Common Blocks: 2 

Nodes: 8,17069 
Number of Common Blocks: 2 

Nodes: 8,24512 
Number of Common Blocks: 2 

Nodes: 8,25955 
Number of Common Blocks: 2 

Nodes: 8,29089 
Number of Common Blocks: 3 

Nodes: 8,56669 
Number of Common Blocks: 2 

Nodes: 8,58053 
Number of Common Blocks: 2 

Nodes: 219,17069 
Number of Common Blocks: 2 

Nodes: 219,24512 
Number of Common Blocks: 2 

Nodes: 219,25955 
Number of Common Blocks: 2 

Nodes: 219,29089 
Number of Common Blocks: 3 

Nodes: 219,56669 
Number of Common Blocks: 2 

Nodes: 219,58053 
Number of Common Blocks: 2 

Nodes: 574,17069 
Number of Common Blocks: 2 

Nodes: 574,24512 
Number of Common Blocks: 2 

Nodes: 574,25955 
Number of Common Blocks: 2 

Nodes: 574,29

## We recalculate the number of comparisons as in task B. To do so, we sum the weights of all retained pairs. As only 100 entities were used to build the dictionary, we cannot calculate the exact number of comparisons for the whole collection, but the methodology remains the same.

In [70]:
number_of_comp_after_prune = 0
#sum the weights of all pairs
for values in entities_weights_pairs_final.values():
    number_of_comp_after_prune += values

In [71]:
print('The number of total comparisons is:',number_of_comp_after_prune)

The number of total comparisons is: 1104
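
A different reading of the pruned graph, offered here only as an assumption, is that every retained edge stands for exactly one comparison. Under that reading the count is the number of retained pairs rather than the sum of their weights. The sketch below contrasts the two quantities on illustrative data; `prune_edges` and `toy_weights` are hypothetical names.

```python
def prune_edges(weights, min_weight=2):
    """Keep only the pairs whose CBS weight is at least min_weight."""
    return {pair: w for pair, w in weights.items() if w >= min_weight}

# illustrative pair weights, not taken from the dataset
toy_weights = {(1, 2): 1, (1, 3): 2, (2, 3): 3}

retained = prune_edges(toy_weights)
edge_count = len(retained)            # one comparison per retained edge
weight_sum = sum(retained.values())   # the quantity summed in the cell above
print("Retained edges:", edge_count)
print("Sum of weights:", weight_sum)
```

The two numbers coincide only when every retained pair has weight 1, which cannot happen after pruning at weight 2.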


# Question D

Create a function that takes as input two entities and computes their Jaccard similarity based on the attribute title. You are not requested to perform any actual comparisons using this function.

## We assume that the dataset is tokenized per column, so that each attribute column (e.g. title) holds a list of tokens per row. A small dataset with dummy values is used for reference. The function takes the ids of the two entities as well as the dataset and calculates their Jaccard similarity per attribute by comparing the corresponding lists of tokens.

In [104]:
#create the dataset and fill it with dummy data
dataset = pd.DataFrame(columns=['id','attribute1','attribute2'])

In [105]:
dataset['id'] = [1,2]
dataset['attribute1'] = [['this','is','an','example'],['this','is','another','example']]
dataset['attribute2'] = [['1996'],['1997']]         

In [128]:
#function that calculates jaccard similarity
def Jaccard_similarity(x,y,data):
    k=0
    lista=[]
    #read each column of the dataset passed as argument
    for column in data.columns:
        #for all columns except the id column
        if column != 'id':
            #read per row
            for i in range(len(data)):
                #if the id column matches one of the given entities
                if data.id.iloc[i] == x or data.id.iloc[i] == y:
                    #add to a list the tokens corresponding to the column and row 
                    lista.append(data[column].iloc[i])
                    k+=1
                    #when k==2 both entities given have been added to the list
                    if k==2:
                        #calculate the intersection
                        intersection =  len([value for value in lista[0] if value in lista[1]])
                        #calculate the union
                        union = len([value for value in lista[0]]) + len([value for value in lista[1] if value not in lista[0]])
                        #calculate jaccard similarity
                        Jaccard = intersection/union
                        #reset the counter and the list in order to proceed to the next column in the next iteration
                        k=0
                        lista = []
                        #print the entities, and their jaccard similarities per column
                        print('The Jaccard similarity of entities',x,'and',y,'for the column',column,'is:',Jaccard)

In [129]:
#the jaccard similarities of our dummy entities
Jaccard_similarity(1,2,dataset)

The Jaccard similarity of entities 1 and 2 for the column attribute1 is: 0.6
The Jaccard similarity of entities 1 and 2 for the column attribute2 is: 0.0
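
Since the question asks for the similarity on the attribute title only, a more compact variant may be useful. The sketch below assumes that each entity is a mapping with a `title` entry holding its list of tokens (a plain dict here; a DataFrame row accessed by column name would behave the same way). `jaccard_title` and the two sample entities are hypothetical. Tokens are turned into sets, so a token repeated inside one title counts once.

```python
def jaccard_title(entity_a, entity_b):
    """Jaccard similarity of two entities based on their title tokens."""
    tokens_a = set(entity_a["title"])
    tokens_b = set(entity_b["title"])
    union = tokens_a | tokens_b
    # two empty titles share nothing, so their similarity is defined as 0 here
    if not union:
        return 0.0
    return len(tokens_a & tokens_b) / len(union)

# dummy entities reusing the token lists of the example above
entity_1 = {"id": 1, "title": ["this", "is", "an", "example"]}
entity_2 = {"id": 2, "title": ["this", "is", "another", "example"]}

score = jaccard_title(entity_1, entity_2)
print("Jaccard similarity on title:", score)
```

Returning the value instead of printing it lets the function be reused inside a comparison loop over the retained pairs.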
