analysis.jsonl
Erroneously retrieved wiki articles:
Claim: "Heavy Metal music was developed in the early 1970's."
Correct document: Heavy_metal_music
Found Documents:
English_possessive
Possessive
Contraction_-LRB-grammar-RRB-
Contraction_-LRB-operator_theory-RRB-
Heavy_metal
Backup Documents:
heavy_metal_-lrb-magazine-rrb-
heavy_metal_-lrb-wrestler-rrb-
heavy_metal_-lrb-g.i._joe-rrb-
heavy_metal_-lrb-film-rrb-
heavy_metal_-lrb-comics-rrb-
heavy_metal_-lrb-terminator-colon-_the_sarah_connor_chronicles-rrb-
Constituency Tree: (ROOT (S (NP (NNP Heavy) (NNP Metal) (NN music)) (VP (VBD was) (VP (VBN developed) (PP (IN in) (NP (NP (DT the) (JJ early)) (NP (CD 1970) (POS 's)))))) (. .)))
Dependency Graph: -> developed/VBN (root)
-> music/NN (nsubjpass)
-> Heavy/NNP (compound)
-> Metal/NNP (compound)
-> was/VBD (auxpass)
-> 1970/CD (nmod:'s)
-> in/IN (case)
-> the/DT (det)
-> early/JJ (amod)
-> 's/POS (case)
-> ./. (punct)
Claim: "2015 was the year when the principal photography of The Disaster Aritst (film) started."
Correct document: The_Disaster_Artist_-LRB-film-RRB-
Found Documents:
Backup Documents:
Constituency Tree: (ROOT (S (NP (CD 2015)) (VP (VBD was) (NP (NP (DT the) (NN year)) (SBAR (WHADVP (WRB when)) (S (NP (NP (DT the) (JJ principal) (NN photography)) (PP (IN of) (NP (NP (DT The) (NN Disaster) (NN Aritst)) (PRN (-LRB- -LRB-) (NP (NN film)) (-RRB- -RRB-))))) (VP (VBD started)))))) (. .)))
Dependency Graph: -> year/NN (root)
-> 2015/CD (nsubj)
-> was/VBD (cop)
-> the/DT (det)
-> when/WRB (dep)
-> photography/NN (dep)
-> the/DT (det)
-> principal/JJ (amod)
-> started/VBD (acl:of)
-> of/IN (mark)
-> The/DT (dep)
-> Aritst/NN (nsubj)
-> Disaster/NN (compound)
-> film/NN (appos)
-> -LRB-/-LRB- (punct)
-> -RRB-/-RRB- (punct)
-> ./. (punct)
Claim: "In 1971 Asylum Records the American record label was founded by David Geffen and his partner Elliot Roberts."
Correct document: Asylum_Records
Found Documents:
David_Geffen
Elliot_Roberts
Backup Documents:
Constituency Tree: (ROOT (S (PP (IN In) (NP (CD 1971))) (NP (NP (NNP Asylum) (NNPS Records)) (NP (DT the) (JJ American) (NN record) (NN label))) (VP (VBD was) (VP (VBN founded) (PP (IN by) (NP (NP (NNP David) (NNP Geffen)) (CC and) (NP (PRP$ his) (NN partner) (NNP Elliot) (NNP Roberts)))))) (. .)))
Dependency Graph: -> founded/VBN (root)
-> Records/NNPS (nmod:in)
-> In/IN (case)
-> 1971/CD (nummod)
-> Asylum/NNP (compound)
-> label/NN (nsubjpass)
-> the/DT (det)
-> American/JJ (amod)
-> record/NN (compound)
-> was/VBD (auxpass)
-> Geffen/NNP (nmod:agent)
-> by/IN (case)
-> David/NNP (compound)
-> and/CC (cc)
-> Roberts/NNP (conj:and)
-> his/PRP$ (nmod:poss)
-> partner/NN (compound)
-> Elliot/NNP (compound)
-> Roberts/NNP (nmod:agent)
-> ./. (punct)
Claim: "The Armenian Genocide was the killing of Armenians who were mostly Ottoman natives."
Correct document: Armenian_Genocide
Found Documents:
Armenians
The_Armenian_Genocide
Ottoman
Backup Documents:
the_armenian_genocide_-lrb-film-rrb-
ottoman_-lrb-textile-rrb-
ottoman_-lrb-furniture-rrb-
Constituency Tree: (ROOT (S (NP (DT The) (JJ Armenian) (NN Genocide)) (VP (VBD was) (NP (NP (DT the) (NN killing)) (PP (IN of) (NP (NNPS Armenians))) (SBAR (WHNP (WP who)) (S (VP (VBD were) (ADVP (RB mostly)) (NP (NNP Ottoman) (NNS natives))))))) (. .)))
Dependency Graph: -> killing/NN (root)
-> Genocide/NN (nsubj)
-> The/DT (det)
-> Armenian/JJ (amod)
-> was/VBD (cop)
-> the/DT (det)
-> Armenians/NNPS (nmod:of)
-> of/IN (case)
-> who/WP (ref)
-> natives/NNS (acl:relcl)
-> killing/NN (nsubj)
-> were/VBD (cop)
-> mostly/RB (advmod)
-> Ottoman/NNP (compound)
-> ./. (punct)
Claim: "The first inauguration of Bill Clinton made him the 50th President of the United States."
Correct document: First_inauguration_of_Bill_Clinton
Found Documents:
Bill_Clinton
President_of_the_United_States
Backup Documents:
president_of_the_united_states_-lrb-disambiguation-rrb-
Constituency Tree: (ROOT (S (NP (NP (DT The) (JJ first) (NN inauguration)) (PP (IN of) (NP (NNP Bill) (NNP Clinton)))) (VP (VBD made) (S (NP (PRP him)) (NP (NP (DT the) (JJ 50th) (NN President)) (PP (IN of) (NP (DT the) (NNP United) (NNPS States)))))) (. .)))
Dependency Graph: -> made/VBD (root)
-> inauguration/NN (nsubj)
-> The/DT (det)
-> first/JJ (amod)
-> Clinton/NNP (nmod:of)
-> of/IN (case)
-> Bill/NNP (compound)
-> President/NN (xcomp)
-> him/PRP (nsubj)
-> the/DT (det)
-> 50th/JJ (amod)
-> States/NNPS (nmod:of)
-> of/IN (case)
-> the/DT (det)
-> United/NNP (compound)
-> ./. (punct)
Claim: "The 14th Dalai Lama lives in Japan exclusively."
Correct document: 14th_Dalai_Lama
Found Documents:
Japan
Backup Documents:
japan_-lrb-japan_album-rrb-
japan_-lrb-disambiguation-rrb-
japan_-lrb-film-rrb-
japan_-lrb-1992_manga-rrb-
japan_-lrb-band-rrb-
japan_-lrb-1994_manga-rrb-
Constituency Tree: (ROOT (S (NP (DT The) (JJ 14th) (NNP Dalai) (NNP Lama)) (VP (VBZ lives) (PP (IN in) (NP (NNP Japan))) (ADVP (RB exclusively))) (. .)))
Dependency Graph: -> lives/VBZ (root)
-> Lama/NNP (nsubj)
-> The/DT (det)
-> 14th/JJ (amod)
-> Dalai/NNP (compound)
-> Japan/NNP (nmod:in)
-> in/IN (case)
-> exclusively/RB (advmod)
-> ./. (punct)
Claim: "The Mod Squad is a series that fits into the genre of crime drama."
Correct document: The_Mod_Squad
Found Documents:
The_MOD_Squad
Backup Documents:
the_mod_squad_-lrb-film-rrb-
Constituency Tree: (ROOT (S (NP (DT The) (NNP Mod) (NNP Squad)) (VP (VBZ is) (NP (NP (DT a) (NN series)) (SBAR (WHNP (WDT that)) (S (VP (VBZ fits) (PP (IN into) (NP (NP (DT the) (NN genre)) (PP (IN of) (NP (NN crime) (NN drama)))))))))) (. .)))
Dependency Graph: -> series/NN (root)
-> Squad/NNP (nsubj)
-> The/DT (det)
-> Mod/NNP (compound)
-> is/VBZ (cop)
-> a/DT (det)
-> that/WDT (ref)
-> fits/VBZ (acl:relcl)
-> series/NN (nsubj)
-> genre/NN (nmod:into)
-> into/IN (case)
-> the/DT (det)
-> drama/NN (nmod:of)
-> of/IN (case)
-> crime/NN (compound)
-> ./. (punct)
Claim: "The Saw franchise grossed under $873 million."
Correct document: Saw_-LRB-franchise-RRB-
Found Documents:
Television_network
Deductible
League
Passenger_rail_franchising_in_Great_Britain
Franchise_tag
Chain_store
Franchise_player
Dem_Franchize_Boyz
Franchising
Media_franchise
Cable_television
Suffrage
Franchise_Pictures
North_America
Sports_league
Backup Documents:
franchise_-lrb-short_story-rrb-
Constituency Tree: (ROOT (S (NP (DT The) (NNP Saw) (NN franchise)) (VP (VBN grossed) (PP (IN under) (NP (QP ($ $) (CD 873) (CD million))))) (. .)))
Dependency Graph: -> grossed/VBN (root)
-> franchise/NN (nsubj)
-> The/DT (det)
-> Saw/NNP (compound)
-> $/$ (nmod:under)
-> under/IN (case)
-> million/CD (nummod)
-> 873/CD (compound)
-> ./. (punct)
Claim: "The Faroe Islands are no longer part of the Kingdom of Mercia."
Correct document: Faroe_Islands
Found Documents:
Kingdom
Backup Documents:
kingdom_-lrb-song-rrb-
kingdom_-lrb-u.s._tv_series-rrb-
kingdom_-lrb-video_game-rrb-
kingdom_-lrb-biology-rrb-
kingdom_-lrb-director-rrb-
kingdom_-lrb-covenant_worship_album-rrb-
kingdom_-lrb-album-rrb-
kingdom_-lrb-uk_tv_series-rrb-
kingdom_-lrb-gorgon_city_album-rrb-
kingdom_-lrb-manga-rrb-
kingdom_-lrb-professional_wrestling-rrb-
kingdom_-lrb-comics-rrb-
kingdom_-lrb-ep-rrb-
kingdom_-lrb-magazine-rrb-
Constituency Tree: (ROOT (S (NP (DT The) (NNP Faroe) (NNPS Islands)) (VP (VBP are) (ADVP (RB no) (RB longer)) (NP (NP (NN part)) (PP (IN of) (NP (NP (DT the) (NNP Kingdom)) (PP (IN of) (NP (NNP Mercia))))))) (. .)))
Dependency Graph: -> part/NN (root)
-> Islands/NNPS (nsubj)
-> The/DT (det)
-> Faroe/NNP (compound)
-> are/VBP (cop)
-> longer/RB (advmod)
-> no/RB (neg)
-> Kingdom/NNP (nmod:of)
-> of/IN (case)
-> the/DT (det)
-> Mercia/NNP (nmod:of)
-> of/IN (case)
-> ./. (punct)
Claim: "Part of the Hindu Kush is in Brazil."
Correct document: Hindu_Kush
Found Documents:
Part
Brazil
Backup Documents:
part_-lrb-music-rrb-
brazil_-lrb-ep-rrb-
brazil_-lrb-bebi_dol_song-rrb-
brazil_-lrb-rosemary_clooney_album-rrb-
brazil_-lrb-surname-rrb-
brazil_-lrb-1944_film-rrb-
brazil_-lrb-1985_film-rrb-
brazil_-lrb-novel-rrb-
brazil_-lrb-men_at_work_album-rrb-
brazil_-lrb-disambiguation-rrb-
brazil_-lrb-band-rrb-
brazil_-lrb-the_ritchie_family_album-rrb-
brazil_-lrb-book-rrb-
Constituency Tree: (ROOT (S (NP (NP (NN Part)) (PP (IN of) (NP (DT the) (NNP Hindu) (NNP Kush)))) (VP (VBZ is) (PP (IN in) (NP (NNP Brazil)))) (. .)))
Dependency Graph: -> Brazil/NNP (root)
-> Part/NN (nsubj)
-> Kush/NNP (nmod:of)
-> of/IN (case)
-> the/DT (det)
-> Hindu/NNP (compound)
-> is/VBZ (cop)
-> in/IN (case)
-> ./. (punct)
Claim: "Most of Ripon College's student body die on campus."
Correct document: Ripon_College_-LRB-Wisconsin-RRB-
Found Documents:
Most
Backup Documents:
most_-lrb-satellite-rrb-
most_-lrb-unix-rrb-
most_-lrb-2003_film-rrb-
most_-lrb-most_district-rrb-
most_-lrb-surname-rrb-
Constituency Tree: (ROOT (S (NP (NP (JJS Most)) (PP (IN of) (NP (NP (NNP Ripon) (NNP College) (POS 's)) (NN student) (NN body)))) (VP (VB die) (PP (IN on) (NP (NN campus)))) (. .)))
Dependency Graph: -> die/VB (root)
-> Most/JJS (nsubj)
-> body/NN (nmod:of)
-> of/IN (case)
-> College/NNP (nmod:poss)
-> Ripon/NNP (compound)
-> 's/POS (case)
-> student/NN (compound)
-> campus/NN (nmod:on)
-> on/IN (case)
-> ./. (punct)
Claim: "Speech recognition incorporates knowledge and research into multiple fields."
Correct document: Speech_recognition
Found Documents:
Speech
Backup Documents:
speech_-lrb-rapper-rrb-
speech_-lrb-disambiguation-rrb-
speech_-lrb-album-rrb-
Constituency Tree: (ROOT (S (NP (NN Speech) (NN recognition)) (VP (VBZ incorporates) (NP (NN knowledge) (CC and) (NN research)) (PP (IN into) (NP (JJ multiple) (NNS fields)))) (. .)))
Dependency Graph: -> incorporates/VBZ (root)
-> recognition/NN (nsubj)
-> Speech/NN (compound)
-> knowledge/NN (dobj)
-> and/CC (cc)
-> research/NN (conj:and)
-> research/NN (dobj)
-> fields/NNS (nmod:into)
-> into/IN (case)
-> multiple/JJ (amod)
-> ./. (punct)
Correct documents found: 133/145
Total primary documents found: 306
Backup documents found: 810
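
The summary counts above can be turned into rates with a little arithmetic. The snippet below is a minimal sketch, not part of the original analysis script. It assumes that "Correct documents found: 133/145" means the gold page was retrieved for 133 of 145 claims, and that the primary and backup totals are summed over all 145 claims. The variable names are illustrative.

```python
# Counts copied from the summary lines of the log.
correct_found = 133   # claims whose correct document was retrieved (assumed meaning)
total_claims = 145
primary_docs = 306    # "Total primary documents found"
backup_docs = 810     # "Backup documents found"

# Document-level recall and the number of claims that were missed.
recall = correct_found / total_claims
missed = total_claims - correct_found

# Average candidate documents per claim (assumes totals span all claims).
avg_primary = primary_docs / total_claims
avg_backup = backup_docs / total_claims

print(f"recall: {recall:.1%} ({missed} missed)")
print(f"primary docs per claim: {avg_primary:.2f}")
print(f"backup docs per claim: {avg_backup:.2f}")
```

Under these assumptions the recall is about 91.7%, and the 12 missed claims match the 12 claims listed in the log.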