Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added synonyms to solr #6922

Open
wants to merge 4 commits into
base: master
Choose a base branch
from
Open
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
109 changes: 109 additions & 0 deletions conf/solr/conf/synonyms.txt
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,115 @@ fooaaa,baraaa,bazaaa
GB,gib,gigabyte,gigabytes
MB,mib,megabyte,megabytes
Television, Televisions, TV, TVs
Volume, Vol., Vol
&, and
1, One, I
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should test this one; this might cause false positives. Adding Vol. 1, Vol. I might be a more robust approach? But needs some testing. If Conail has some time, he should be able to load this up to run it!

2, Two, II
3, Three, III
4, Four, IV
5, Five, V
6, Six, VI
7, Seven, VII
8, Eight,VIII
9, Nine, IX
10, Ten, X
11, Eleven, XI
12, Twelve, XII
13, Thirteen, XIII
14, Fourteen, XIV
15, Fifteen, XV
16, Sixteen, XVI
17, Seventeen, XVII
18, Eigthteen, XVIII
19, Nineteen, XIX
20, Twenty, XX
21, Twenty One, XXI
22, Twenty Two, XXII
23, Twenty Three, XXIII
24, Twenty Four, XXIV
25, Twenty Five, XXV
26, Twenty Six, XXVI
27, Twenty Seven, XXVII
28, Twenty Eight, XXVIII
29, Twenty Nine, XXIX
30, Thirty, XXX
31, Thirty One, XXXI
32, Thirty Two, XXXII
33, Thirty Three, XXXIII
34, Thirty Four, XXXIV
35, Thirty Five, XXXV
36, Thirty Six, XXXVI
37, Thirty Seven, XXXVII
38, Thirty Eight, XXXVIII
39, Thirty Nine, XXXIX
40, Fourty, XXXX
bicolino34 marked this conversation as resolved.
Show resolved Hide resolved
41, Fourty One, XXXXI
42, Fourty Two, XXXXII
43, Fourty Three, XXXXIII
44, Fourty Four, XXXXIV
45, Fourty Five, XXXXV
46, Fourty Six, XXXXVI
47, Fourty Seven, XXXXVII
48, Fourty Eight, XXXXVIII
49, Fourty Nine, XXXXIX
50, Fifty, XXXXX
bicolino34 marked this conversation as resolved.
Show resolved Hide resolved
51, Fifty One, XXXXXI
52, Fifty Two, XXXXXII
53, Fifty Three, XXXXXIII
54, Fifty Four, XXXXXIV
55, Fifty Five, XXXXXV
56, Fifty Six, XXXXXVI
57, Fifty Seven, XXXXXVII
58, Fifty Eight, XXXXXVIII
59, Fifty Nine, XXXXXIX
60, Sixty, XXXXXX
61, Sixty One, XXXXXXI
62, Sixty Two, XXXXXXII
63, Sixty Three, XXXXXXIII
64, Sixty Four, XXXXXXIV
65, Sixty Five, XXXXXXV
66, Sixty Six, XXXXXXVI
67, Sixty Seven, XXXXXXVII
68, Sixty Eight, XXXXXXVIII
69, Sixty Nine, XXXXXXIX
70, Seventy, XXXXXXX
71, Seventy One, XXXXXXXI
72, Seventy Two, XXXXXXXII
73, Seventy Three, XXXXXXXIII
74, Seventy Four, XXXXXXXIV
75, Seventy Five, XXXXXXXV
76, Seventy Six, XXXXXXXVI
77, Seventy Seven, XXXXXXXVII
78, Seventy Eight, XXXXXXXVIII
79, Seventy Nine, XXXXXXXIX
80, Eighthy, XXXXXXXX
bicolino34 marked this conversation as resolved.
Show resolved Hide resolved
81, Eighthy One, XXXXXXXXI
82, Eighthy Two, XXXXXXXXII
83, Eighthy Three, XXXXXXXXIII
84, Eighthy Four, XXXXXXXXIV
85, Eighthy Five, XXXXXXXXV
86, Eighthy Six, XXXXXXXXVI
87, Eighthy Seven, XXXXXXXXVII
88, Eighthy Eight, XXXXXXXXVIII
89, Eighthy Nine, XXXXXXXXIX
90, Ninety, XXXXXXXXX
bicolino34 marked this conversation as resolved.
Show resolved Hide resolved
91, Ninety One, XXXXXXXXXI
92, Ninety Two, XXXXXXXXXII
93, Ninety Three, XXXXXXXXXIII
94, Ninety Four, XXXXXXXXXIV
95, Ninety Five, XXXXXXXXXV
96, Ninety Six, XXXXXXXXXVI
97, Ninety Seven, XXXXXXXXXVII
98, Ninety Eight, XXXXXXXXXVIII
99, Ninety Nine, XXXXXXXXXIX
100, One Hundred, C
#Ukrainian synonyms
навч., навчальний
ТБ, телебачення
ГБ, гігабайт, гіб
#general punctuation mark (probably won't work correctly because " is in both lines)
«,„, "
», “, "
#notice we use "gib" instead of "GiB" so any WordDelimiterGraphFilter coming
#after us won't split it into two words.
cdrini marked this conversation as resolved.
Show resolved Hide resolved

Expand Down