Commit
bpgergo committed Jan 15, 2012
1 parent 2876ac6, commit d66c522
Showing 7 changed files with 198 additions and 13 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,72 @@ | ||
<html> | ||
<head> | ||
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> | ||
<title> | ||
Hunglish corpus search syntax | ||
</title> | ||
<link rel="stylesheet" href="/static/styles/syntax.css" type="text/css"> | ||
</head> | ||
<body> | ||
<p> | ||
<a href="http://mokk.bme.hu"><img src="http://szotar.mokk.bme.hu/hunglish/mokk-logo.gif"/></a> | ||
<a href="http://www.nytud.hu"><img src="http://szotar.mokk.bme.hu/hunglish/nytud-logo.gif"/></a> | ||
</p> | ||
|
||
<h1> | ||
Search syntax | ||
</h1> | ||
|
||
<p>You can enter any number of space-separated query phrases on the Hungarian side, on the English side, or on both sides at once. | ||
</p> | ||
|
||
<p>The syntax of these query phrases is easiest to describe through a series of examples: | ||
</p> | ||
|
||
<p>The simplest query phrase is a single word: the query <code>ellopták</code> on the Hungarian side returns bisentences whose | ||
Hungarian sentence contains this word. Other inflected forms (<code>ellopnám</code>, <code>ellopott</code>) are also returned, but are ranked lower than exact matches. | ||
</p> | ||
<p> | ||
|
||
To exclude other inflected forms, surround the term with angle brackets (&lt; &gt;). So <code>&lt;ellopták&gt;</code> | ||
will not return results that contain only, say, <code>ellopott</code>. This is called an <i>exact match</i> query. | ||
</p> | ||
|
||
<p> | ||
One can also search for <i>multi-word</i> terms. There are three equivalent ways of doing this: | ||
<ol> | ||
<li>Quotation: <code>"back to normal"</code>.</li> | ||
<li>Dashes: <code>back-to-normal</code>.</li> | ||
<li>Dots: <code>back.to.normal</code>.</li> | ||
</ol> | ||
</p> | ||
|
||
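The equivalence of the three multi-word notations can be illustrated with a minimal Python sketch. It is not the Hunglish implementation: the function name `split_term` is hypothetical, and the sketch assumes that whitespace, dashes and dots all act purely as word separators.

```python
import re
from typing import List


def split_term(term: str) -> List[str]:
    """Split a multi-word search term into its words (hypothetical helper)."""
    # Drop one pair of surrounding double quotes, if present.
    if len(term) >= 2 and term[0] == '"' and term[-1] == '"':
        term = term[1:-1]
    # Assumption: whitespace, dashes and dots are all word separators.
    return [word for word in re.split(r"[\s.\-]+", term) if word]


if __name__ == "__main__":
    for form in ('"back to normal"', "back-to-normal", "back.to.normal"):
        print(form, "->", split_term(form))
```

Under these assumptions all three notations yield the same list of words, which is what makes them interchangeable in a query.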
<p> | ||
One can combine the exact match syntax with multi-word search terms, but mixing angle brackets and quotation marks (<code>&lt;"back to normal"&gt;</code>) is not allowed. The exact match versions of the previous examples therefore look like this: <code>&lt;back to normal&gt;</code>, | ||
<code>&lt;back-to-normal&gt;</code>, <code>&lt;back.to.normal&gt;</code>. | ||
</p> | ||
|
||
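The exact match rule, including the ban on quotation marks inside angle brackets, could be checked roughly as follows. This is only a sketch under stated assumptions: `parse_exact` is a hypothetical name, and raising `ValueError` for the disallowed combination is an illustrative choice, not documented behaviour of the search engine.

```python
from typing import Tuple


def parse_exact(term: str) -> Tuple[str, bool]:
    """Return (inner term, is_exact) for a term that may be wrapped in < >."""
    if term.startswith("<") and term.endswith(">"):
        inner = term[1:-1]
        # Mixing angle brackets and quotation marks is not allowed.
        if '"' in inner:
            raise ValueError("angle brackets and quotation marks cannot be mixed")
        return inner, True
    # No angle brackets: other inflected forms may match as well.
    return term, False


if __name__ == "__main__":
    print(parse_exact("<back to normal>"))
    print(parse_exact("back-to-normal"))
```

The inner term keeps its own multi-word notation (spaces, dashes or dots), so it can be split into words in a separate step.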
<p> | ||
Any query phrase may be prefixed with one of the following two modifiers. No space is allowed between the modifier and the phrase it modifies. | ||
|
||
|
||
<ul> | ||
<li> | ||
The <i>Prohibited</i> modifier (minus sign) negates the query phrase. For example, Hungarian: | ||
<code>fél</code> with English: <code>-scared</code> returns bisentences whose Hungarian sentence | ||
contains a word with the same stem as "fél", but whose English sentence contains no word | ||
with the same stem as "scared". | ||
The Prohibited modifier can be combined with the exact match syntax and with multi-word search terms, | ||
e.g. English: <code>scare -&lt;scared&gt;</code> and <code>-&lt;don't know&gt;</code> are valid queries. | ||
</li> | ||
|
||
<li> | ||
The <i>Required</i> modifier (plus sign) requires that the phrase after the "+" sign occur somewhere in the sentence. Without any modifier, | ||
the phrase is optional. For example, the English query <code>buy into</code> returns bisentences that contain <code>buy</code> OR <code>into</code>, whereas the query <code>+buy +into</code> returns only bisentences that contain both words. | ||
</li> | ||
</ul> | ||
|
||
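One way to picture how the modifiers attach to query phrases is the following Python sketch. It is an illustration under assumptions, not the parser used by the search engine: the names `parse_query`, `_PHRASE` and `KIND` are hypothetical, and a query is assumed to be a sequence of phrases, each being an optional "+" or "-" followed by an angle-bracketed term, a quoted term, or a bare token.

```python
import re
from typing import List, Tuple

# Assumed phrase shape: optional +/- modifier, then <...>, "...", or a bare token.
_PHRASE = re.compile(r'([+-]?)(<[^<>]*>|"[^"]*"|[^\s<>"]+)')

# Map each modifier to the role of its phrase; no modifier means optional.
KIND = {"+": "required", "-": "prohibited", "": "optional"}


def parse_query(query: str) -> List[Tuple[str, str]]:
    """Split a query into (kind, term) pairs, keeping each term's own syntax."""
    return [(KIND[m.group(1)], m.group(2)) for m in _PHRASE.finditer(query)]


if __name__ == "__main__":
    for query in ("buy into", "+buy +into", "scare -<scared>", "-<don't know>"):
        print(query, "->", parse_query(query))
```

Because the modifier must be directly attached to its phrase, a dash inside a bare token such as `back-to-normal` is read as part of the term rather than as a Prohibited modifier.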
<p> | ||
|
||
The query syntax is designed so that simple queries have straightforward syntax, but quite complex queries are manageable, too. Some users might need even more advanced queries, like nested ones, or complex filtering of document sources. For these users we plan to provide direct access to the internal query language, but the syntax for this is quite complex and nonintuitive. | ||
</p> | ||
</html> |
Binary file not shown.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,106 @@ | ||
img { | ||
border: 0px; | ||
} | ||
form { | ||
border: none; | ||
margin: 0; | ||
} | ||
|
||
textarea { | ||
font: 110% "Courier New", Courier, monospace; | ||
border: 1px solid #8cacbb; | ||
color: Black; | ||
background-color: white; | ||
width: 100%; | ||
} | ||
input { | ||
font-family: Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
visibility: visible; | ||
border: 1px solid #8cacbb; | ||
color: Black; | ||
background-color: white; | ||
vertical-align: middle; | ||
} | ||
select { | ||
border: 1px solid #8cacbb; | ||
color: Black; | ||
background-color: White; | ||
vertical-align: top; | ||
} | ||
|
||
a:link { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
font-style : normal; | ||
text-decoration : none; | ||
color: #336699; | ||
} | ||
|
||
a:visited { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
text-decoration : none; | ||
color: #336699; | ||
} | ||
|
||
a:hover { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
text-decoration : none; | ||
color: #000000; | ||
|
||
} | ||
|
||
a:active { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
text-decoration : underline; | ||
} | ||
|
||
a:focus { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
text-decoration : underline; | ||
} | ||
|
||
body { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
font-size : 12px; | ||
margin-bottom : 6px; | ||
margin-left : 6px; | ||
margin-right : 6px; | ||
margin-top : 6px; | ||
width: 800px; | ||
} | ||
|
||
.senttable { | ||
font-family : Verdana, Geneva, Arial, Helvetica, sans-serif; | ||
font-size : 12px; | ||
list-style-type : circle; | ||
margin-bottom : 6px; | ||
margin-left : 0px; | ||
margin-right : 0px; | ||
margin-top : 12px; | ||
} | ||
|
||
.senttable .odd { | ||
/*every second line should be shaded */ | ||
background-color: transparent; | ||
} | ||
.senttable .even { | ||
background-color: #f7f9fa; | ||
border-bottom: 1px solid #8cacbb; | ||
} | ||
|
||
.result td, | ||
table td | ||
{ | ||
border-bottom: 1px solid #8cacbb; | ||
padding: 0.5em 1em; | ||
text-align: left; | ||
} | ||
|
||
.notlast { | ||
border-right: 1px solid #8cacbb; | ||
} | ||
div.pager { | ||
margin-top: 6px; | ||
font-size : 12px; | ||
text-align: center; | ||
|
||
} |