Skip to content

Commit

Permalink
*) changed regex that is created for crawling filter (see http://foru…
Browse files Browse the repository at this point in the history
  • Loading branch information
low012 committed Jul 1, 2007
1 parent 28baecd commit 5180053
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions htroot/CrawlStartExpert_p.html
Expand Up @@ -71,10 +71,10 @@ <h2>Expert Crawl Start</h2>
</tr>
<tr valign="top" class="TableCellDark">
<td><label for="crawlingFilter">Crawling Filter</label>:</td>
<td>
<input name="crawlingFilter" id="crawlingFilter" type="text" size="20" maxlength="100" value="#[crawlingFilter]#" />
<input type="button" value="only one domain" onclick='var domain=prompt("please enter domain without \"www.\" (i.e. \"yacy.net\").");if(domain!=null){document.getElementById("crawlingFilter").value=".*\\."+domain.replace("\.", "\\.")+"/.*"}' />
</td>
<td>
<input name="crawlingFilter" id="crawlingFilter" type="text" size="20" maxlength="100" value="#[crawlingFilter]#" />
<input type="button" value="only one domain" onclick='var domain=prompt("please enter domain without \"www.\" (i.e. \"yacy.net\").");if(domain!=null){document.getElementById("crawlingFilter").value="(^|.*\\.)"+domain.replace("\.", "\\.")+"/.*"}' />
</td>
<td>
This is an emacs-like regular expression that must match with the URLs which are used to be crawled.
Use this i.e. to crawl a single domain. If you set this filter it makes sense to increase
Expand Down Expand Up @@ -216,7 +216,7 @@ <h2>Expert Crawl Start</h2>
<td colspan="5"><input type="submit" name="crawlingstart" value="Start New Crawl" /></td>
</tr>
</table>
</form>
</form>

#%env/templates/footer.template%#
</body>
Expand Down

0 comments on commit 5180053

Please sign in to comment.