Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Browse files

new version and such

  • Loading branch information...
commit d83d81f5c7b7c3c5c57829bacdf75649729e8643 1 parent 5a9bc20
John Brien authored
Showing with 694 additions and 2,768 deletions.
  1. +11 −1 Gemfile
  2. +11 −8 LICENSE
  3. +11 −58 README.md
  4. +22 −67 Rakefile
  5. +0 −3  app/assets/javascripts/store/solr_sort_by.js.coffee
  6. +0 −12 app/helpers/spree/base_helper_decorator.rb
  7. +0 −85 app/models/product_decorator.rb
  8. +5 −0 app/models/spree/app_configuration_decorator.rb
  9. +0 −5 app/models/spree/sunspot_configuration.rb
  10. +20 −0 app/models/spree_product_decorator.rb
  11. +0 −5 app/overrides/add_search_facets.rb
  12. +0 −5 app/overrides/add_search_pagination.rb
  13. +0 −4 app/overrides/add_search_sort.rb
  14. +0 −8 app/overrides/add_search_suggestion.rb
  15. +0 −42 app/views/spree/products/_facets.html.erb
  16. +0 −18 app/views/spree/products/_sort_bar.html.erb
  17. +0 −6 app/views/spree/products/_suggestion.html.erb
  18. +6 −5 config/locales/en.yml
  19. +0 −31 lib/conf/admin-extra.html
  20. +0 −36 lib/conf/elevate.xml
  21. +0 −246 lib/conf/mapping-ISOLatin1Accent.txt
  22. +0 −21 lib/conf/protwords.txt
  23. +0 −238 lib/conf/schema.xml
  24. +0 −24 lib/conf/scripts.conf
  25. +0 −934 lib/conf/solrconfig.xml
  26. +0 −2  lib/conf/spellings.txt
  27. +0 −58 lib/conf/stopwords.txt
  28. +0 −31 lib/conf/synonyms.txt
  29. +0 −132 lib/conf/xslt/example.xsl
  30. +0 −67 lib/conf/xslt/example_atom.xsl
  31. +0 −66 lib/conf/xslt/example_rss.xsl
  32. +0 −337 lib/conf/xslt/luke.xsl
  33. +44 −0 lib/generators/spree/sunspot/install/install_generator.rb
  34. +48 −0 lib/generators/spree/sunspot/install/templates/config/initializers/spree_sunspot.rb
  35. +0 −15 lib/generators/spree_sunspot_search/install/install_generator.rb
  36. +0 −12 lib/generators/templates/spree_sunspot_search.rb
  37. +0 −64 lib/spree/search/configuration.rb
  38. +0 −22 lib/spree/search/engine.rb
  39. +0 −65 lib/spree/search/sunspot.rb
  40. +36 −0 lib/spree/sunspot/engine.rb
  41. +62 −0 lib/spree/sunspot/filter/condition.rb
  42. +62 −0 lib/spree/sunspot/filter/filter.rb
  43. +54 −0 lib/spree/sunspot/filter/param.rb
  44. +36 −0 lib/spree/sunspot/filter/query.rb
  45. +73 −0 lib/spree/sunspot/filter_support.rb
  46. +30 −0 lib/spree/sunspot/filters.rb
  47. +100 −0 lib/spree/sunspot/search.rb
  48. +27 −0 lib/spree/sunspot/setup.rb
  49. +2 −0  lib/spree_sunspot.rb
  50. +0 −13 lib/spree_sunspot_search.rb
  51. +8 −0 lib/tasks/spree_sunspot.rake
  52. +13 −13 spec/spec_helper.rb
  53. +13 −9 spree_sunspot_search.gemspec
12 Gemfile
View
@@ -1,3 +1,13 @@
source 'http://rubygems.org'
-gemspec
+group :test do
+ gem 'ffaker'
+end
+
+if RUBY_VERSION < "1.9"
+ gem "ruby-debug"
+else
+ gem "ruby-debug19"
+end
+
+gemspec
19 LICENSE
View
@@ -1,14 +1,17 @@
-Redistribution and use in source and binary forms, with or without modification,
+Copyright (c) 2012 [name of plugin creator]
+All rights reserved.
+
+Redistribution and use in source and binary forms, with or without modification,
are permitted provided that the following conditions are met:
- * Redistributions of source code must retain the above copyright notice,
+ * Redistributions of source code must retain the above copyright notice,
this list of conditions and the following disclaimer.
- * Redistributions in binary form must reproduce the above copyright notice,
- this list of conditions and the following disclaimer in the documentation
+ * Redistributions in binary form must reproduce the above copyright notice,
+ this list of conditions and the following disclaimer in the documentation
and/or other materials provided with the distribution.
- * Neither the name of the Rails Dog LLC nor the names of its
- contributors may be used to endorse or promote products derived from this
- software without specific prior written permission.
+ * Neither the name Spree nor the names of its contributors may be used to
+ endorse or promote products derived from this software without specific
+ prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
@@ -20,4 +23,4 @@ PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
-SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
69 README.md
View
@@ -1,68 +1,21 @@
-SpreeSunspotSearch
-==================
+Spree::Sunspot
+==============
-Adds Solr search to Spree using [Sunspot](https://github.com/sunspot/sunspot). This is a moving targer and is very beta and should be treated as such.
+Introduction goes here.
-This is compatible with Spree 1.2. I haven't tested it below that.
-
-Install
-=======
-
-I make the assumption that you have a functioning Spree store and are just extending the search capabilities with Sunspot/Solr
-
-Add spree_sunspot_search to your Gemfile and run bundler.
-
-`gem 'spree_sunspot_search', git: 'git://github.com/jbrien/spree_sunspot_search.git'`
-
-add the following to the Gemfile if you are not using another solr install locally for testing and development. The rake tasks for starting and stop this for development are included automatically for your use.
-
- group :test, :development do
- gem 'sunspot_solr'
- end
-
-
-Install the solr.yml file from Sunspot.
-
-`rails g sunspot_rails:install`
-
-Copy the initializer and add `solr_sort_by` to `all.js`
-
-`rails g spree_sunspot_search:install`
-
-Running
+Example
=======
-Start up Solr (bundled with Sunspot's install)
-
-`rake sunspot:solr:run`
-
-Build the index for the first time
-
-`rake sunspot:reindex`
-
-Customise the Facets Shown
---------------------------
-
-Edit the initializer and specify you Product Properties, Product Options, and Price Ranges as an array.
-The initializer should provide enough examples to get you started.
+Example goes here.
Testing
-=======
-
-TODO
+-------
-TODOs
-=====
+Be sure to bundle your dependencies and then create a dummy test app for the specs to run against.
-* Add an automatic MAX value for price facets (e.g. Above <max_said_value>)
-* Sorting by facet criteria and Solr analytics (Best result, Popular, etc.)
-* Open the Sunspot DSL to utilise all the additional data and analytics available through Solr
-* Get the Taxon browsing (e.g. Categories) to utilise the Solr data for speed boosts
-
-Authors
-=======
-* @jbrien
-* @iloveitaly
+ $ bundle
+ $ bundle exec rake test app
+ $ bundle exec rspec spec
-Copyright (c) 2011 John Brien Dilts, released under the New BSD License
+Copyright (c) 2012 [name of extension creator], released under the New BSD License
89 Rakefile
View
@@ -1,75 +1,30 @@
-require 'rubygems'
-require 'rake'
-require 'rake/testtask'
-require 'rake/packagetask'
-require 'rake/gempackagetask'
-
-gemfile = File.expand_path('../spec/test_app/Gemfile', __FILE__)
-if File.exists?(gemfile) && (%w(spec cucumber).include?(ARGV.first.to_s) || ARGV.size == 0)
- require 'bundler'
- ENV['BUNDLE_GEMFILE'] = gemfile
- Bundler.setup
-
- require 'rspec'
- require 'rspec/core/rake_task'
- RSpec::Core::RakeTask.new
-
- require 'cucumber/rake/task'
- Cucumber::Rake::Task.new do |t|
- t.cucumber_opts = %w{--format progress}
- end
-end
-
-desc "Default Task"
-task :default => [:spec, :cucumber ]
-
-spec = eval(File.read('spree_sunspot_search.gemspec'))
-
-Rake::GemPackageTask.new(spec) do |p|
- p.gem_spec = spec
+#!/usr/bin/env rake
+begin
+ require 'bundler/setup'
+rescue LoadError
+ puts 'You must `gem install bundler` and `bundle install` to run rake tasks'
end
-
-desc "Release to gemcutter"
-task :release => :package do
- require 'rake/gemcutter'
- Rake::Gemcutter::Tasks.new(spec).define
- Rake::Task['gem:push'].invoke
+begin
+ require 'rdoc/task'
+rescue LoadError
+ require 'rdoc/rdoc'
+ require 'rake/rdoctask'
+ RDoc::Task = Rake::RDocTask
end
-desc "Default Task"
-task :default => [ :spec ]
-
-desc "Regenerates a rails 3 app for testing"
-task :test_app do
- require '../spree/lib/generators/spree/test_app_generator'
- class SpreeSunspotSearchTestAppGenerator < Spree::Generators::TestAppGenerator
-
- def install_gems
- inside "test_app" do
- run 'bundle exec rake spree_core:install'
- run 'bundle exec rake spree_sunspot_search:install'
- end
- end
+Bundler::GemHelper.install_tasks
- def migrate_db
- run_migrations
- end
+require 'rspec/core/rake_task'
+require 'spree/core/testing_support/common_rake'
- protected
- def full_path_for_local_gems
- <<-gems
-gem 'spree_core', :path => \'#{File.join(File.dirname(__FILE__), "../spree/", "core")}\'
-gem 'spree_sunspot_search', :path => \'#{File.dirname(__FILE__)}\'
- gems
- end
+RSpec::Core::RakeTask.new
- end
- SpreeSunspotSearchTestAppGenerator.start
-end
+task :default => [:spec]
-namespace :test_app do
- desc 'Rebuild test and cucumber databases'
- task :rebuild_dbs do
- system("cd spec/test_app && bundle exec rake db:drop db:migrate RAILS_ENV=test && rake db:drop db:migrate RAILS_ENV=cucumber")
+task :test_app do
+ %w( spree_sunspot ).each do |engine|
+ ENV['LIB_NAME'] = File.join(engine)
+ ENV['DUMMY_PATH'] = File.expand_path("../../#{engine}/spec/dummy", __FILE__)
+ Rake::Task['common:test_app'].execute
end
-end
+end
3  app/assets/javascripts/store/solr_sort_by.js.coffee
View
@@ -1,3 +0,0 @@
-$ ->
- $('#product_sort_by').change ->
- window.location.href = @value
12 app/helpers/spree/base_helper_decorator.rb
View
@@ -1,12 +0,0 @@
-Spree::BaseHelper.module_eval do
- def link_to_facet(facet_name, facet_row)
- # if we are just linking to taxon, link to the permalink instead of query string
-
- if facet_name == :taxon_id
- # use seo_url when linking to a taxon
- link_to(facet_row.instance.name, nested_taxons_path(facet_row.instance.permalink, params.merge("page" => nil))) + " (#{facet_row.count})"
- else
- link_to(facet_row.value, params.merge("#{facet_name}_facet" => facet_row.value, "page" => nil)) + " (#{facet_row.count})"
- end
- end
-end
85 app/models/product_decorator.rb
View
@@ -1,85 +0,0 @@
-Spree::Product.class_eval do
- searchable do
- boolean :is_active, :using => :is_active?
-
- conf = Spree::Search::Sunspot.configuration
-
- conf.fields.each do |field|
- if field.class == Hash
- field = { :opts => {} }.merge(field)
-
- if field[:opts][:block]
- block = field[:opts][:block]
- field[:opts].delete(:block)
- send field[:type], field[:name], field[:opts], &block
- else
- send field[:type], field[:name], field[:opts]
- end
- else
- text(field)
- end
- end
-
- # pull the product's taxon, and all its ancestors: this allows us to intersect the display with the current taxon's
- # children and allow the user to intuitively 'dig down' into the product heirarchy
- # root taxon is excluded: doesn't really allow for intuitive navigation
- integer :taxon_ids, :multiple => true, :references => Spree::Taxon do
- taxons.map { |t| t.self_and_ancestors.select { |tx| !tx.root? }.map(&:id) }.flatten(1).uniq
- end
-
- conf.option_facets.each do |option|
- string "#{option}_facet", :multiple => true do
- get_option_values(option.to_s).map(&:presentation)
- end
- end
-
- conf.property_facets.each do |prop|
- string "#{prop}_facet", :multiple => true do
- property(prop.to_s)
- end
- end
-
- conf.other_facets.each do |method|
- string "#{method}_facet", :multiple => true do
- send(method)
- end
- end
-
- if respond_to?(:stores)
- integer :store_ids, :multiple => true, :references => Store
- end
-
- end
-
- def is_active?
- !deleted_at && available_on &&
- (available_on <= Time.zone.now) &&
- (Spree::Config[:allow_backorders] || count_on_hand > 0)
- end
-
- private
-
- def price_range
- max = 0
- Spree::Search::Sunspot.configuration.price_ranges.each do |range, name|
- return name if range.include?(price)
- max = range.max if range.max > max
- end
- I18n.t(:price_and_above, :price => max)
- end
-
- def get_option_values(option_name)
- # in the next 1.1.x release this should be replaced with the option value accessors
-
- sql = <<-eos
- SELECT DISTINCT ov.id, ov.presentation
- FROM spree_option_values AS ov
- LEFT JOIN spree_option_types AS ot ON (ov.option_type_id = ot.id)
- LEFT JOIN spree_option_values_variants AS ovv ON (ovv.option_value_id = ov.id)
- LEFT JOIN spree_variants AS v ON (ovv.variant_id = v.id)
- LEFT JOIN spree_products AS p ON (v.product_id = p.id)
- WHERE (ot.name = '#{option_name}' AND p.id = #{self.id});
- eos
- Spree::OptionValue.find_by_sql(sql)
- end
-end
5 app/models/spree/app_configuration_decorator.rb
View
@@ -0,0 +1,5 @@
+module Spree
+ AppConfiguration.class_eval do
+ preference :total_similar_products, :integer, :default => 10
+ end
+end
5 app/models/spree/sunspot_configuration.rb
View
@@ -1,5 +0,0 @@
-module Spree
- class SunspotConfiguration < Preferences::Configuration
- preference :facet_display_limit, :integer, :default => -1
- end
-end
20 app/models/spree_product_decorator.rb
View
@@ -0,0 +1,20 @@
+Spree::Product.class_eval do
+ def get_option_values(option_name)
+ sql = <<-eos
+ SELECT DISTINCT ov.id, ov.presentation
+ FROM spree_option_values AS ov
+ LEFT JOIN spree_option_types AS ot ON (ov.option_type_id = ot.id)
+ LEFT JOIN spree_option_values_variants AS ovv ON (ovv.option_value_id = ov.id)
+ LEFT JOIN spree_variants AS v ON (ovv.variant_id = v.id)
+ LEFT JOIN spree_products AS p ON (v.product_id = p.id)
+ WHERE ((ot.name = '#{option_name}' OR ot.presentation = '#{option_name}')
+ AND p.id = #{self.id});
+ eos
+ Spree::OptionValue.find_by_sql(sql).map(&:presentation)
+ end
+end
+
+unless Spree::Sunspot::Setup.configuration.nil?
+ Spree::Product.class_eval &Spree::Sunspot::Setup.configuration
+end
+
5 app/overrides/add_search_facets.rb
View
@@ -1,5 +0,0 @@
-Deface::Override.new(:virtual_path => "spree/shared/_taxonomies",
- :name => "show_search_partials_facets",
- :insert_top => "nav#taxonomies",
- :partial => "spree/products/facets",
- :disabled => false)
5 app/overrides/add_search_pagination.rb
View
@@ -1,5 +0,0 @@
-Deface::Override.new(:virtual_path => "spree/shared/_products",
- :name => "add_sunspot_search_pagination",
- :replace => "code[erb-silent]:contains('if paginated_products.respond_to')",
- :closing_selector => "code[erb-silent]:contains('end')",
- :text => "<%= paginate @searcher.sunspot.hits %>")
4 app/overrides/add_search_sort.rb
View
@@ -1,4 +0,0 @@
-Deface::Override.new(:virtual_path => "spree/shared/_products",
- :name => "add_sort_bar",
- :insert_before => "#products",
- :partial => 'spree/products/sort_bar')
8 app/overrides/add_search_suggestion.rb
View
@@ -1,8 +0,0 @@
-# unfortunately it doesn't look like sunspot has spell check support yet
-# https://github.com/sunspot/sunspot/pull/43
-
-# Deface::Override.new(:virtual_path => "spree/products/index",
-# :name => "show_search_partials_suggestion",
-# :insert_top => "[data-hook='search_results']",
-# :partial => "spree/products/suggestion",
-# :disabled => false)
42 app/views/spree/products/_facets.html.erb
View
@@ -1,42 +0,0 @@
-<%
-facets_arr = Spree::Search::SpreeSunspot.configuration.display_facets
-limit = Spree::SunspotSearch::Config[:facet_display_limit]
-
-if @taxon
- display_list = @taxon.leaf? ? [@taxon.id] : @taxon.children.map(&:id)
-else
- display_list = @searcher.sunspot.facet(:taxon_ids).rows.slice(0..limit).map { |r| r.instance.id }
-end
-
-taxon_rows = @searcher.sunspot.facet(:taxon_ids).rows.select { |t| display_list.include? t.instance.id }.slice(0..limit)
-
-if taxon_rows.length > 1 %>
-<h6><%= t :taxon_facet %></h6>
-<ul>
- <% taxon_rows.each do |taxon| %>
- <%= content_tag(:li, link_to_facet(:taxon_id, taxon)) %>
- <% end %>
-</ul>
-<% end %>
-
-<%# handle the rest of the facets %>
-<% facets_arr.each do |f| %>
- <% unless @searcher.sunspot.facet("#{f}_facet").rows.empty? %>
- <h6><%= t "#{f}_facet" %></h6>
- <ul>
- <% @searcher.sunspot.facet("#{f}_facet").rows.slice(0..limit).each do |row| %>
- <%= content_tag(:li, link_to_facet(f, row)) %>
- <% end %>
- </ul>
- <% end %>
-<% end %>
-
-
-<% unless @searcher.sunspot.facet(:price).rows.empty? %>
-<h6><%= t "price_range" %></h6>
-<ul>
- <% @searcher.sunspot.facet(:price).rows.each do |row| %>
- <li><%= link_to(t(row.value) + " (#{row.count})", params.merge("price" => row.value)) %></li>
- <% end %>
-</ul>
-<% end %>
18 app/views/spree/products/_sort_bar.html.erb
View
@@ -1,18 +0,0 @@
-<%
-if not params.keys.detect { |k| k != 'controller' and k != 'action' }.nil? and params[:controller] != 'spree/taxons'
- # hate to throw this logic here (messy)
- # I think it would be worse to create a one-time-use helper method
- options = Spree::Search::SpreeSunspot.configuration.sort_fields.map do |key, value|
- # value is sort direction
- value = [value] if !value.is_a? Array
- Rails.logger.info "Array Key sort.#{key}_#{value}"
- value.map { |sort| [t("sort.#{key}_#{sort}"), url_for(request.params.merge({:sort => key, :order => sort}))] }
- end
-
- options = options_for_select(options.flatten(1), url_for(request.params.merge({
- :sort => params[:sort] || :score,
- :order => params[:order] || :desc
- })))
-%>
-<div id="product-list-sort"><%= t(:sort_by) %> <%= select_tag("product_sort_by", options) %></div>
-<% end %>
6 app/views/spree/products/_suggestion.html.erb
View
@@ -1,6 +0,0 @@
-<% if suggestion = @searcher.suggest %>
- <p>
- <%= t(:did_you_mean, :default => "Did you mean") %>
- <%= link_to h(suggestion), url_for(request.params.merge({:keywords => suggestion})) %>?
- </p>
-<% end %>
11 config/locales/en.yml
View
@@ -1,6 +1,7 @@
+# Sample localization file for English. Add more files in this directory for other locales.
+# See https://github.com/svenfuchs/rails-i18n/tree/master/rails%2Flocale for starting points.
+
en:
- sort_by: "Sort By:"
- sort:
- price_desc: "Price Desc"
- price_asc: "Price Asc"
- score_desc: "Relevance"
+ empty_search_results: "No products found"
+ clear_all: "Clear All"
+ search: "Search"
31 lib/conf/admin-extra.html
View
@@ -1,31 +0,0 @@
-<!--
- Licensed to the Apache Software Foundation (ASF) under one or more
- contributor license agreements. See the NOTICE file distributed with
- this work for additional information regarding copyright ownership.
- The ASF licenses this file to You under the Apache License, Version 2.0
- (the "License"); you may not use this file except in compliance with
- the License. You may obtain a copy of the License at
-
- http://www.apache.org/licenses/LICENSE-2.0
-
- Unless required by applicable law or agreed to in writing, software
- distributed under the License is distributed on an "AS IS" BASIS,
- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
- See the License for the specific language governing permissions and
- limitations under the License.
--->
-
-<!-- The content of this page will be statically included into the top
-of the admin page. Uncomment this as an example to see there the content
-will show up.
-
-<hr>
-<i>This line will appear before the first table</i>
-<tr>
-<td colspan="2">
-This row will be appended to the end of the first table
-</td>
-</tr>
-<hr>
-
--->
36 lib/conf/elevate.xml
View
@@ -1,36 +0,0 @@
-<?xml version="1.0" encoding="UTF-8" ?>
-<!--
- Licensed to the Apache Software Foundation (ASF) under one or more
- contributor license agreements. See the NOTICE file distributed with
- this work for additional information regarding copyright ownership.
- The ASF licenses this file to You under the Apache License, Version 2.0
- (the "License"); you may not use this file except in compliance with
- the License. You may obtain a copy of the License at
-
- http://www.apache.org/licenses/LICENSE-2.0
-
- Unless required by applicable law or agreed to in writing, software
- distributed under the License is distributed on an "AS IS" BASIS,
- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
- See the License for the specific language governing permissions and
- limitations under the License.
--->
-
-<!-- If this file is found in the config directory, it will only be
- loaded once at startup. If it is found in Solr's data
- directory, it will be re-loaded every commit.
--->
-
-<elevate>
- <query text="foo bar">
- <doc id="1" />
- <doc id="2" />
- <doc id="3" />
- </query>
-
- <query text="ipod">
- <doc id="MA147LL/A" /> <!-- put the actual ipod at the top -->
- <doc id="IW-02" exclude="true" /> <!-- exclude this cable -->
- </query>
-
-</elevate>
246 lib/conf/mapping-ISOLatin1Accent.txt
View
@@ -1,246 +0,0 @@
-# The ASF licenses this file to You under the Apache License, Version 2.0
-# (the "License"); you may not use this file except in compliance with
-# the License. You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-# Syntax:
-# "source" => "target"
-# "source".length() > 0 (source cannot be empty.)
-# "target".length() >= 0 (target can be empty.)
-
-# example:
-# "À" => "A"
-# "\u00C0" => "A"
-# "\u00C0" => "\u0041"
-# "ß" => "ss"
-# "\t" => " "
-# "\n" => ""
-
-# À => A
-"\u00C0" => "A"
-
-# Á => A
-"\u00C1" => "A"
-
-# Â => A
-"\u00C2" => "A"
-
-# Ã => A
-"\u00C3" => "A"
-
-# Ä => A
-"\u00C4" => "A"
-
-# Å => A
-"\u00C5" => "A"
-
-# Æ => AE
-"\u00C6" => "AE"
-
-# Ç => C
-"\u00C7" => "C"
-
-# È => E
-"\u00C8" => "E"
-
-# É => E
-"\u00C9" => "E"
-
-# Ê => E
-"\u00CA" => "E"
-
-# Ë => E
-"\u00CB" => "E"
-
-# Ì => I
-"\u00CC" => "I"
-
-# Í => I
-"\u00CD" => "I"
-
-# Î => I
-"\u00CE" => "I"
-
-# Ï => I
-"\u00CF" => "I"
-
-# IJ => IJ
-"\u0132" => "IJ"
-
-# Ð => D
-"\u00D0" => "D"
-
-# Ñ => N
-"\u00D1" => "N"
-
-# Ò => O
-"\u00D2" => "O"
-
-# Ó => O
-"\u00D3" => "O"
-
-# Ô => O
-"\u00D4" => "O"
-
-# Õ => O
-"\u00D5" => "O"
-
-# Ö => O
-"\u00D6" => "O"
-
-# Ø => O
-"\u00D8" => "O"
-
-# Π=> OE
-"\u0152" => "OE"
-
-# Þ
-"\u00DE" => "TH"
-
-# Ù => U
-"\u00D9" => "U"
-
-# Ú => U
-"\u00DA" => "U"
-
-# Û => U
-"\u00DB" => "U"
-
-# Ü => U
-"\u00DC" => "U"
-
-# Ý => Y
-"\u00DD" => "Y"
-
-# Ÿ => Y
-"\u0178" => "Y"
-
-# à => a
-"\u00E0" => "a"
-
-# á => a
-"\u00E1" => "a"
-
-# â => a
-"\u00E2" => "a"
-
-# ã => a
-"\u00E3" => "a"
-
-# ä => a
-"\u00E4" => "a"
-
-# å => a
-"\u00E5" => "a"
-
-# æ => ae
-"\u00E6" => "ae"
-
-# ç => c
-"\u00E7" => "c"
-
-# è => e
-"\u00E8" => "e"
-
-# é => e
-"\u00E9" => "e"
-
-# ê => e
-"\u00EA" => "e"
-
-# ë => e
-"\u00EB" => "e"
-
-# ì => i
-"\u00EC" => "i"
-
-# í => i
-"\u00ED" => "i"
-
-# î => i
-"\u00EE" => "i"
-
-# ï => i
-"\u00EF" => "i"
-
-# ij => ij
-"\u0133" => "ij"
-
-# ð => d
-"\u00F0" => "d"
-
-# ñ => n
-"\u00F1" => "n"
-
-# ò => o
-"\u00F2" => "o"
-
-# ó => o
-"\u00F3" => "o"
-
-# ô => o
-"\u00F4" => "o"
-
-# õ => o
-"\u00F5" => "o"
-
-# ö => o
-"\u00F6" => "o"
-
-# ø => o
-"\u00F8" => "o"
-
-# œ => oe
-"\u0153" => "oe"
-
-# ß => ss
-"\u00DF" => "ss"
-
-# þ => th
-"\u00FE" => "th"
-
-# ù => u
-"\u00F9" => "u"
-
-# ú => u
-"\u00FA" => "u"
-
-# û => u
-"\u00FB" => "u"
-
-# ü => u
-"\u00FC" => "u"
-
-# ý => y
-"\u00FD" => "y"
-
-# ÿ => y
-"\u00FF" => "y"
-
-# ff => ff
-"\uFB00" => "ff"
-
-# fi => fi
-"\uFB01" => "fi"
-
-# fl => fl
-"\uFB02" => "fl"
-
-# ffi => ffi
-"\uFB03" => "ffi"
-
-# ffl => ffl
-"\uFB04" => "ffl"
-
-# ſt => ft
-"\uFB05" => "ft"
-
-# st => st
-"\uFB06" => "st"
21 lib/conf/protwords.txt
View
@@ -1,21 +0,0 @@
-# The ASF licenses this file to You under the Apache License, Version 2.0
-# (the "License"); you may not use this file except in compliance with
-# the License. You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-#-----------------------------------------------------------------------
-# Use a protected word file to protect against the stemmer reducing two
-# unrelated words to the same base word.
-
-# Some non-words that normally won't be encountered,
-# just to test that they won't be stemmed.
-dontstems
-zwhacky
-
238 lib/conf/schema.xml
View
@@ -1,238 +0,0 @@
-<?xml version="1.0" encoding="UTF-8"?>
-<!--
- Licensed to the Apache Software Foundation (ASF) under one or more
- contributor license agreements. See the NOTICE file distributed with
- this work for additional information regarding copyright ownership.
- The ASF licenses this file to You under the Apache License, Version 2.0
- (the "License"); you may not use this file except in compliance with
- the License. You may obtain a copy of the License at
-
- http://www.apache.org/licenses/LICENSE-2.0
-
- Unless required by applicable law or agreed to in writing, software
- distributed under the License is distributed on an "AS IS" BASIS,
- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
- See the License for the specific language governing permissions and
- limitations under the License.
--->
-<!--
- This is the Solr schema file. This file should be named "schema.xml" and
- should be in the conf directory under the solr home
- (i.e. ./solr/conf/schema.xml by default)
- or located where the classloader for the Solr webapp can find it.
-
- This example schema is the recommended starting point for users.
- It should be kept correct and concise, usable out-of-the-box.
-
- For more information, on how to customize this file, please see
- http://wiki.apache.org/solr/SchemaXml
-
- PERFORMANCE NOTE: this schema includes many optional features and should not
- be used for benchmarking. To improve performance one could
- - set stored="false" for all fields possible (esp large fields) when you
- only need to search on the field but don't need to return the original
- value.
- - set indexed="false" if you don't need to search on the field, but only
- return the field as a result of searching on other indexed fields.
- - remove all unneeded copyField statements
- - for best index size and searching performance, set "index" to false
- for all general text fields, use copyField to copy them to the
- catchall "text" field, and use that for searching.
- - For maximum indexing performance, use the StreamingUpdateSolrServer
- java client.
- - Remember to run the JVM in server mode, and use a higher logging level
- that avoids logging every request
--->
-<schema name="sunspot" version="1.0">
- <types>
- <!-- field type definitions. The "name" attribute is
- just a label to be used by field definitions. The "class"
- attribute and any other attributes determine the real
- behavior of the fieldType.
- Class names starting with "solr" refer to java classes in the
- org.apache.solr.analysis package.
- -->
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="string" class="solr.StrField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="tdouble" class="solr.TrieDoubleField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="rand" class="solr.RandomSortField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="text" class="solr.TextField" omitNorms="false">
- <analyzer>
- <tokenizer class="solr.StandardTokenizerFactory"/>
- <filter class="solr.StandardFilterFactory"/>
- <filter class="solr.LowerCaseFilterFactory"/>
- </analyzer>
- </fieldType>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="boolean" class="solr.BoolField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="date" class="solr.DateField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="sdouble" class="solr.SortableDoubleField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="sfloat" class="solr.SortableFloatField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="sint" class="solr.SortableIntField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="slong" class="solr.SortableLongField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="tint" class="solr.TrieIntField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="tfloat" class="solr.TrieFloatField" omitNorms="true"/>
- <!-- *** This fieldType is used by Sunspot! *** -->
- <fieldType name="tdate" class="solr.TrieDateField" omitNorms="true"/>
- </types>
- <fields>
- <!-- Valid attributes for fields:
- name: mandatory - the name for the field
- type: mandatory - the name of a previously defined type from the
- <types> section
- indexed: true if this field should be indexed (searchable or sortable)
- stored: true if this field should be retrievable
- compressed: [false] if this field should be stored using gzip compression
- (this will only apply if the field type is compressable; among
- the standard field types, only TextField and StrField are)
- multiValued: true if this field may contain multiple values per document
- omitNorms: (expert) set to true to omit the norms associated with
- this field (this disables length normalization and index-time
- boosting for the field, and saves some memory). Only full-text
- fields or fields that need an index-time boost need norms.
- termVectors: [false] set to true to store the term vector for a
- given field.
- When using MoreLikeThis, fields used for similarity should be
- stored for best performance.
- termPositions: Store position information with the term vector.
- This will increase storage costs.
- termOffsets: Store offset information with the term vector. This
- will increase storage costs.
- default: a value that should be used if no value is specified
- when adding a document.
- -->
- <!-- *** This field is used by Sunspot! *** -->
- <field name="id" stored="true" type="string" multiValued="false" indexed="true"/>
- <!-- *** This field is used by Sunspot! *** -->
- <field name="type" stored="false" type="string" multiValued="true" indexed="true"/>
- <!-- *** This field is used by Sunspot! *** -->
- <field name="class_name" stored="false" type="string" multiValued="false" indexed="true"/>
- <!-- *** This field is used by Sunspot! *** -->
- <field name="text" stored="false" type="string" multiValued="true" indexed="true"/>
- <!-- *** This field is used by Sunspot! *** -->
- <field name="lat" stored="true" type="tdouble" multiValued="false" indexed="true"/>
- <!-- *** This field is used by Sunspot! *** -->
- <field name="lng" stored="true" type="tdouble" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="random_*" stored="false" type="rand" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="_local*" stored="false" type="tdouble" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_text" stored="false" type="text" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_texts" stored="true" type="text" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_b" stored="false" type="boolean" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_bm" stored="false" type="boolean" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_bs" stored="true" type="boolean" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_bms" stored="true" type="boolean" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_d" stored="false" type="date" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_dm" stored="false" type="date" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ds" stored="true" type="date" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_dms" stored="true" type="date" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_e" stored="false" type="sdouble" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_em" stored="false" type="sdouble" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_es" stored="true" type="sdouble" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ems" stored="true" type="sdouble" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_f" stored="false" type="sfloat" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_fm" stored="false" type="sfloat" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_fs" stored="true" type="sfloat" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_fms" stored="true" type="sfloat" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_i" stored="false" type="sint" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_im" stored="false" type="sint" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_is" stored="true" type="sint" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ims" stored="true" type="sint" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_l" stored="false" type="slong" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_lm" stored="false" type="slong" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ls" stored="true" type="slong" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_lms" stored="true" type="slong" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_s" stored="false" type="string" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_sm" stored="false" type="string" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ss" stored="true" type="string" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_sms" stored="true" type="string" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_it" stored="false" type="tint" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_itm" stored="false" type="tint" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_its" stored="true" type="tint" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_itms" stored="true" type="tint" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ft" stored="false" type="tfloat" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ftm" stored="false" type="tfloat" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_fts" stored="true" type="tfloat" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ftms" stored="true" type="tfloat" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_dt" stored="false" type="tdate" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_dtm" stored="false" type="tdate" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_dts" stored="true" type="tdate" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_dtms" stored="true" type="tdate" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_textv" stored="false" termVectors="true" type="text" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_textsv" stored="true" termVectors="true" type="text" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_et" stored="false" termVectors="true" type="tdouble" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_etm" stored="false" termVectors="true" type="tdouble" multiValued="true" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_ets" stored="true" termVectors="true" type="tdouble" multiValued="false" indexed="true"/>
- <!-- *** This dynamicField is used by Sunspot! *** -->
- <dynamicField name="*_etms" stored="true" termVectors="true" type="tdouble" multiValued="true" indexed="true"/>
- </fields>
- <!-- Field to use to determine and enforce document uniqueness.
- Unless this field is marked with required="false", it will be a required field
- -->
- <uniqueKey>id</uniqueKey>
- <!-- field for the QueryParser to use when an explicit fieldname is absent -->
- <defaultSearchField>text</defaultSearchField>
- <!-- SolrQueryParser configuration: defaultOperator="AND|OR" -->
- <solrQueryParser defaultOperator="AND"/>
- <!-- copyField commands copy one field to another at the time a document
- is added to the index. It's used either to index the same field differently,
- or to add multiple fields to the same field for easier/faster searching. -->
-</schema>
24 lib/conf/scripts.conf
View
@@ -1,24 +0,0 @@
-# Licensed to the Apache Software Foundation (ASF) under one or more
-# contributor license agreements. See the NOTICE file distributed with
-# this work for additional information regarding copyright ownership.
-# The ASF licenses this file to You under the Apache License, Version 2.0
-# (the "License"); you may not use this file except in compliance with
-# the License. You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-user=
-solr_hostname=localhost
-solr_port=8983
-rsyncd_port=18983
-data_dir=
-webapp_name=solr
-master_host=
-master_data_dir=
-master_status_dir=
934 lib/conf/solrconfig.xml
View
@@ -1,934 +0,0 @@
-<?xml version="1.0" encoding="UTF-8"?>
-<!--
- Licensed to the Apache Software Foundation (ASF) under one or more
- contributor license agreements. See the NOTICE file distributed with
- this work for additional information regarding copyright ownership.
- The ASF licenses this file to You under the Apache License, Version 2.0
- (the "License"); you may not use this file except in compliance with
- the License. You may obtain a copy of the License at
-
- http://www.apache.org/licenses/LICENSE-2.0
-
- Unless required by applicable law or agreed to in writing, software
- distributed under the License is distributed on an "AS IS" BASIS,
- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
- See the License for the specific language governing permissions and
- limitations under the License.
--->
-<!--
- For more details about configurations options that may appear in this
- file, see http://wiki.apache.org/solr/SolrConfigXml.
-
- Specifically, the Solr Config can support XInclude, which may make it easier to manage
- the configuration. See https://issues.apache.org/jira/browse/SOLR-1167
--->
-<config>
- <!-- Set this to 'false' if you want solr to continue working after it has
- encountered an severe configuration error. In a production environment,
- you may want solr to keep working even if one handler is mis-configured.
-
- You may also set this to false using by setting the system property:
- -Dsolr.abortOnConfigurationError=false
- -->
- <abortOnConfigurationError>${solr.abortOnConfigurationError:true}</abortOnConfigurationError>
- <!-- lib directives can be used to instruct Solr to load an Jars identified
- and use them to resolve any "plugins" specified in your solrconfig.xml or
- schema.xml (ie: Analyzers, Request Handlers, etc...).
-
- All directories and paths are resolved relative the instanceDir.
-
- If a "./lib" directory exists in your instanceDir, all files found in it
- are included as if you had used the following syntax...
-
- <lib dir="./lib" />
- -->
- <!-- A dir option by itself adds any files found in the directory to the
- classpath, this is useful for including all jars in a directory.
- -->
- <lib dir="../../contrib/extraction/lib"/>
- <!-- When a regex is specified in addition to a directory, only the files in that
- directory which completely match the regex (anchored on both ends)
- will be included.
- -->
- <lib dir="../../dist/" regex="apache-solr-cell-\d.*\.jar"/>
- <lib dir="../../dist/" regex="apache-solr-clustering-\d.*\.jar"/>
- <!-- If a dir option (with or without a regex) is used and nothing is found
- that matches, it will be ignored
- -->
- <lib dir="../../contrib/clustering/lib/downloads/"/>
- <lib dir="../../contrib/clustering/lib/"/>
- <lib dir="/total/crap/dir/ignored"/>
- <!-- an exact path can be used to specify a specific file. This will cause
- a serious error to be logged if it can't be loaded.
- <lib path="../a-jar-that-does-not-exist.jar" />
- -->
- <!-- Used to specify an alternate directory to hold all index data
- other than the default ./data under the Solr home.
- If replication is in use, this should match the replication configuration. -->
- <dataDir>${solr.data.dir:./solr/data}</dataDir>
- <!-- WARNING: this <indexDefaults> section only provides defaults for index writers
- in general. See also the <mainIndex> section after that when changing parameters
- for Solr's main Lucene index. -->
- <indexDefaults>
- <!-- Values here affect all index writers and act as a default unless overridden. -->
- <useCompoundFile>false</useCompoundFile>
- <mergeFactor>10</mergeFactor>
- <!-- If both ramBufferSizeMB and maxBufferedDocs is set, then Lucene will flush
- based on whichever limit is hit first. -->
- <!--<maxBufferedDocs>1000</maxBufferedDocs>-->
- <!-- Sets the amount of RAM that may be used by Lucene indexing
- for buffering added documents and deletions before they are
- flushed to the Directory. -->
- <ramBufferSizeMB>32</ramBufferSizeMB>
- <!-- <maxMergeDocs>2147483647</maxMergeDocs> -->
- <maxFieldLength>10000</maxFieldLength>
- <writeLockTimeout>1000</writeLockTimeout>
- <commitLockTimeout>10000</commitLockTimeout>
- <!--
- Expert: Turn on Lucene's auto commit capability. This causes intermediate
- segment flushes to write a new lucene index descriptor, enabling it to be
- opened by an external IndexReader. This can greatly slow down indexing
- speed. NOTE: Despite the name, this value does not have any relation to
- Solr's autoCommit functionality
- -->
- <!--<luceneAutoCommit>false</luceneAutoCommit>-->
- <!--
- Expert: The Merge Policy in Lucene controls how merging is handled by
- Lucene. The default in 2.3 is the LogByteSizeMergePolicy, previous
- versions used LogDocMergePolicy.
-
- LogByteSizeMergePolicy chooses segments to merge based on their size. The
- Lucene 2.2 default, LogDocMergePolicy chose when to merge based on number
- of documents
-
- Other implementations of MergePolicy must have a no-argument constructor
- -->
- <!--<mergePolicy class="org.apache.lucene.index.LogByteSizeMergePolicy"/>-->
- <!--
- Expert:
- The Merge Scheduler in Lucene controls how merges are performed. The
- ConcurrentMergeScheduler (Lucene 2.3 default) can perform merges in the
- background using separate threads. The SerialMergeScheduler (Lucene 2.2
- default) does not.
- -->
- <!--<mergeScheduler class="org.apache.lucene.index.ConcurrentMergeScheduler"/>-->
- <!--
- This option specifies which Lucene LockFactory implementation to use.
-
- single = SingleInstanceLockFactory - suggested for a read-only index
- or when there is no possibility of another process trying
- to modify the index.
- native = NativeFSLockFactory - uses OS native file locking
- simple = SimpleFSLockFactory - uses a plain file for locking
-
- (For backwards compatibility with Solr 1.2, 'simple' is the default
- if not specified.)
- -->
- <lockType>native</lockType>
- <!--
- Expert:
- Controls how often Lucene loads terms into memory -->
- <!--<termIndexInterval>256</termIndexInterval>-->
- </indexDefaults>
- <mainIndex>
- <!-- options specific to the main on-disk lucene index -->
- <useCompoundFile>false</useCompoundFile>
- <ramBufferSizeMB>32</ramBufferSizeMB>
- <mergeFactor>10</mergeFactor>
- <!-- Deprecated -->
- <!--<maxBufferedDocs>1000</maxBufferedDocs>-->
- <!--<maxMergeDocs>2147483647</maxMergeDocs>-->
- <!-- inherit from indexDefaults <maxFieldLength>10000</maxFieldLength> -->
- <!-- If true, unlock any held write or commit locks on startup.
- This defeats the locking mechanism that allows multiple
- processes to safely access a lucene index, and should be
- used with care.
- This is not needed if lock type is 'none' or 'single'
- -->
- <unlockOnStartup>false</unlockOnStartup>
- <!-- If true, IndexReaders will be reopened (often more efficient) instead
- of closed and then opened. -->
- <reopenReaders>true</reopenReaders>
- <!--
- Expert:
- Controls how often Lucene loads terms into memory. Default is 128 and is likely good for most everyone. -->
- <!--<termIndexInterval>256</termIndexInterval>-->
- <!--
- Custom deletion policies can specified here. The class must
- implement org.apache.lucene.index.IndexDeletionPolicy.
-
- http://lucene.apache.org/java/2_3_2/api/org/apache/lucene/index/IndexDeletionPolicy.html
-
- The standard Solr IndexDeletionPolicy implementation supports deleting
- index commit points on number of commits, age of commit point and
- optimized status.
-
- The latest commit point should always be preserved regardless
- of the criteria.
- -->
- <deletionPolicy class="solr.SolrDeletionPolicy">
- <!-- The number of commit points to be kept -->
- <str name="maxCommitsToKeep">1</str>
- <!-- The number of optimized commit points to be kept -->
- <str name="maxOptimizedCommitsToKeep">0</str>
- <!--
- Delete all commit points once they have reached the given age.
- Supports DateMathParser syntax e.g.
-
- <str name="maxCommitAge">30MINUTES</str>
- <str name="maxCommitAge">1DAY</str>
- -->
- </deletionPolicy>
- <!-- To aid in advanced debugging, you may turn on IndexWriter debug logging.
- Setting to true will set the file that the underlying Lucene IndexWriter
- will write its debug infostream to. -->
- <infoStream file="INFOSTREAM.txt">false</infoStream>
- </mainIndex>
- <!-- Enables JMX if and only if an existing MBeanServer is found, use this
- if you want to configure JMX through JVM parameters. Remove this to disable
- exposing Solr configuration and statistics to JMX.
-
- If you want to connect to a particular server, specify the agentId
- e.g. <jmx agentId="myAgent" />
-
- If you want to start a new MBeanServer, specify the serviceUrl
- e.g <jmx serviceUrl="service:jmx:rmi:///jndi/rmi://localhost:9999/solr"/>
-
- For more details see http://wiki.apache.org/solr/SolrJmx
- -->
- <jmx/>
- <!-- the default high-performance update handler -->
- <updateHandler class="solr.DirectUpdateHandler2">
- <!-- A prefix of "solr." for class names is an alias that
- causes solr to search appropriate packages, including
- org.apache.solr.(search|update|request|core|analysis)
- -->
- <!-- Perform a <commit/> automatically under certain conditions:
- maxDocs - number of updates since last commit is greater than this
- maxTime - oldest uncommited update (in ms) is this long ago
- Instead of enabling autoCommit, consider using "commitWithin"
- when adding documents. http://wiki.apache.org/solr/UpdateXmlMessages
- <autoCommit>
- <maxDocs>10000</maxDocs>
- <maxTime>1000</maxTime>
- </autoCommit>
- -->
- <!-- The RunExecutableListener executes an external command from a
- hook such as postCommit or postOptimize.
- exe - the name of the executable to run
- dir - dir to use as the current working directory. default="."
- wait - the calling thread waits until the executable returns. default="true"
- args - the arguments to pass to the program. default=nothing
- env - environment variables to set. default=nothing
- -->
- <!-- A postCommit event is fired after every commit or optimize command
- <listener event="postCommit" class="solr.RunExecutableListener">
- <str name="exe">solr/bin/snapshooter</str>
- <str name="dir">.</str>
- <bool name="wait">true</bool>
- <arr name="args"> <str>arg1</str> <str>arg2</str> </arr>
- <arr name="env"> <str>MYVAR=val1</str> </arr>
- </listener>
- -->
- <!-- A postOptimize event is fired only after every optimize command
- <listener event="postOptimize" class="solr.RunExecutableListener">
- <str name="exe">snapshooter</str>
- <str name="dir">solr/bin</str>
- <bool name="wait">true</bool>
- </listener>
- -->
- </updateHandler>
- <!-- Use the following format to specify a custom IndexReaderFactory - allows for alternate
- IndexReader implementations.
-
- ** Experimental Feature **
- Please note - Using a custom IndexReaderFactory may prevent certain other features
- from working. The API to IndexReaderFactory may change without warning or may even
- be removed from future releases if the problems cannot be resolved.
-
- ** Features that may not work with custom IndexReaderFactory **
- The ReplicationHandler assumes a disk-resident index. Using a custom
- IndexReader implementation may cause incompatibility with ReplicationHandler and
- may cause replication to not work correctly. See SOLR-1366 for details.
-
- <indexReaderFactory name="IndexReaderFactory" class="package.class">
- Parameters as required by the implementation
- </indexReaderFactory >
- -->
- <!-- To set the termInfosIndexDivisor, do this: -->
- <!--<indexReaderFactory name="IndexReaderFactory" class="org.apache.solr.core.StandardIndexReaderFactory">
- <int name="termInfosIndexDivisor">12</int>
- </indexReaderFactory >-->
- <query>
- <!-- Maximum number of clauses in a boolean query... in the past, this affected
- range or prefix queries that expanded to big boolean queries - built in Solr
- query parsers no longer create queries with this limitation.
- An exception is thrown if exceeded. -->
- <maxBooleanClauses>1024</maxBooleanClauses>
- <!-- There are two implementations of cache available for Solr,
- LRUCache, based on a synchronized LinkedHashMap, and
- FastLRUCache, based on a ConcurrentHashMap. FastLRUCache has faster gets
- and slower puts in single threaded operation and thus is generally faster
- than LRUCache when the hit ratio of the cache is high (> 75%), and may be
- faster under other scenarios on multi-cpu systems. -->
- <!-- Cache used by SolrIndexSearcher for filters (DocSets),
- unordered sets of *all* documents that match a query.
- When a new searcher is opened, its caches may be prepopulated
- or "autowarmed" using data from caches in the old searcher.
- autowarmCount is the number of items to prepopulate. For LRUCache,
- the autowarmed items will be the most recently accessed items.
- Parameters:
- class - the SolrCache implementation LRUCache or FastLRUCache
- size - the maximum number of entries in the cache
- initialSize - the initial capacity (number of entries) of
- the cache. (seel java.util.HashMap)
- autowarmCount - the number of entries to prepopulate from
- and old cache.
- -->
- <filterCache class="solr.FastLRUCache" size="512" initialSize="512" autowarmCount="0"/>
- <!-- Cache used to hold field values that are quickly accessible
- by document id. The fieldValueCache is created by default
- even if not configured here.
- <fieldValueCache
- class="solr.FastLRUCache"
- size="512"
- autowarmCount="128"
- showItems="32"
- />
- -->
- <!-- queryResultCache caches results of searches - ordered lists of
- document ids (DocList) based on a query, a sort, and the range
- of documents requested. -->
- <queryResultCache class="solr.LRUCache" size="512" initialSize="512" autowarmCount="0"/>
- <!-- documentCache caches Lucene Document objects (the stored fields for each document).
- Since Lucene internal document ids are transient, this cache will not be autowarmed. -->
- <documentCache class="solr.LRUCache" size="512" initialSize="512" autowarmCount="0"/>
- <!-- If true, stored fields that are not requested will be loaded lazily.
- This can result in a significant speed improvement if the usual case is to
- not load all stored fields, especially if the skipped fields are large
- compressed text fields.
- -->
- <enableLazyFieldLoading>true</enableLazyFieldLoading>
- <!-- Example of a generic cache. These caches may be accessed by name
- through SolrIndexSearcher.getCache(),cacheLookup(), and cacheInsert().
- The purpose is to enable easy caching of user/application level data.
- The regenerator argument should be specified as an implementation
- of solr.search.CacheRegenerator if autowarming is desired. -->
- <!--
- <cache name="myUserCache"
- class="solr.LRUCache"
- size="4096"
- initialSize="1024"
- autowarmCount="1024"
- regenerator="org.mycompany.mypackage.MyRegenerator"
- />
- -->
- <!-- An optimization that attempts to use a filter to satisfy a search.
- If the requested sort does not include score, then the filterCache
- will be checked for a filter matching the query. If found, the filter
- will be used as the source of document ids, and then the sort will be
- applied to that.
- <useFilterForSortedQuery>true</useFilterForSortedQuery>
- -->
- <!-- An optimization for use with the queryResultCache. When a search
- is requested, a superset of the requested number of document ids
- are collected. For example, if a search for a particular query
- requests matching documents 10 through 19, and queryWindowSize is 50,
- then documents 0 through 49 will be collected and cached. Any further
- requests in that range can be satisfied via the cache. -->
- <queryResultWindowSize>20</queryResultWindowSize>
- <!-- Maximum number of documents to cache for any entry in the
- queryResultCache. -->
- <queryResultMaxDocsCached>200</queryResultMaxDocsCached>
- <!-- a newSearcher event is fired whenever a new searcher is being prepared
- and there is a current searcher handling requests (aka registered).
- It can be used to prime certain caches to prevent long request times for
- certain requests.
- -->
- <!-- QuerySenderListener takes an array of NamedList and executes a
- local query request for each NamedList in sequence. -->
- <listener event="newSearcher" class="solr.QuerySenderListener">
- <arr name="queries">
- <!--
- <lst> <str name="q">solr</str> <str name="start">0</str> <str name="rows">10</str> </lst>
- <lst> <str name="q">rocks</str> <str name="start">0</str> <str name="rows">10</str> </lst>
- <lst><str name="q">static newSearcher warming query from solrconfig.xml</str></lst>
- -->
- </arr>
- </listener>
- <!-- a firstSearcher event is fired whenever a new searcher is being
- prepared but there is no current registered searcher to handle
- requests or to gain autowarming data from. -->
- <listener event="firstSearcher" class="solr.QuerySenderListener">
- <arr name="queries">
- <lst>
- <str name="q">solr rocks</str>
- <str name="start">0</str>
- <str name="rows">10</str>
- </lst>
- <lst>
- <str name="q">static firstSearcher warming query from solrconfig.xml</str>
- </lst>
- </arr>
- </listener>
- <!-- If a search request comes in and there is no current registered searcher,
- then immediately register the still warming searcher and use it. If
- "false" then all requests will block until the first searcher is done
- warming. -->
- <useColdSearcher>false</useColdSearcher>
- <!-- Maximum number of searchers that may be warming in the background
- concurrently. An error is returned if this limit is exceeded. Recommend
- 1-2 for read-only slaves, higher for masters w/o cache warming. -->
- <maxWarmingSearchers>2</maxWarmingSearchers>
- </query>
- <!--
- Let the dispatch filter handler /select?qt=XXX
- handleSelect=true will use consistent error handling for /select and /update
- handleSelect=false will use solr1.1 style error formatting
- -->
- <requestDispatcher handleSelect="true">
- <!--Make sure your system has some authentication before enabling remote streaming! -->
- <requestParsers enableRemoteStreaming="true" multipartUploadLimitInKB="2048000"/>
- <!-- Set HTTP caching related parameters (for proxy caches and clients).
-
- To get the behaviour of Solr 1.2 (ie: no caching related headers)
- use the never304="true" option and do not specify a value for
- <cacheControl>
- -->
- <!-- <httpCaching never304="true"> -->
- <httpCaching lastModifiedFrom="openTime" etagSeed="Solr">
- <!-- lastModFrom="openTime" is the default, the Last-Modified value
- (and validation against If-Modified-Since requests) will all be
- relative to when the current Searcher was opened.
- You can change it to lastModFrom="dirLastMod" if you want the
- value to exactly corrispond to when the physical index was last
- modified.
-
- etagSeed="..." is an option you can change to force the ETag
- header (and validation against If-None-Match requests) to be
- differnet even if the index has not changed (ie: when making
- significant changes to your config file)
-
- lastModifiedFrom and etagSeed are both ignored if you use the
- never304="true" option.
- -->
- <!-- If you include a <cacheControl> directive, it will be used to
- generate a Cache-Control header, as well as an Expires header
- if the value contains "max-age="
-
- By default, no Cache-Control header is generated.
-
- You can use the <cacheControl> option even if you have set
- never304="true"
- -->
- <!-- <cacheControl>max-age=30, public</cacheControl> -->
- </httpCaching>
- </requestDispatcher>
- <!-- requestHandler plugins... incoming queries will be dispatched to the
- correct handler based on the path or the qt (query type) param.
- Names starting with a '/' are accessed with the a path equal to the
- registered name. Names without a leading '/' are accessed with:
- http://host/app/select?qt=name
- If no qt is defined, the requestHandler that declares default="true"
- will be used.
- -->
- <requestHandler name="standard" class="solr.SearchHandler" default="true">
- <!-- default values for query parameters -->
- <lst name="defaults">
- <str name="echoParams">explicit</str>
- <!--
- <int name="rows">10</int>
- <str name="fl">*</str>
- <str name="version">2.1</str>
- -->
- </lst>
- </requestHandler>
- <!-- Please refer to http://wiki.apache.org/solr/SolrReplication for details on configuring replication -->
- <!-- remove the <lst name="master"> section if this is just a slave -->
- <!-- remove the <lst name="slave"> section if this is just a master -->
- <!--
-<requestHandler name="/replication" class="solr.ReplicationHandler" >
- <lst name="master">
- <str name="replicateAfter">commit</str>
- <str name="replicateAfter">startup</str>
- <str name="confFiles">schema.xml,stopwords.txt</str>
- </lst>
- <lst name="slave">
- <str name="masterUrl">http://localhost:8983/solr/replication</str>
- <str name="pollInterval">00:00:60</str>
- </lst>
-</requestHandler>-->
- <!-- DisMaxRequestHandler allows easy searching across multiple fields
- for simple user-entered phrases. It's implementation is now
- just the standard SearchHandler with a default query type
- of "dismax".
- see http://wiki.apache.org/solr/DisMaxRequestHandler
- -->
- <requestHandler name="dismax" class="solr.SearchHandler">
- <lst name="defaults">
- <str name="defType">dismax</str>
- <str name="echoParams">explicit</str>
- <float name="tie">0.01</float>
- <str name="qf">
- text^0.5 features^1.0 name^1.2 sku^1.5 id^10.0 manu^1.1 cat^1.4
- </str>
- <str name="pf">
- text^0.2 features^1.1 name^1.5 manu^1.4 manu_exact^1.9
- </str>
- <str name="bf">
- popularity^0.5 recip(price,1,1000,1000)^0.3
- </str>
- <str name="fl">
- id,name,price,score
- </str>
- <str name="mm">
- 2&lt;-1 5&lt;-2 6&lt;90%
- </str>
- <int name="ps">100</int>
- <str name="q.alt">*:*</str>
- <!-- example highlighter config, enable per-query with hl=true -->
- <str name="hl.fl">text features name</str>
- <!-- for this field, we want no fragmenting, just highlighting -->
- <str name="f.name.hl.fragsize">0</str>
- <!-- instructs Solr to return the field itself if no query terms are
- found -->
- <str name="f.name.hl.alternateField">name</str>
- <str name="f.text.hl.fragmenter">regex</str>
- <!-- defined below -->
- </lst>
- </requestHandler>
- <!-- Note how you can register the same handler multiple times with
- different names (and different init parameters)
- -->
- <requestHandler name="partitioned" class="solr.SearchHandler">
- <lst name="defaults">
- <str name="defType">dismax</str>
- <str name="echoParams">explicit</str>
- <str name="qf">text^0.5 features^1.0 name^1.2 sku^1.5 id^10.0</str>
- <str name="mm">2&lt;-1 5&lt;-2 6&lt;90%</str>
- <!-- This is an example of using Date Math to specify a constantly
- moving date range in a config...
- -->
- <str name="bq">incubationdate_dt:[* TO NOW/DAY-1MONTH]^2.2</str>
- </lst>
- <!-- In addition to defaults, "appends" params can be specified
- to identify values which should be appended to the list of
- multi-val params from the query (or the existing "defaults").
-
- In this example, the param "fq=instock:true" will be appended to
- any query time fq params the user may specify, as a mechanism for
- partitioning the index, independent of any user selected filtering
- that may also be desired (perhaps as a result of faceted searching).
-
- NOTE: there is *absolutely* nothing a client can do to prevent these
- "appends" values from being used, so don't use this mechanism
- unless you are sure you always want it.
- -->
- <lst name="appends">
- <str name="fq">inStock:true</str>
- </lst>
- <!-- "invariants" are a way of letting the Solr maintainer lock down
- the options available to Solr clients. Any params values
- specified here are used regardless of what values may be specified
- in either the query, the "defaults", or the "appends" params.
-
- In this example, the facet.field and facet.query params are fixed,
- limiting the facets clients can use. Faceting is not turned on by
- default - but if the client does specify facet=true in the request,
- these are the only facets they will be able to see counts for;
- regardless of what other facet.field or facet.query params they
- may specify.
-
- NOTE: there is *absolutely* nothing a client can do to prevent these
- "invariants" values from being used, so don't use this mechanism
- unless you are sure you always want it.
- -->
- <lst name="invariants">
- <str name="facet.field">cat</str>
- <str name="facet.field">manu_exact</str>
- <str name="facet.query">price:[* TO 500]</str>
- <str name="facet.query">price:[500 TO *]</str>
- </lst>
- </requestHandler>
- <!--
- Search components are registered to SolrCore and used by Search Handlers
-
- By default, the following components are avaliable:
-
- <searchComponent name="query" class="org.apache.solr.handler.component.QueryComponent" />
- <searchComponent name="facet" class="org.apache.solr.handler.component.FacetComponent" />
- <searchComponent name="mlt" class="org.apache.solr.handler.component.MoreLikeThisComponent" />
- <searchComponent name="highlight" class="org.apache.solr.handler.component.HighlightComponent" />
- <searchComponent name="stats" class="org.apache.solr.handler.component.StatsComponent" />
- <searchComponent name="debug" class="org.apache.solr.handler.component.DebugComponent" />
-
- Default configuration in a requestHandler would look like:
- <arr name="components">
- <str>query</str>
- <str>facet</str>
- <str>mlt</str>
- <str>highlight</str>
- <str>stats</str>
- <str>debug</str>
- </arr>
-
- If you register a searchComponent to one of the standard names, that will be used instead.
- To insert components before or after the 'standard' components, use:
-
- <arr name="first-components">
- <str>myFirstComponentName</str>
- </arr>
-
- <arr name="last-components">
- <str>myLastComponentName</str>
- </arr>
- -->
- <!-- The spell check component can return a list of alternative spelling
- suggestions. -->
- <searchComponent name="spellcheck" class="solr.SpellCheckComponent">
- <str name="queryAnalyzerFieldType">textSpell</str>
- <lst name="spellchecker">
- <str name="name">default</str>
- <str name="field">name</str>
- <str name="spellcheckIndexDir">./spellchecker</str>
- </lst>
- <!-- a spellchecker that uses a different distance measure
- <lst name="spellchecker">
- <str name="name">jarowinkler</str>
- <str name="field">spell</str>
- <str name="distanceMeasure">org.apache.lucene.search.spell.JaroWinklerDistance</str>
- <str name="spellcheckIndexDir">./spellchecker2</str>
- </lst>
- -->
- <!-- a file based spell checker
- <lst name="spellchecker">
- <str name="classname">solr.FileBasedSpellChecker</str>
- <str name="name">file</str>
- <str name="sourceLocation">spellings.txt</str>
- <str name="characterEncoding">UTF-8</str>
- <str name="spellcheckIndexDir">./spellcheckerFile</str>
- </lst>
- -->
- </searchComponent>
- <!-- A request handler utilizing the spellcheck component.
- #############################################################################
- NOTE: This is purely as an example. The whole purpose of the
- SpellCheckComponent is to hook it into the request handler that handles (i.e.
- the standard or dismax SearchHandler) queries such that a separate request is
- not needed to get suggestions.
-
- IN OTHER WORDS, THERE IS REALLY GOOD CHANCE THE SETUP BELOW IS NOT WHAT YOU
- WANT FOR YOUR PRODUCTION SYSTEM!
- #############################################################################
- -->
- <requestHandler name="/spell" class="solr.SearchHandler" lazy="true">
- <lst name="defaults">
- <!-- omp = Only More Popular -->
- <str name="spellcheck.onlyMorePopular">false</str>
- <!-- exr = Extended Results -->
- <str name="spellcheck.extendedResults">false</str>
- <!-- The number of suggestions to return -->
- <str name="spellcheck.count">1</str>
- </lst>
- <arr name="last-components">
- <str>spellcheck</str>
- </arr>
- </requestHandler>
- <searchComponent name="tvComponent" class="org.apache.solr.handler.component.TermVectorComponent"/>
- <!-- A Req Handler for working with the tvComponent. This is purely as an example.
- You will likely want to add the component to your already specified request handlers. -->
- <requestHandler name="tvrh" class="org.apache.solr.handler.component.SearchHandler">
- <lst name="defaults">
- <bool name="tv">true</bool>
- </lst>
- <arr name="last-components">
- <str>tvComponent</str>
- </arr>
- </requestHandler>
- <!-- Clustering Component
- http://wiki.apache.org/solr/ClusteringComponent
- This relies on third party jars which are not included in the release.
- To use this component (and the "/clustering" handler)
- Those jars will need to be downloaded, and you'll need to set the
- solr.cluster.enabled system property when running solr...
- java -Dsolr.clustering.enabled=true -jar start.jar
- -->
- <searchComponent name="clusteringComponent" enable="${solr.clustering.enabled:false}" class="org.apache.solr.handler.clustering.ClusteringComponent">
- <!-- Declare an engine -->
- <lst name="engine">
- <!-- The name, only one can be named "default" -->
- <str name="name">default</str>
- <!--
- Class name of Carrot2 clustering algorithm. Currently available algorithms are:
-
- * org.carrot2.clustering.lingo.LingoClusteringAlgorithm
- * org.carrot2.clustering.stc.STCClusteringAlgorithm
-
- See http://project.carrot2.org/algorithms.html for the algorithm's characteristics.
- -->
- <str name="carrot.algorithm">org.carrot2.clustering.lingo.LingoClusteringAlgorithm</str>
- <!--
- Overriding values for Carrot2 default algorithm attributes. For a description
- of all available attributes, see: http://download.carrot2.org/stable/manual/#chapter.components.
- Use attribute key as name attribute of str elements below. These can be further
- overridden for individual requests by specifying attribute key as request
- parameter name and attribute value as parameter value.
- -->
- <str name="LingoClusteringAlgorithm.desiredClusterCountBase">20</str>
- </lst>
- <lst name="engine">
- <str name="name">stc</str>
- <str name="carrot.algorithm">org.carrot2.clustering.stc.STCClusteringAlgorithm</str>
- </lst>
- </searchComponent>
- <requestHandler name="/clustering" enable="${solr.clustering.enabled:false}" class="solr.SearchHandler">
- <lst name="defaults">
- <bool name="clustering">true</bool>
- <str name="clustering.engine">default</str>
- <bool name="clustering.results">true</bool>
- <!-- The title field -->
- <str name="carrot.title">name</str>
- <str name="carrot.url">id</str>
- <!-- The field to cluster on -->
- <str name="carrot.snippet">features</str>
- <!-- produce summaries -->
- <bool name="carrot.produceSummary">true</bool>
- <!-- the maximum number of labels per cluster -->
- <!--<int name="carrot.numDescriptions">5</int>-->
- <!-- produce sub clusters -->
- <bool name="carrot.outputSubClusters">false</bool>
- </lst>
- <arr name="last-components">
- <str>clusteringComponent</str>
- </arr>
- </requestHandler>
- <!-- Solr Cell: http://wiki.apache.org/solr/ExtractingRequestHandler -->
- <requestHandler name="/update/extract" class="org.apache.solr.handler.extraction.ExtractingRequestHandler" startup="lazy">
- <lst name="defaults">
- <!-- All the main content goes into "text"... if you need to return
- the extracted text or do highlighting, use a stored field. -->
- <str name="fmap.content">text</str>
- <str name="lowernames">true</str>
- <str name="uprefix">ignored_</str>
- <!-- capture link hrefs but ignore div attributes -->
- <str name="captureAttr">true</str>
- <str name="fmap.a">links</str>
- <str name="fmap.div">ignored_</str>
- </lst>
- </requestHandler>
- <!-- A component to return terms and document frequency of those terms.
- This component does not yet support distributed search. -->
- <searchComponent name="termsComponent" class="org.apache.solr.handler.component.TermsComponent"/>
- <requestHandler name="/terms" class="org.apache.solr.handler.component.SearchHandler">
- <lst name="defaults">
- <bool name="terms">true</bool>
- </lst>
- <arr name="components">
- <str>termsComponent</str>
- </arr>
- </requestHandler>
- <!-- a search component that enables you to configure the top results for
- a given query regardless of the normal lucene scoring.-->
- <searchComponent name="elevator" class="solr.QueryElevationComponent">
- <!-- pick a fieldType to analyze queries -->
- <str name="queryFieldType">string</str>
- <str name="config-file">elevate.xml</str>
- </searchComponent>
- <!-- a request handler utilizing the elevator component -->
- <requestHandler name="/elevate" class="solr.SearchHandler" startup="lazy">
- <lst name="defaults">
- <str name="echoParams">explicit</str>
- </lst>
- <arr name="last-components">
- <str>elevator</str>
- </arr>
- </requestHandler>
- <!-- Update request handler.
-
- Note: Since solr1.1 requestHandlers requires a valid content type header if posted in
- the body. For example, curl now requires: -H 'Content-type:text/xml; charset=utf-8'
- The response format differs from solr1.1 formatting and returns a standard error code.
- To enable solr1.1 behavior, remove the /update handler or change its path
- -->
- <requestHandler name="/update" class="solr.XmlUpdateRequestHandler"/>
- <requestHandler name="/update/javabin" class="solr.BinaryUpdateRequestHandler"/>
- <!--
- Analysis request handler. Since Solr 1.3. Use to return how a document is analyzed. Useful
- for debugging and as a token server for other types of applications.
-
- This is deprecated in favor of the improved DocumentAnalysisRequestHandler and FieldAnalysisRequestHandler
-
- <requestHandler name="/analysis" class="solr.AnalysisRequestHandler" />
- -->
- <!--
- An analysis handler that provides a breakdown of the analysis process of provided docuemnts. This handler expects a
- (single) content stream with the following format:
-
- <docs>
- <doc>
- <field name="id">1</field>
- <field name="name">The Name</field>
- <field name="text">The Text Value</field>
- <doc>
- <doc>...</doc>
- <doc>...</doc>
- ...
- </docs>
-
- Note: Each document must contain a field which serves as the unique key. This key is used in the returned
- response to assoicate an analysis breakdown to the analyzed document.
-
- Like the FieldAnalysisRequestHandler, this handler also supports query analysis by
- sending either an "analysis.query" or "q" request paraemter that holds the query text to be analyized. It also
- supports the "analysis.showmatch" parameter which when set to true, all field tokens that match the query
- tokens will be marked as a "match".
- -->
- <requestHandler name="/analysis/document" class="solr.DocumentAnalysisRequestHandler"/>
- <!--
- RequestHandler that provides much the same functionality as analysis.jsp. Provides the ability
- to specify multiple field types and field names in the same request and outputs index-time and
- query-time analysis for each of them.
-
- Request parameters are:
- analysis.fieldname - The field name whose analyzers are to be used
- analysis.fieldtype - The field type whose analyzers are to be used
- analysis.fieldvalue - The text for index-time analysis
- q (or analysis.q) - The text for query time analysis
- analysis.showmatch (true|false) - When set to true and when query analysis is performed, the produced
- tokens of the field value analysis will be marked as "matched" for every
- token that is produces by the query analysis
- -->
- <requestHandler name="/analysis/field" class="solr.FieldAnalysisRequestHandler"/>
- <!-- CSV update handler, loaded on demand -->
- <requestHandler name="/update/csv" class="solr.CSVRequestHandler" startup="lazy"/>
- <!--
- Admin Handlers - This will register all the standard admin RequestHandlers. Adding
- this single handler is equivalent to registering:
-
- <requestHandler name="/admin/luke" class="org.apache.solr.handler.admin.LukeRequestHandler" />
- <requestHandler name="/admin/system" class="org.apache.solr.handler.admin.SystemInfoHandler" />
- <requestHandler name="/admin/plugins" class="org.apache.solr.handler.admin.PluginInfoHandler" />
- <requestHandler name="/admin/threads" class="org.apache.solr.handler.admin.ThreadDumpHandler" />
- <requestHandler name="/admin/properties" class="org.apache.solr.handler.admin.PropertiesRequestHandler" />
- <requestHandler name="/admin/file" class="org.apache.solr.handler.admin.ShowFileRequestHandler" >
-
- If you wish to hide files under ${solr.home}/conf, explicitly register the ShowFileRequestHandler using:
- <requestHandler name="/admin/file" class="org.apache.solr.handler.admin.ShowFileRequestHandler" >
- <lst name="invariants">
- <str name="hidden">synonyms.txt</str>
- <str name="hidden">anotherfile.txt</str>
- </lst>
- </requestHandler>
- -->
- <requestHandler name="/admin/" class="org.apache.solr.handler.admin.AdminHandlers"/>
- <!-- ping/healthcheck -->
- <requestHandler name="/admin/ping" class="PingRequestHandler">
- <lst name="defaults">
- <str name="qt">standard</str>
- <str name="q">solrpingquery</str>
- <str name="echoParams">all</str>
- </lst>
- </requestHandler>
- <!-- Echo the request contents back to the client -->
- <requestHandler name="/debug/dump" class="solr.DumpRequestHandler">
- <lst name="defaults">
- <str name="echoParams">explicit</str>
- <!-- for all params (including the default etc) use: 'all' -->
- <str name="echoHandler">true</str>
- </lst>
- </requestHandler>
- <highlighting>
- <!-- Configure the standard fragmenter -->
- <!-- This could most likely be commented out in the "default" case -->
- <fragmenter name="gap" class="org.apache.solr.highlight.GapFragmenter" default="true">
- <lst name="defaults">
- <int name="hl.fragsize">100</int>
- </lst>
- </fragmenter>
- <!-- A regular-expression-based fragmenter (f.i., for sentence extraction) -->
- <fragmenter name="regex" class="org.apache.solr.highlight.RegexFragmenter">
- <lst name="defaults">
- <!-- slightly smaller fragsizes work better because of slop -->
- <int name="hl.fragsize">70</int>
- <!-- allow 50% slop on fragment sizes -->
- <float name="hl.regex.slop">0.5</float>
- <!-- a basic sentence pattern -->
- <str name="hl.regex.pattern">[-\w ,/\n\"']{20,200}</str>
- </lst>
- </fragmenter>
- <!-- Configure the standard formatter -->
- <formatter name="html" class="org.apache.solr.highlight.HtmlFormatter" default="true">
- <lst name="defaults">
- <str name="hl.simple.pre"><![CDATA[<em>]]></str>
- <str name="hl.simple.post"><![CDATA[</em>]]></str>
- </lst>
- </formatter>
- </highlighting>
- <!-- An example dedup update processor that creates the "id" field on the fly
- based on the hash code of some other fields. This example has overwriteDupes
- set to false since we are using the id field as the signatureField and Solr
- will maintain uniqueness based on that anyway.
-
- You have to link the chain to an update handler above to use it ie:
- <requestHandler name="/update "class="solr.XmlUpdateRequestHandler">
- <lst name="defaults">
- <str name="update.processor">dedupe</str>
- </lst>
- </requestHandler>
- -->
- <!--
- <updateRequestProcessorChain name="dedupe">
- <processor class="org.apache.solr.update.processor.SignatureUpdateProcessorFactory">
- <bool name="enabled">true</bool>
- <str name="signatureField">id</str>
- <bool name="overwriteDupes">false</bool>
- <str name="fields">name,features,cat</str>
- <str name="signatureClass">org.apache.solr.update.processor.Lookup3Signature</str>
- </processor>
- <processor class="solr.LogUpdateProcessorFactory" />
- <processor class="solr.RunUpdateProcessorFactory" />
- </updateRequestProcessorChain>
- -->
- <!-- queryResponseWriter plugins... query responses will be written using the
- writer specified by the 'wt' request parameter matching the name of a registered
- writer.
- The "default" writer is the default and will be used if 'wt' is not specified
- in the request. XMLResponseWriter will be used if nothing is specified here.
- The json, python, and ruby writers are also available by default.
-
- <queryResponseWriter name="xml" class="org.apache.solr.request.XMLResponseWriter" default="true"/>
- <queryResponseWriter name="json" class="org.apache.solr.request.JSONResponseWriter"/>
- <queryResponseWriter name="python" class="org.apache.solr.request.PythonResponseWriter"/>
- <queryResponseWriter name="ruby" class="org.apache.solr.request.RubyResponseWriter"/>
- <queryResponseWriter name="php" class="org.apache.solr.request.PHPResponseWriter"/>
- <queryResponseWriter name="phps" class="org.apache.solr.request.PHPSerializedResponseWriter"/>
-
- <queryResponseWriter name="custom" class="com.example.MyResponseWriter"/>
- -->
- <!-- XSLT response writer transforms the XML output by any xslt file found
- in Solr's conf/xslt directory. Changes to xslt files are checked for
- every xsltCacheLifetimeSeconds.
- -->
- <queryResponseWriter name="xslt" class="org.apache.solr.request.XSLTResponseWriter">
- <int name="xsltCacheLifetimeSeconds">5</int>
- </queryResponseWriter>
- <!-- example of registering a query parser
- <queryParser name="lucene" class="org.apache.solr.search.LuceneQParserPlugin"/>
- -->
- <!-- example of registering a custom function parser
- <valueSourceParser name="myfunc" class="com.mycompany.MyValueSourceParser" />
- -->
- <!-- config for the admin interface -->
- <admin>
- <defaultQuery>solr</defaultQuery>
- <!-- configure a healthcheck file for servers behind a loadbalancer
- <healthcheck type="file">server-enabled</healthcheck>
- -->
- </admin>
- <requestHandler class="solr.MoreLikeThisHandler" name="/mlt">
- <lst name="defaults">
- <str name="mlt.mintf">1</str>
- <str name="mlt.mindf">2</str>
- </lst>
- </requestHandler>
-</config>
2  lib/conf/spellings.txt
View
@@ -1,2 +0,0 @@
-pizza
-history
58 lib/conf/stopwords.txt
View
@@ -1,58 +0,0 @@
-# Licensed to the Apache Software Foundation (ASF) under one or more
-# contributor license agreements. See the NOTICE file distributed with
-# this work for additional information regarding copyright ownership.
-# The ASF licenses this file to You under the Apache License, Version 2.0
-# (the "License"); you may not use this file except in compliance with
-# the License. You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-#-----------------------------------------------------------------------
-# a couple of test stopwords to test that the words are really being
-# configured from this file:
-stopworda
-stopwordb
-
-#Standard english stop words taken from Lucene's StopAnalyzer
-a
-an
-and
-are
-as
-at
-be
-but
-by
-for
-if
-in
-into
-is
-it
-no
-not
-of
-on
-or
-s
-such
-t
-that
-the
-their
-then
-there
-these
-they
-this
-to
-was
-will
-with
-
31 lib/conf/synonyms.txt
View
@@ -1,31 +0,0 @@
-# The ASF licenses this file to You under the Apache License, Version 2.0
-# (the "License"); you may not use this file except in compliance with
-# the License. You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-
-#-----------------------------------------------------------------------
-#some test synonym mappings unlikely to appear in real input text
-aaa => aaaa
-bbb => bbbb1 bbbb2
-ccc => cccc1,cccc2
-a\=>a => b\=>b
-a\,a => b\,b
-fooaaa,baraaa,bazaaa
-
-# Some synonym groups specific to this example
-GB,gib,gigabyte,gigabytes
-MB,mib,megabyte,megabytes
-Television, Televisions, TV, TVs
-#notice we use "gib" instead of "GiB" so any WordDelimiterFilter coming
-#after us won't split it into two words.
-
-# Synonym mappings can be used for spelling correction too
-pixima => pixma
-
132 lib/conf/xslt/example.xsl
View
@@ -1,132 +0,0 @@
-<?xml version='1.0' encoding='UTF-8'?>
-
-<!--
- * Licensed to the Apache Software Foundation (ASF) under one or more
- * contributor license agreements. See the NOTICE file distributed with
- * this work for additional information regarding copyright ownership.
- * The ASF licenses this file to You under the Apache License, Version 2.0
- * (the "License"); you may not use this file except in compliance with
- * the License. You may obtain a copy of the License at
- *
- * http://www.apache.org/licenses/LICENSE-2.0
- *
- * Unless required by applicable law or agreed to in writing, software
- * distributed under the License is distributed on an "AS IS" BASIS,
- * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
- * See the License for the specific language governing permissions and
- * limitations under the License.
- -->
-
-<!--
- Simple transform of Solr query results to HTML
- -->
-<xsl:stylesheet version='1.0'
- xmlns:xsl='http://www.w3.org/1999/XSL/Transform'
->
-
- <xsl:output media-type="text/html; charset=UTF-8" encoding="UTF-8"/>
-
- <xsl:variable name="title" select="concat('Solr search results (',response/result/@numFound,' documents)')"/>
-
- <xsl:template match='/'>
- <html>
- <head>
- <title><xsl:value-of select="$title"/></title>
- <xsl:call-template name="css"/>
- </head>
- <body>
- <h1><xsl:value-of select="$title"/></h1>
- <div class="note">
- This has been formatted by the sample "example.xsl" transform -
- use your own XSLT to get a nicer page
- </div>
- <xsl:apply-templates select="response/result/doc"/>
- </body>
- </html>
- </xsl:template>
-
- <xsl:template match="doc">
- <xsl:variable name="pos" select="position()"/>
- <div class="doc">
- <table width="100%">
- <xsl:apply-templates>
- <xsl:with-param name="pos"><xsl:value-of select="$pos"/></xsl:with-param>
- </xsl:apply-templates>
- </table>
- </div>
- </xsl:template>
-
- <xsl:template match="doc/*[@name='score']" priority="100">
- <xsl:param name="pos"></xsl:param>
- <tr>
- <td class="name">
- <xsl:value-of select="@name"/>
- </td>
- <td class="value">
- <xsl:value-of select="."/>
-
- <xsl:if test="boolean(//lst[@name='explain'])">
- <xsl:element name="a">
- <!-- can't allow whitespace here -->
- <xsl:attribute name="href">javascript:toggle("<xsl:value-of select="concat('exp-',$pos)" />");</xsl:attribute>?</xsl:element>
- <br/>
- <xsl:element name="div">
- <xsl:attribute name="class">exp</xsl:attribute>
- <xsl:attribute name="id">
- <xsl:value-of select="concat('exp-',$pos)" />
- </xsl:attribute>
- <xsl:value-of select="//lst[@name='explain']/str[position()=$pos]"/>
- </xsl:element>
- </xsl:if>
- </td>
- </tr>
- </xsl:template>
-
- <xsl:template match="doc/arr" priority="100">
- <tr>
- <td class="name">
- <xsl:value-of select="@name"/>
- </td>
- <td class="value">
- <ul>
- <xsl:for-each select="*">
- <li><xsl:value-of select="."/></li>
- </xsl:for-each>
- </ul>
- </td>
- </tr>
- </xsl:template>
-
-
- <xsl:template match="doc/*">
- <tr>
- <td class="name">
- <xsl:value-of select="@name"/>
- </td>
- <td class="value">
- <xsl:value-of select="."/>
- </td>
- </tr>
- </xsl:template>
-
- <xsl:template match="*"/>
-
- <xsl:template name="css">
- <script>
- function toggle(id) {
- var obj = document.getElementById(id);
- obj.style.display = (obj.style.display != 'block') ? 'block' : 'none';
- }
- </script>
- <style type="text/css">
- body { font-family: "Lucida Grande", sans-serif }
- td.name { font-style: italic; font-size:80%; }
- td { vertical-align: top; }
- ul { margin: 0px; margin-left: 1em; padding: 0px; }
- .note { font-size:80%; }
- .doc { margin-top: 1em; border-top: solid grey 1px; }
- .exp { display: none; font-family: monospace; white-space: pre; }
- </style>
- </xsl:template>
-
-</xsl:stylesheet>
67 lib/conf/xslt/example_atom.xsl
View
@@ -1,67 +0,0 @@
-<?xml version='1.0' encoding='UTF-8'?>
-
-<!--
- * Licensed to the Apache Software Foundation (ASF) under one or more
- * contributor license agreements. See the NOTICE file distributed with
- * this work for additional information regarding copyright ownership.
- * The ASF licenses this file to You under the Apache License, Version 2.0
- * (the "License"); you may not use this file except in compliance with
- * the License. You may obtain a copy of the License at
- *
- * http://www.apache.org/licenses/LICENSE-2.0
- *
- * Unless required by applicable law or agreed to in writing, software
- * distributed under the License is distributed on an "AS IS" BASIS,
- * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
- * See the License for the specific language governing permissions and
- * limitations under the License.
- -->
-
-<!--
- Simple transform of Solr query results to Atom
- -->
-
-<xsl:stylesheet version='1.0'
- xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>
-
- <xsl:output
- method="xml"
- encoding="utf-8"
- media-type="text/xml; charset=UTF-8"
- />
-
- <xsl:template match='/'>
- <xsl:variable name="query" select="response/lst[@name='responseHeader']/lst[@name='params']/str[@name='q']"/>
- <feed xmlns="http://www.w3.org/2005/Atom">
- <title>Example Solr Atom 1.0 Feed</title>
- <subtitle>
- This has been formatted by the sample "example_atom.xsl" transform -
- use your own XSLT to get a nicer Atom feed.
- </subtitle>
- <author>
- <name>Apache Solr</name>
- <email>solr-user@lucene.apache.org</email>
- </author>
- <link rel="self" type="application/atom+xml"
- href="http://localhost:8983/solr/q={$query}&amp;wt=xslt&amp;tr=atom.xsl"/>
- <updated>
- <xsl:value-of select="response/result/doc[position()=1]/date[@name='timestamp']"/>
- </updated>
- <id>tag:localhost,2007:example</id>
- <xsl:apply-templates select="response/result/doc"/>
- </feed>
- </xsl:template>
-
- <!-- search results xslt -->
- <