Example of fulltext search using sample data from wikileaks for Iowa Ruby presentation 2012-10-16
Ruby JavaScript CoffeeScript
Switch branches/tags
Nothing to show
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
app
config
db
doc
lib
log
public
script
solr/conf
test
vendor
.gitignore
.rvmrc
Gemfile
Gemfile.lock
README.markdown
Rakefile
Solr.key
WikiLeaks_uncensored_US_diplomatic_cables_(cables.csv).6644050.TPB.torrent
config.ru

README.markdown

Searching With Solr

Resources:

http://www.solrtutorial.com

http://sunspot.github.com

Wikileaks cables:

Use included torrent file WikiLeaks_uncensored_US_diplomatic_cables_(cables.csv).6644050.TPB.torrent

Note that the cables.csv file it downloads is over 1.6 GB, so will take a while to download and also to import (if you choose to import all of it)

It can be parsed with the db/import_cables.rb script; adjust the path within it to your local cables.csv file, then open a rails console and use load "db/import_cables.rb"

Synonyms

Synonyms are not enabled out of the box! Edit solr/conf/schema.xml as reflected below:

    <fieldType name="text" class="solr.TextField" omitNorms="false">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.StandardFilterFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <!-- *** ADD THE BELOW FOR SYNONYMS TO WORK -->
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true" />
      </analyzer>
    </fieldType>