Contains stop words lists and methods for extracting keywords from strings

Vidibus::Words

This gem provides utilities for working with words. It ships with stop word lists for English, German, and Spanish and extracts keywords from a string.

This gem is part of Vidibus, an open source toolset for building distributed (video) applications.

Installation

Add gem 'vidibus-words' to your Gemfile, then run bundle install in your console.

Usage

Extracting keywords

To return a list of keywords from a given text, ordered by occurrence, enter:

input = Vidibus::Words.new('To tell a long story short, it\'s necessary to tell it briefly without fluff!')
input.keywords
 => ["tell", "long", "story", "short", "necessary", "briefly", "fluff"]
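
For readers curious about what such an extraction involves, here is a minimal sketch in plain Ruby. It is not the gem's implementation: SAMPLE_STOPWORDS and sketch_keywords are hypothetical names, the stop word list is a tiny stand-in, and the tie-breaking rule (first appearance) is an assumption.

```ruby
# Hypothetical stand-in for a stop word list; the gem ships its own, longer lists.
SAMPLE_STOPWORDS = %w[a it it's to without].freeze

# Returns the unique non-stop words of +text+, most frequent first.
# Words with equal counts keep their order of first appearance (an assumption).
def sketch_keywords(text, stopwords = SAMPLE_STOPWORDS)
  # Split into lowercase words, keeping inner apostrophes ("it's").
  words = text.downcase.scan(/[[:alpha:]]+(?:'[[:alpha:]]+)*/)

  # Count every word that is not a stop word.
  counts = Hash.new(0)
  words.each { |word| counts[word] += 1 unless stopwords.include?(word) }

  # Sort by descending count; the index keeps the sort stable.
  counts.keys.each_with_index
        .sort_by { |word, index| [-counts[word], index] }
        .map(&:first)
end

text = "To tell a long story short, it's necessary to tell it briefly without fluff!"
p sketch_keywords(text)
```

With this sample list, "tell" comes first because it is the only remaining word that occurs twice.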

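To filter against the stop words of a single locale only, set that locale. In the following example only German stop words are removed, so the English ones remain in the result: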

input = Vidibus::Words.new('To tell a long story short, it\'s necessary to tell it briefly without fluff!')
input.locale = :de
input.keywords
 => ["to", "tell", "a", "long", "story", "short", "it's", "necessary", "it", "briefly", "without", "fluff"] 
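
The effect of the locale setting can be illustrated with a small sketch in plain Ruby. Again, this is not the gem's code: SAMPLE_LISTS and sketch_filter are hypothetical names, and the per-locale lists are shortened placeholders.

```ruby
# Hypothetical, heavily shortened stop word lists keyed by locale.
SAMPLE_LISTS = {
  en: %w[a it it's to without],
  de: %w[der die das und ohne]
}.freeze

# Returns the unique words of +text+ in order of first appearance,
# minus the stop words of the given locales (all locales if none is given).
def sketch_filter(text, *locales)
  lists = locales.empty? ? SAMPLE_LISTS.values : SAMPLE_LISTS.values_at(*locales)
  stopwords = lists.compact.flatten # unknown locales contribute nothing
  text.downcase.scan(/[[:alpha:]]+(?:'[[:alpha:]]+)*/).uniq - stopwords
end

sentence = "To tell a long story short, it's necessary to tell it briefly without fluff!"
p sketch_filter(sentence, :de) # English stop words survive
p sketch_filter(sentence, :en) # English stop words are removed
```

Because the sample sentence contains no German stop words, filtering with :de leaves every word in place.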

Stopwords lists

To obtain the stop word lists, call:

Vidibus::Words.stopwords      # => Stopwords for all available locales
Vidibus::Words.stopwords(:en) # => English stopwords only

Copyright

© 2010-2012 Andre Pankratz. See LICENSE for details.