claritee/dedupe

Dedupe

This program deduplicates and sorts a large word file.
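As a rough illustration of what "deduplicate and sort" means here, the sketch below collects lines into a set and returns them in ascending byte order. It is not taken from this gem's source: the method name `unique_sorted_lines` is hypothetical, and the approach keeps every unique line in memory, which may not suit very large inputs.

```ruby
require "set"

# Hypothetical helper, not part of this gem.
# Accepts any enumerable of lines and returns the unique lines,
# without trailing newlines, sorted in ascending byte order.
def unique_sorted_lines(lines)
  seen = Set.new
  lines.each { |line| seen << line.chomp } # a Set ignores repeated entries
  seen.to_a.sort
end
```

With a file on disk, it could be called as `unique_sorted_lines(File.foreach(path))`, where `path` is the input file.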

Usage

Install Ruby

First, install Ruby. Version 2.1 or later is supported.

See https://www.ruby-lang.org/en/installation/ for installation instructions.

Then check the version of Ruby via:

 ruby -v

Install Bundler

Bundler downloads the Ruby dependencies that a project declares.

Follow the getting started guide at http://bundler.io/, or install it directly:

  gem install bundler

Run the project

Install dependencies

  bundle install

Build the gem

  rake install

Run the program

  bin/run_program.rb $input_file
  # e.g.
  bin/run_program.rb spec/data/KW_File.csv

Run the tests (located in the spec directory)

  rspec

To generate a reference result to compare the program's output against (on a Unix-like OS)

  sort -u $input_file > $output_file
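Where `sort` is not available, the same property can be checked in Ruby. A file is sorted and free of duplicates exactly when each line is strictly greater than the one before it. The sketch below assumes bytewise ordering (what `LC_ALL=C sort -u` produces), which may differ from the order `sort -u` gives under other locales. The method name `sorted_and_unique?` is hypothetical and not part of this gem.

```ruby
# Hypothetical check, not part of this gem.
# Returns true when every line is strictly greater (bytewise) than the
# previous one, i.e. the lines are sorted and contain no duplicates.
def sorted_and_unique?(lines)
  lines.map(&:chomp).each_cons(2).all? { |prev, curr| prev < curr }
end
```

For a saved result, it could be called as `sorted_and_unique?(File.foreach(output_path))`, where `output_path` is wherever the output was written.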
