Skip to content

dakrone/corpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

# corpus

A tool used to train detokenization libraries

## Usage

(use 'corpus.core)
(def w (corpus-file "data/alice-in-wonderland.txt"))
(count w)

## License

Copyright (C) 2010 Lee Hinman

Distributed under the Eclipse Public License, the same as Clojure.

About

a tool used to train a detokenization library

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages