Skip to content

coderganesh/tamil-sentence-tokenizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

tamil-sentence-tokenizer

A sentence tokenizer NLP tool for the Tamil language

Sentence Tokenizer

Command-line utility to perform sentence tokenization on a given Tamil corpus text file.

Usage

python sentence_tokenizer.py <input_file>

For help

python sentence_tokenizer.py -h

Features

  1. No preprocessing needed
  2. Works on any OS which supports Python 3
  3. Handles input file of any size

About

A sentence tokenizer NLP tool for the Tamil language

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages