Skip to content

Document classification based on certain topics

Notifications You must be signed in to change notification settings

0x0584/classify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

  • This is a project that I did for the main reason, which is fun of course, why would you code anyway?
  • And the other reason is to learn how to code in Perl. I have always found that Perl is weird yet very interesting language with many useful built-in functionalities.

indeed, i was looking recently on many machine-learning related topics, such text mining, so here’s where the idea of classify came from.

Up and Running..

# in case of debug
$ ./classify.pl -i docs/* -d dicts/* -t topics/* -g -s 0
  
# normal case 
$ ./classify.pl foo.txt -t top0.txt top1.txt

Stuff TODO [2/5]

  • [X] parse words better
  • [X] compare words better
  • [ ] export statistics as
    • [ ] txt
    • [ ] pdf with charts
    • [ ] html, maybe?
  • [ ] find nice resources (dictionaries and topics)
  • [ ] generate docs

About

Document classification based on certain topics

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages