julieweeds/code

code

code for DisCo project

stripcorpus.py takes the reuters21578 data files and writes one file per document with the SGML removed. Each file is named according to the document's topic, and its contents are taken from the title and body fields. Documents are written to train and test directories according to the ModApte split.
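
The processing described above can be sketched roughly as follows. This is a minimal illustration, not the contents of stripcorpus.py: the function names (`parse_documents`, `write_documents`), the output naming scheme `<topic>_<NEWID>.txt`, and the choice to use the first listed topic and skip documents without one are all assumptions made for the example. The ModApte rule used here is the one given in the Reuters-21578 distribution README: `TOPICS="YES"` with `LEWISSPLIT="TRAIN"` for training and `LEWISSPLIT="TEST"` for testing.

```python
import html
import re
from pathlib import Path

# Patterns for the parts of a Reuters-21578 record this sketch needs.
DOC_RE = re.compile(r"<REUTERS\b([^>]*)>(.*?)</REUTERS>", re.DOTALL)
ATTR_RE = re.compile(r'(\w+)="([^"]*)"')
TOPICS_RE = re.compile(r"<TOPICS>(.*?)</TOPICS>", re.DOTALL)
D_RE = re.compile(r"<D>(.*?)</D>")
TITLE_RE = re.compile(r"<TITLE>(.*?)</TITLE>", re.DOTALL)
BODY_RE = re.compile(r"<BODY>(.*?)</BODY>", re.DOTALL)

# ModApte split: LEWISSPLIT value -> output directory name.
SPLIT_DIRS = {"TRAIN": "train", "TEST": "test"}


def field(pattern, text):
    """Return the first match of pattern with entities decoded, or ''."""
    match = pattern.search(text)
    return html.unescape(match.group(1)).strip() if match else ""


def parse_documents(sgml_text):
    """Yield one dict per document that belongs to the ModApte split."""
    for attr_text, content in DOC_RE.findall(sgml_text):
        attrs = dict(ATTR_RE.findall(attr_text))
        split = SPLIT_DIRS.get(attrs.get("LEWISSPLIT"))
        if attrs.get("TOPICS") != "YES" or split is None:
            continue  # not part of the ModApte train or test set
        topics_block = TOPICS_RE.search(content)
        topics = D_RE.findall(topics_block.group(1)) if topics_block else []
        if not topics:
            continue  # assumption: a file cannot be named without a topic
        yield {
            "newid": attrs.get("NEWID", ""),
            "split": split,
            "topics": topics,
            "title": field(TITLE_RE, content),
            "body": field(BODY_RE, content),
        }


def write_documents(sgml_text, out_dir):
    """Write each document to out_dir/<split>/<topic>_<newid>.txt."""
    out_dir = Path(out_dir)
    written = 0
    for doc in parse_documents(sgml_text):
        target_dir = out_dir / doc["split"]
        target_dir.mkdir(parents=True, exist_ok=True)
        # Hypothetical naming scheme: first topic plus the NEWID attribute.
        name = f"{doc['topics'][0]}_{doc['newid']}.txt"
        text = doc["title"] + "\n\n" + doc["body"]
        (target_dir / name).write_text(text, encoding="utf-8")
        written += 1
    return written
```

To run this over the distribution, each `reut2-*.sgm` file would be read into a string (reading with `encoding="latin-1"` is an assumption here, chosen because it cannot fail on any byte) and passed to `write_documents` together with an output directory.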
