Skip to content

List of hebrew stop words + script that computed them

Notifications You must be signed in to change notification settings

gidim/HebrewStopWords

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

HebrewStopWords

This is a list of the 500 most common words (stop words) computed from discussions from the Tapuz People website, on a variety of subjects.

Original corpora contained 1,397,173 tokes.

Tokens containing English characters or digits were removed from the lists.

heb_stopwords.txt - list of stopwords

heb_stopwords_counts.txt - list of stopwords + counts

About

List of hebrew stop words + script that computed them

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages