Skip to content

luzfcb/wbrazilian-without-non-ascii-words

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

wbrazilian-without-non-ascii-words

wbrazilian without non-ascii words

Brazilian Portuguese wordlist This contains part of the wbrazilian wordlist, but with all non-ascii words removed

generated from https://packages.debian.org/en/sid/wbrazilian

luzfcb@luzfcb:~$ sudo apt-get install wbrazilian
luzfcb@luzfcb:~$ # count words
luzfcb@luzfcb:~$ wc -l /usr/share/dict/brazilian
275502 /usr/share/dict/brazilian
luzfcb@luzfcb:~$ # strip non-ascii words
luzfcb@luzfcb:~$ perl -nle 'print if m{^[[:ascii:]]+$}' /usr/share/dict/brazilian > wbrazilian-without-non-ascii-words.txt
luzfcb@luzfcb:~$ # count words
luzfcb@luzfcb:~$ wc -l wbrazilian-without-non-ascii-words.txt 
204448 wbrazilian-without-non-ascii-words.txt

LICENCE: GNU GENERAL PUBLIC LICENSE v2

About

wbrazilian without non-ascii words

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published