Lists of not-suitable-for-work words as YARA rules
Switch branches/tags
Clone or download
rferreira Merge pull request #4 from uvasoftware/raf/38
EN Refactoring + CircleCi
Latest commit 19108dd Aug 27, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.circleci Adding circleci support Aug 26, 2018
src EN Refactoring Aug 25, 2018
.editorconfig first commit Jun 12, 2016
.gitignore Add readme Jun 12, 2016
LICENSE Initial commit May 31, 2016
Makefile Adding circleci support Aug 26, 2018
README.md EN Refactoring Aug 25, 2018

README.md

List of NSFW (not suitable for work) language in YARA(http://virustotal.github.io/yara/) pattern-matching format

based upon work from https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words

this database powers the nsfw language feature of the https://scanii.com content analysis service

Compiling the rules

$ make
 mkdir -p dist
 yarac src/cs-language-nsfw.yara dist/cs-language-nsfw.yara.db; yarac src/da-language-nsfw.yara dist/da-language-nsfw.yara.db; yarac src/de-language-nsfw.yara dist/de-language-nsfw.yara.db; yarac src/en-language-nsfw.yara dist/en-language-nsfw.yara.db; yarac src/eo-language-nsfw.yara dist/eo-language-nsfw.yara.db; yarac src/es-language-nsfw.yara dist/es-language-nsfw.yara.db; yarac src/fi-language-nsfw.yara dist/fi-language-nsfw.yara.db; yarac src/fr-language-nsfw.yara dist/fr-language-nsfw.yara.db; yarac src/hi-language-nsfw.yara dist/hi-language-nsfw.yara.db; yarac src/hu-language-nsfw.yara dist/hu-language-nsfw.yara.db; yarac src/it-language-nsfw.yara dist/it-language-nsfw.yara.db; yarac src/ja-language-nsfw.yara dist/ja-language-nsfw.yara.db; yarac src/ko-language-nsfw.yara dist/ko-language-nsfw.yara.db; yarac src/nl-language-nsfw.yara dist/nl-language-nsfw.yara.db; yarac src/no-language-nsfw.yara dist/no-language-nsfw.yara.db; yarac src/pl-language-nsfw.yara dist/pl-language-nsfw.yara.db; yarac src/pt-language-nsfw.yara dist/pt-language-nsfw.yara.db; yarac src/ru-language-nsfw.yara dist/ru-language-nsfw.yara.db; yarac src/sv-language-nsfw.yara dist/sv-language-nsfw.yara.db; yarac src/th-language-nsfw.yara dist/th-language-nsfw.yara.db; yarac src/tlh-language-nsfw.yara dist/tlh-language-nsfw.yara.db; yarac src/tr-language-nsfw.yara dist/tr-language-nsfw.yara.db; yarac src/zh-language-nsfw.yara dist/zh-language-nsfw.yara.db;

Creating single bundle rule

$ make bundle
mkdir -p dist
yarac src/cs-language-nsfw.yara dist/cs-language-nsfw.yara.db; yarac src/da-language-nsfw.yara dist/da-language-nsfw.yara.db; yarac src/de-language-nsfw.yara dist/de-language-nsfw.yara.db; yarac src/en-language-nsfw.yara dist/en-language-nsfw.yara.db; yarac src/eo-language-nsfw.yara dist/eo-language-nsfw.yara.db; yarac src/es-language-nsfw.yara dist/es-language-nsfw.yara.db; yarac src/fi-language-nsfw.yara dist/fi-language-nsfw.yara.db; yarac src/fr-language-nsfw.yara dist/fr-language-nsfw.yara.db; yarac src/hi-language-nsfw.yara dist/hi-language-nsfw.yara.db; yarac src/hu-language-nsfw.yara dist/hu-language-nsfw.yara.db; yarac src/it-language-nsfw.yara dist/it-language-nsfw.yara.db; yarac src/ja-language-nsfw.yara dist/ja-language-nsfw.yara.db; yarac src/ko-language-nsfw.yara dist/ko-language-nsfw.yara.db; yarac src/nl-language-nsfw.yara dist/nl-language-nsfw.yara.db; yarac src/no-language-nsfw.yara dist/no-language-nsfw.yara.db; yarac src/pl-language-nsfw.yara dist/pl-language-nsfw.yara.db; yarac src/pt-language-nsfw.yara dist/pt-language-nsfw.yara.db; yarac src/ru-language-nsfw.yara dist/ru-language-nsfw.yara.db; yarac src/sv-language-nsfw.yara dist/sv-language-nsfw.yara.db; yarac src/th-language-nsfw.yara dist/th-language-nsfw.yara.db; yarac src/tlh-language-nsfw.yara dist/tlh-language-nsfw.yara.db; yarac src/tr-language-nsfw.yara dist/tr-language-nsfw.yara.db; yarac src/zh-language-nsfw.yara dist/zh-language-nsfw.yara.db;
yarac src/language-nsfw.yara dist/language-nsfw.db

this will create a single langauge-nsfw.db compiled yara rule with all the individual languages

Using signature database

$yara dist/language-nsfw.db nsfw-language-sample.txt 
content_da_language_nsfw nsfw-language-sample.txt
content_en_language_nsfw nsfw-language-sample.txt
content_nl_language_nsfw nsfw-language-sample.txt