Tagger is a php-basered library, that can be used to autotag texts using Named Entity Recognition and Keyword Extraction. Each tag has a relevancy score. The tags can be tied to URI's making it possible to enrich and filter your own content using Linked Open Data. The project works with Danish, Norwegian (bokmål) and Swedish texts.
Pull request Compare This branch is 36 commits behind 40c:4.x.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
classes
db
keyword-import
logger
resources
.gitignore
COPYRIGHT.txt
INSTALL.txt
Install.php
LICENSE.txt
README.md
Tagger.php
Update.php
conf.example.php
defaults.php

README.md

Tagger

The Tagger project is a library making it possible to extract relevant tags (keywords and named entities) from texts.

Named Entities are rated for relevancy according to their position, frequency and whether they are enclosed in specific HTML-tags (<emph>, <strong> etc.).

The library can be included in a webservice wrapper (https://github.com/40c/tagger-webservice) or it can be integrated into your favorite CMS (e.g. Drupal: http://drupal.org/project/tagger)