Skip to content

Collection of tools, utilities, datasets and approaches towards realising natural language interfaces for the Web of Data.

License

Notifications You must be signed in to change notification settings

codeaudit/NLIWOD

 
 

Repository files navigation

NLIWOD - Natural Language Interfaces for the Web of Data

Collection of tools, utilities, datasets and approaches towards realizing natural language interfaces for the Web of Data. Currently, we are focusing on Question Answering (QA) utilities.

Especially, this repository contains

  • QA Systems: A set of existing online webservices of QA systems all executable via a simple Java interface.
  • QA Datasets: A collection of existing Question Answering datasets
  • QA Machine Learning: This projects aims at learning a ML-based algorithm to combine multiple QA systems into one
  • QA Commons: A collection to ease handling of QA datasets. It allows to load, store and evaluate datasets and systems.

We aim at providing a fast entrance to the field of natural language interfaces (search, question answering, ranking). Thus, we will offer here Maven dependencies and source code for using many available datasets, systems and techniques.

More interesting Question Answering and Natural Language Generation projects can be found here:

  • SemWeb2NL https://github.com/aksw/semweb2nl: This repository provides means to verbalize triples, entities, SPARQL queries and many more RDF dataset. This is especially useful to allow users understand Web of Data content.

Foreseen modules:

  • QA Features: Features calculated on a NL question to train ML algorithms.

If you are interested in standardization efforts join or W3C Commmunity Group https://www.w3.org/community/nli/ !

For developers

To deploy a new version increase the according versions in the pom.xml and execute mvn clean deploy after setting your ~.m2/settings.xml in accordance to https://wiki.aksw.org/private/infrastructure/aksw-responsibilities/maven .

Maven Dependency

This library is available as snapshot here: http://maven.aksw.org/archiva/#artifact~snapshots/org.aksw.qa/datasets

Add the following repository to your project:

Artifacts are described in the sub-modules.

Look for more interesting libraries here: http://maven.aksw.org/archiva/#browse/org.aksw.qa

Docker

work in progress

hawk

docker build -t hawk -f Dockerfile-hawk .
docker run -t hawk

hawk-fuseki

run cd qa.hawk/deploy-scripts && ./index_fuseki.sh to create index

docker build -t hawk-fuseki -f Dockerfile-fuseki .
docker run --name hawk-fuseki -v /path/to/deploy-scripts/:/staging hawk-fuseki

About

Collection of tools, utilities, datasets and approaches towards realising natural language interfaces for the Web of Data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 77.8%
  • Web Ontology Language 20.6%
  • Java 1.6%