Skip to content
This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Branch: master
Clone or download
Latest commit 3846bed Jun 11, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
bin Added Dockerfile to build docker container of DDT Dec 2, 2018
client Enhanced page selection after version 54 radviz Nov 23, 2018
docs Updated installation instructions Oct 16, 2018
server Added RadvizModel in the default constructor of server. Nov 8, 2017
.gitignore Connecting back-end with front-end (React). Dec 6, 2016
Dockerfile Added Dockerfile to build docker container of DDT Dec 2, 2018
LICENSE Initial commit Nov 23, 2016
Makefile Updated to radviz-component 1.0.54 Nov 23, 2018
README.md Removed installation instructions and pointed to documentation for in… Feb 4, 2018
ache.yml
docker-compose.yml Changed the order of starting containers Oct 19, 2017
docker-compose.yml.ache Updated ACHE version to 0.9.0 Mar 19, 2018
environment.yml Updated jdk to 1.8 Jun 11, 2019

README.md

Documentation Status

Domain Discovery Tool (DDT)

This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web. It achieves this by integrating human insights with machine computation (data mining and machine learning) through visualization. DDT allows a domain expert to visualize and analyze pages returned by a search engine or a crawler, and easily provide feedback about relevance. DDT addresses important challenges:

  • It assist users in the process of domain understanding and discovery, guiding them to construct effective queries to be issued to a search engine to find additional relevant information;
  • It provides an easy-to-use interface whereby users can quickly provide feedback regarding the relevance of pages which can then be used to create learning classifiers for the domains of interest; and
  • It supports the configuration and deployment of focused crawlers that automatically and efficiently search the Web for additional pages on the topic. DDT allows users to quickly select crawling seeds as well as positive and negatives required to create the page classifier required for the focus topic.

Documentation

Documentation for installation and usage is available HERE!.

Publication

Yamuna Krishnamurthy, Kien Pham, Aecio Santos, and Juliana Friere. 2016. Interactive Web Content Exploration for Domain Discovery (Interactive Data Exploration and Analytics (IDEA) Workshop at Knowledge Discovery and Data Mining (KDD), San Francisco, CA).

Contact

DDT Development Team [ddt-dev@vgc.poly.edu]

You can’t perform that action at this time.