
DALPHI

Dalphi Logo

DALPHI: Active Learning Platform for Human Interaction

#HSLIDE

Motivation

  • Supervised machine learning needs annotated data
  • Existing training data is
    • often outdated
    • often set in a military context
    • too universal or too specific
  • We need to generate our own training data
  • Existing solutions are time-consuming and exhausting
  • ... and therefore expensive

#HSLIDE

Example:
Named Entity Recognition

  • Available annotation interfaces are inconvenient
  • Mostly with a linguistic focus
  • Only whole-document views
  • Distribution to annotators is done manually

#VSLIDE

GATE

GATE

#VSLIDE

brat rapid annotation tool

brat

#VSLIDE

WebAnno

WebAnno

#HSLIDE

  • To solve these problems, we fundamentally restructured how annotation is done.
  • We designed an iterative workflow that automates as much as possible and conserves the annotators' attention.
  • The result is the open-source project Dalphi.

#HSLIDE

Dalphi

  • Web application, runs everywhere
  • Helps build and maintain annotated data
  • Key features:
    • Iterative workflow framework supported by active learning
    • Human-readable presentation
    • Server side proposes useful annotations
    • Parallel distribution to annotators
    • Problem-agnostic document handling
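One iteration of such a workflow could look roughly like the sketch below: pick the items a model is least sure about, then spread them across annotators so they can work in parallel. This is only an illustration of the idea, not Dalphi's code; the function names, the toy confidence function, and the annotator names are all assumptions.

```python
# Hypothetical sketch of one active-learning iteration (not Dalphi's actual code).
from typing import Callable, Dict, List


def select_most_uncertain(
    items: List[str],
    predict_confidence: Callable[[str], float],
    batch_size: int,
) -> List[str]:
    """Return the batch_size items the model is least confident about."""
    ranked = sorted(items, key=predict_confidence)
    return ranked[:batch_size]


def split_among_annotators(batch: List[str], annotators: List[str]) -> Dict[str, List[str]]:
    """Distribute a batch round-robin so annotators can work in parallel."""
    assignment: Dict[str, List[str]] = {name: [] for name in annotators}
    for index, item in enumerate(batch):
        assignment[annotators[index % len(annotators)]].append(item)
    return assignment


def toy_confidence(text: str) -> float:
    """Toy stand-in for a model: pretend longer texts are easier to label."""
    return min(1.0, len(text) / 20)


pool = ["Berlin", "Karl-Kunger Straße 64", "UG", "3antworten UG (haftungsbeschränkt)"]
batch = select_most_uncertain(pool, toy_confidence, batch_size=2)
assignment = split_among_annotators(batch, ["alice", "bob"])
```

In a real setup the confidence would come from the trained model, and the newly annotated batch would feed the next training round.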

#HSLIDE

Terminology

  • Service
  • Raw data
  • Annotation document
  • Statistic
  • Interface
  • Project

#VSLIDE

Service


  • any system capable of communicating over HTTP
  • handles problem-specific jobs
  • three types:
    • Iterate
    • Merge
    • Machine Learning
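Since a service only has to speak HTTP, a minimal one can be sketched with Python's standard library alone. The sketch below is not Dalphi's protocol: the URL layout (`POST /<role>`), the role identifiers, and the response fields are assumptions made for illustration, and the job handler merely acknowledges what it received.

```python
import json
from http.server import BaseHTTPRequestHandler

# Role names follow the three types on the slide; the identifiers are assumptions.
ROLES = ("iterate", "merge", "machine_learning")


def handle_job(role: str, payload: dict) -> dict:
    """Placeholder for problem-specific work: only acknowledges the job."""
    if role not in ROLES:
        raise ValueError(f"unknown service role: {role}")
    return {"role": role, "received_keys": sorted(payload)}


class ServiceHandler(BaseHTTPRequestHandler):
    """Accepts POST /<role> with a JSON body and answers with JSON."""

    def do_POST(self):
        role = self.path.strip("/")
        length = int(self.headers.get("Content-Length", 0))
        try:
            payload = json.loads(self.rfile.read(length) or b"{}")
            body, status = handle_job(role, payload), 200
        except ValueError as error:  # also covers malformed JSON
            body, status = {"error": str(error)}, 400
        data = json.dumps(body).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


# The job logic can be exercised without starting a server.
result = handle_job("merge", {"foo": "bar", "foobar": 1.23})
```

To actually serve requests, the handler could be passed to `http.server.HTTPServer(("localhost", 8080), ServiceHandler).serve_forever()`; host and port here are arbitrary.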

#VSLIDE

Raw data


  • data that needs to be annotated

#VSLIDE

Raw data

JSON

{
	"foo": "bar",
	"foobar": 1.23
}

HTML

<h1>Impressum</h1>
<p>3antworten UG (haftungsbeschränkt)<br>Karl-Kunger Straße 64<br>12435 Berlin</p>

JPG
Cat

#VSLIDE

Annotation document


  • a subset of raw data
  • a document that can be rendered and annotated
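As a rough illustration of "a subset of raw data", the sketch below cuts one raw text into paragraph-sized annotation documents. The splitting rule and the field names (`raw_datum_id`, `chunk_offset`, `content`) are assumptions for this example and not Dalphi's schema; in practice the split is problem-specific.

```python
from typing import Dict, List


def split_into_documents(raw_text: str, raw_datum_id: int) -> List[Dict]:
    """Split one raw datum into paragraph-level annotation documents."""
    paragraphs = [part.strip() for part in raw_text.split("\n\n") if part.strip()]
    return [
        # Field names are hypothetical, chosen for readability only.
        {"raw_datum_id": raw_datum_id, "chunk_offset": offset, "content": paragraph}
        for offset, paragraph in enumerate(paragraphs)
    ]


raw = "Impressum\n\n3antworten UG (haftungsbeschränkt)\nKarl-Kunger Straße 64\n12435 Berlin"
documents = split_into_documents(raw, raw_datum_id=1)
```

Each resulting document is small enough to be shown to an annotator on its own while still pointing back to the raw datum it came from.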

#VSLIDE

Statistic


  • key-value pair
  • mostly chronological numeric values
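A statistic in this sense can be pictured as a timestamped key-value record, which is what makes a table or a graph over time possible. The sketch below is an assumption about the shape of such data, not Dalphi's data model; the key names, values, and dates are made up.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass(frozen=True)
class Statistic:
    """One key-value pair plus the time it was recorded (hypothetical shape)."""
    key: str
    value: float
    recorded_at: datetime


def series(stats: List[Statistic], key: str) -> List[float]:
    """Return the values for one key in chronological order."""
    matching = [stat for stat in stats if stat.key == key]
    return [stat.value for stat in sorted(matching, key=lambda stat: stat.recorded_at)]


stats = [
    Statistic("f1", 0.71, datetime(2017, 1, 3)),
    Statistic("f1", 0.64, datetime(2017, 1, 1)),
    Statistic("precision", 0.80, datetime(2017, 1, 2)),
]
f1_over_time = series(stats, "f1")
```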

#VSLIDE

Interface


  • problem-specific user interface
  • renders an annotation document, which is a subset of the raw data

#VSLIDE

Project

Project

#HSLIDE

Projects

Projects

#VSLIDE

Dashboard

Projects Dashboard

#VSLIDE

Raw data

Projects Raw Data

#VSLIDE

Edit raw datum

Projects Raw Data Edit

#VSLIDE

Annotation documents

Projects Annotation Documents

#VSLIDE

Show annotation document

Projects Annotation Documents Show

#VSLIDE

Annotators

Projects Annotators

#VSLIDE

Show annotator

Projects Annotators Show

#VSLIDE

Statistics table

Projects Statistics Table

#VSLIDE

Statistics graph

Projects Statistics Graph

#HSLIDE

Services

Services

#VSLIDE

Edit service

Services Edit

#HSLIDE

Interfaces

Interfaces

#VSLIDE

Edit interface

Interfaces Edit

#VSLIDE

Test interface

Interfaces Test

#HSLIDE

Annotators

Annotators

#VSLIDE

Edit annotator

Annotators Edit