Skip to content

Human In The Loop

Xin Zou edited this page Dec 17, 2020 · 9 revisions

HUMAN IN THE LOOP (HITL)

Human in the loop(HITL) is a way to speed up the labeling process and improve model quality.

When you have a lot of documents to label, and analyze a lot of testing files to improve model quality, it's a long process. HITL can help you here.

How does HITL work?

Scenario 1: Analyze a local file and then add it to training set

  1. Train a model with 5 labeled documents image image
  2. Pick a up local file and analyze it, if you find there are incorrect fields, you could add the document to training set then adjust the labels if necessary. image image
  3. Train a new model with new labeling data. image
  4. Repeat step 2 - 3 until the model is good enough.

Scenario 2: Auto-label current document

  1. Upload lots of documents into your Azure blob storage, train a model with at least 5 labeled documents.
  2. Click auto-label, FoTT will automatically pick up other files in the blob storage and analyze them, then return the analyze results as label files. (note: there will be cost associated with using Form Recognizer Analyze API, please be aware). image
  3. Users can revise the labels of those files if necessary, notice the icons of the document are changed, indicating it’s been revised and ready to be used for training. image
  4. You could start a new training session with newly revised files.
  5. Repeat step 2 - 4 until the model is good enough.