ner-dataset-guide.md

title: How to format data for Named Entity Recognition (NER)
description: Learn how to format data for the Named Entity Recognition (NER) scenario in Model Builder
ms.date: 02/23/2024
author: zewditu
ms.author: zehailem
ms.custom: mvc,how-to
ms.topic: how-to

How to format data for Named Entity Recognition (NER)

An NER dataset consists of two parts:

  • Key information file: A list of entities that serves as key information for the training data.
  • Training data: A tab-separated file (.txt or .tsv). One column holds the sentence; the remaining columns hold the labels for the tokens in that sentence.
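
As a rough illustration of the two files described above, the following Python sketch builds a key information file and a tab-separated training file in memory. Everything specific in it is an assumption made for the example and is not taken from Model Builder: the entity names (`PERSON`, `CITY`, `O`), the sample sentences, one entity per line in the key information file, the sentence column coming first, whitespace tokenization, and the absence of a header row. Check your own data against what Model Builder accepts before relying on this layout.

```python
import csv
import io

# Hypothetical entity list for the key information file (assumed: one entity per line).
ENTITIES = ["PERSON", "CITY", "O"]

# Hypothetical examples: a sentence plus one label per whitespace-separated token.
EXAMPLES = [
    ("Alice lives in Paris", ["PERSON", "O", "O", "CITY"]),
    ("Bob visited Rome", ["PERSON", "O", "CITY"]),
]


def build_key_info(entities):
    """Return the key information file contents, one entity name per line."""
    return "\n".join(entities) + "\n"


def build_training_rows(examples, entities):
    """Return rows shaped [sentence, label_1, ..., label_n] after basic validation."""
    known = set(entities)
    rows = []
    for sentence, labels in examples:
        tokens = sentence.split()
        # Each token in the sentence needs exactly one label.
        if len(tokens) != len(labels):
            raise ValueError(f"{len(tokens)} tokens but {len(labels)} labels: {sentence!r}")
        # Every label must appear in the key information file.
        unknown = [label for label in labels if label not in known]
        if unknown:
            raise ValueError(f"labels missing from key information: {unknown}")
        rows.append([sentence, *labels])
    return rows


def to_tsv(rows):
    """Serialize rows as Tab-separated text with one row per line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()


key_info_text = build_key_info(ENTITIES)
training_text = to_tsv(build_training_rows(EXAMPLES, ENTITIES))
```

Writing `key_info_text` and `training_text` to disk (for example as a `.txt` and a `.tsv` file) gives a pair of files in the shape described above. The validation step is a design choice for this sketch: it catches sentences whose label count does not match their token count before the files are used.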