Skip to content

MRGRD56/textractor-integration-extensions

Repository files navigation

Textractor Extensions

TextractorPipe

The most stable extension here.
Creates a named pipe and sends sentence data to it.

The pipe name is \\.\pipe\MRGRD56_TextractorPipe_f30799d5-c7eb-48e2-b723-bd6314a03ba2.

The data sent is in JSON format:

{
  "text": "Lorem ipsum dolor sit amet",
  "meta": {
    "isCurrentSelect": true,    // sentenceInfo["current select"]
    "processId": 2898235,       // sentenceInfo["process id"]
    "threadNumber": 12,         // sentenceInfo["text number"]
    "threadName": "Some name",  // sentenceInfo["text name"]
    "timestamp": 1669721578
  }
}

Every message in the pipe starts from the length of data that is represented by an unsigned int32 that is exactly 4 bytes long (it is not text data!). The size is followed by JSON text data, whose length in bytes is equal to the number before it.

Range Description
[0..3] uint32 representing the length of the message
[4..{length + 4 - 1}] JSON string representing the message

HttpSender

Asynchronously sends each sentence as an HTTP request. Can be used to integrate Textractor with other applications.

Configuration

HttpSenderConfig.json

Example
{
  "sentence": {
    "enabled": false,
    "url": "http://localhost:9650/sentence",
    "requestType": "PLAIN_TEXT",
    "selectedThreadOnly": true
  }
}
Field Type
enabled* boolean
url string
requestType** "PLAIN_TEXT" | "JSON_TEXT" | "JSON_TEXT_WITH_META"
selectedThreadOnly boolean

⚠️⚠️⚠️*Set enabled field to true to make it work

**requestType field
PLAIN_TEXT
POST http://localhost:9650/sentence
Content-Type: text/plain; charset=UTF-8

Lorem ipsum dolor sit amet
JSON_TEXT
POST http://localhost:9650/sentence
Content-Type: application/json; charset=UTF-8

{
  "text": "Lorem ipsum dolor sit amet"
}
JSON_TEXT_WITH_META
POST http://localhost:9650/sentence
Content-Type: application/json; charset=UTF-8

{
  "text": "Lorem ipsum dolor sit amet",
  "meta": {
    "isCurrentSelect": true,    // sentenceInfo["current select"]
    "processId": 2898235,       // sentenceInfo["process id"]
    "threadNumber": 12,         // sentenceInfo["text number"]
    "threadName": "Some name",  // sentenceInfo["text name"]
    "timestamp": 1669721578
  }
}

TextractorTranslatorBridge

Zero config version of HttpSender special for Textractor Translator

Matches the following config of HttpSender:

{
  "sentence": {
    "enabled": true,
    "method": "POST",
    "url": "http://localhost:18952/sentence",
    "requestType": "JSON_TEXT_WITH_META",
    "selectedThreadOnly": true
  }
}