Skip to content
This repository has been archived by the owner on May 7, 2019. It is now read-only.

bizreach/drill-ltsv-plugin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

drill-ltsv-plugin

This project is no longer maintained. You can use Drill LTSV storage plugin as of Drill 1.16.

Apache Drill plugin for LTSV (Labeled Tab-separated Values) files.

What is LTSV?

LTSV is a format for the text file like this:

label:value<TAB>label:value<TAB>label:value ...

Quote the explanation from the web site:

Labeled Tab-separated Values (LTSV) format is a variant of Tab-separated Values (TSV). Each record in a LTSV file is represented as a single line. Each field is separated by TAB and has a label and a value. The label and the value have been separated by ‘:’. With the LTSV format, you can parse each line by spliting with TAB (like original TSV format) easily, and extend any fields with unique labels in no particular order. LTSV is simple and flexible. Parsing LTSV is super easy, and we can add new fields feel free. It’s very suitable for various logs.

This plugin adds LTSV suuport to Apache Drill.

Installation

Download drill-ltsv-plugin-VERSION.jar from the release page and put it into DRILL_HOME/jars/3rdparty.

Add ltsv format setting to the storage configuration as below:

  "formats": {
    "ltsv": {
      "type": "ltsv",
      "extensions": [
        "ltsv"
      ]
    },
    ...
  }

Then you can query *.ltsv files on Apache Drill.

Build from source

Run mvn package. drill-ltsv-plugin-VERSION.jar is generated into target directory.