Facebook Presto input plugin for Embulk.
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
lib/embulk/input extract proc to connection.rb. Mar 21, 2017
test add test test_explain_parser.rb. Mar 21, 2017
.gitignore
.ruby-version migrate embulk v0.8.18 Mar 21, 2017
.travis.yml add jruby-9.1.5.0 test for travis. Mar 21, 2017
Gemfile initial commit. Dec 7, 2015
LICENSE.txt initial commit. Dec 7, 2015
README.md add badge.fury.io link to README. Mar 23, 2017
Rakefile add test. Jun 22, 2016
embulk-input-presto.gemspec

README.md

Facebook Presto input plugin for Embulk Gem Version Build Status

Facebook Presto input plugin for Embulk. see.

Overview

  • Plugin type: input
  • Resume supported: yes
  • Cleanup supported: yes
  • Guess supported: no

Configuration

  • host: host (string, default: "localhost")
  • port: port (integer, default: 8080)
  • schema: schema (string, default: "default")
  • catalog: catalog (string, default: "native")
  • query: query (string, required)
  • user: user (string, default: "embulk")
  • columns(deprecated): columns (array, required)
    • name: name (string, required)
    • type: type (string, required)

Warning : columns deprecated since over v0.2.0. Support auto fetch schema.

Example

in:
  type: presto
  host: presto-cordinator
  catalog: store
  schema: public
  query: |
    SELECT
      trim(upper(url_decode(keyword))) AS keyword,
      count(*) as count
    FROM search
    CROSS JOIN UNNEST(split(keywords, ',')) AS t (keyword)
    WHERE log_date >= (CURRENT_DATE - INTERVAL '90' DAY)
     AND length(keywords) != 256
    group by keyword
    having count(*) >= 10
    order by count(*) desc
out:
  type: stdout

Support type

  • TIMESTAMP
  • LONG
  • DOUBLE
  • BOOLEAN
  • STRING
  • JSON

Build

$ rake