Skip to content

Latest commit

 

History

History
178 lines (95 loc) · 2.82 KB

Supported Objects.md

File metadata and controls

178 lines (95 loc) · 2.82 KB

Overview

Below is a list of all AWS DataPipeline object types and the level of support from this library.

General Support Notes

  • Object ids are inferred from the name of the object specified in the dag, thus an object named "My EMR Cluster" will have the id "My-EMR-Cluster"".
  • onFail, failureAndRerunMode, role, resourceRole and pipelineLogUri are only supported in the Default object, meaning there can only be one global definition of these per pipeline.
  • onSuccess actions are not supported.
  • maxActiveInstances is not supported.
  • The fields reportProgressTimeout, lateAfterTimeout and onLateAction are not currently supported.
  • The parent parameter is not supported. parent is used to build object hierarchies in AWS datapipeline, but this is much better achieved via Scala hierarchies and/or Scala factory methods.
  • User-defined fields are not supported.

Supported DataPipeline Objects

Data Nodes

DynamoDBDataNode

  • Not yet supported

MySqlDataNode

  • Not yet supported

RedshiftDataNode

  • Not yet supported

S3DataNode

  • Not yet supported

SqlDataNode

  • Not yet supported

Activities

CopyActivity

  • Not yet supported

EmrActivity

  • Supported
  • TODO: Support for various fields

HadoopActivity

  • Not yet supported

HiveActivity

  • Not yet supported

HiveCopyActivity

  • Not yet supported

PigActivity

  • Not yet supported

RedshiftCopyActivity

  • Not yet supported

ShellCommandActivity

  • Supported
  • TODO: Support for scriptArgument, runsOn

SqlActivity

  • Supported
  • TODO: Support for scriptUri, scriptArgument, runsOn

Resources

Ec2Resource

  • Not yet supported

EmrCluster

  • Supported

HttpProxy

  • Not yet supported

Preconditions

DynamoDBDataExists

  • Not yet supported

DynamoDBTableExists

  • Not yet supported

Exists

  • Not yet supported

S3KeyExists

  • Supported

S3PrefixNotEmpty

  • Supported

ShellCommandPrecondition

  • Supported
  • TODO: Support for scriptArgument and presumably workerGroup and runsOn but they are not documented

Databases

JdbcDatabase

  • Not yet supported

RdsDatabase

  • Not yet supported

RedshiftDatabase

  • Supported

Data Formats

  • Not yet supported

Actions

SnsAlarm

  • Supported
  • Note that currently only one alarm per pipeline is supported

Terminate

  • Not yet supported

Schedule

  • Supported
  • Both ondemand and cron schedules are supported
  • timeseries schedules are not supported
  • Note that currently only one schedule per pipeline is supported

Utilities

ShellScriptConfig

  • Not yet supported

EmrConfiguration

  • Supported

Property

  • Supported