Skip to content

Extensions to HadoopReader

hildrum edited this page Sep 28, 2015 · 1 revision

This page is to keep track of possible extensions to HadoopReader operator. They are listed in no particular order.

  • Specify directories as input
  • Support type String for keys
  • Support Parquet format
  • Support generic key and value class:
    • by calling toString() method
    • by mapping to a byte array (blob)
    • by supporting the use of a user supplied jar.
  • Support other InputFormats, such as HBase.