Skip to content

A repo with a few tiny Apache Beam utilities that I've coded.

License

Notifications You must be signed in to change notification settings

disdi/beam_utils

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

My Personal Beam Utils

This package is a small set of Apache Beam utilities that I've coded as I've needed them. These might -or might not- eventually make it into the Apache Beam Python SDK. The utilities are the following:

  • Sources. A few sources for common file formats:

    • [CsvFileSource] - A source for CSV files. It returns Dictionaries and Lists
    • [JsonLinesFileSource] - A source for JsonLines files, where each JSON object is in a single line.
  • Coders. A few coders for common encodings

    • [NoopCoder] - It does not do any operation when encoding/decoding.

Installation

The package is available on PiPy as beam_utils. Just use pip install beam_utils, and import it as necessary. The apache_beam package is a dependency.

About

A repo with a few tiny Apache Beam utilities that I've coded.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%