Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BEAM-79] merge gearpump-runner into master #3611

Closed
wants to merge 207 commits into from

Commits on Jul 20, 2016

  1. Configuration menu
    Copy the full SHA
    9478f41 View commit details
    Browse the repository at this point in the history
  2. This closes apache#323

    kennknowles committed Jul 20, 2016
    Configuration menu
    Copy the full SHA
    02b2248 View commit details
    Browse the repository at this point in the history

Commits on Aug 25, 2016

  1. Configuration menu
    Copy the full SHA
    2a0ba61 View commit details
    Browse the repository at this point in the history
  2. move integration tests to profile

    manuzhang authored and kennknowles committed Aug 25, 2016
    Configuration menu
    Copy the full SHA
    1672b54 View commit details
    Browse the repository at this point in the history
  3. add package-info.java

    manuzhang authored and kennknowles committed Aug 25, 2016
    Configuration menu
    Copy the full SHA
    276a2e1 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    40be715 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    bc1b354 View commit details
    Browse the repository at this point in the history
  6. This closess apache#750

    kennknowles committed Aug 25, 2016
    Configuration menu
    Copy the full SHA
    091a15a View commit details
    Browse the repository at this point in the history

Commits on Aug 26, 2016

  1. gearpump: switch to stable version

    They have apparently deleted the SNAPSHOT jar and now builds are failing.
    dhalperi committed Aug 26, 2016
    Configuration menu
    Copy the full SHA
    fb74c93 View commit details
    Browse the repository at this point in the history

Commits on Aug 27, 2016

  1. Closes apache#895

    dhalperi committed Aug 27, 2016
    Configuration menu
    Copy the full SHA
    bf0a2ed View commit details
    Browse the repository at this point in the history

Commits on Sep 13, 2016

  1. Made byteArrayCoder final static

    gaurav gupta authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    0dfb8ff View commit details
    Browse the repository at this point in the history
  2. CompressedSource: CompressedReader is never splittable

    The only way it's safe to split a compressed file is if the file is not compressed. This can
    only happen when the source itself is splittable, and that in turn will result in the inner
    source's reader being returned. A CompressedReader will only be created in the event that
    the file is NOT splittable. So remove all the logic handling splittable compressed readers,
    and instead go with the logic when we know/assume the file is compressed.
    
    * TextIO: test compression with larger files
    
    It is important for correctness that we test with large files
    because otherwise the compressed file may be larger than the
    uncompressed file, which could mask bugs
    
    * TextIOTest: flesh out more
    
    * TextIOTest: add large uncompressed file
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    b9f8263 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    011bea9 View commit details
    Browse the repository at this point in the history
  4. Remove timeout in DirectRunnerTest

    If the test hangs due to bugs, the infrastructure should kill it.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    1d86335 View commit details
    Browse the repository at this point in the history
  5. Improve Write Error Message

    If provided with an Unbounded PCollection, Write will fail due to
    restriction of calling finalize only once. This error message fails in a
    deep stack trace based on it not being possible to apply a GroupByKey.
    Instead, throw immediately on application with a specific error message.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    36a9aa2 View commit details
    Browse the repository at this point in the history
  6. Remove Streaming Write Overrides in DataflowRunner

    These writes should be forbidden based on the boundedness of the input
    PCollection. As Write explicitly forbids the application of the
    transform to an Unbounded PCollection, this will be equivalent in most
    cases; In cases where the input PCollection is Bounded, due to an
    UnboundedReadFromBoundedSource, the write will function as expected and
    does not need to be forbidden.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    d564155 View commit details
    Browse the repository at this point in the history
  7. Remove Counter and associated code

    Aggregator is the model level concept. Counter was specific to the
    Dataflow Runner, and is now not needed as part of Beam.
    bchambers authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    89921c4 View commit details
    Browse the repository at this point in the history
  8. [BEAM-495] Create General Verifier for File Checksum

    Mark Liu authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    7fc2c68 View commit details
    Browse the repository at this point in the history
  9. Add output checksum to WordCountITOptions

    Mark Liu authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    b47549e View commit details
    Browse the repository at this point in the history
  10. More unit test and code style fix

    Mark Liu authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    37ce2a3 View commit details
    Browse the repository at this point in the history
  11. Using IOChannelUtils to resolve file path

    Mark Liu authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    046e36e View commit details
    Browse the repository at this point in the history
  12. Added unit tests and error handling in removeTemporaryTables

    Ian Zhou authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    58cd781 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    d99a652 View commit details
    Browse the repository at this point in the history
  14. Remove DataflowPipelineJob from examples

    peihe authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    39f763e View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    424c4c4 View commit details
    Browse the repository at this point in the history
  16. Add TransformEvaluatorFactory#cleanup

    This cleans up any state stored within the Transform Evaluator Factory.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    b80d967 View commit details
    Browse the repository at this point in the history
  17. Replace CloningThreadLocal with DoFnLifecycleManager

    This is a more focused interface that interacts with a DoFn before it
    is available for use and after it has completed and the reference is
    lost. It is required to properly support setup and teardown, as the
    fields in a ThreadLocal cannot all be cleaned up without additional
    tracking.
    
    Part of BEAM-452.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    77c90d0 View commit details
    Browse the repository at this point in the history
  18. Add DoFn @setup and @teardown

    Methods annotated with these annotations are used to perform expensive
    setup work and clean up a DoFn after another method throws an exception
    or the DoFn is discarded.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    d056f46 View commit details
    Browse the repository at this point in the history
  19. Move ParDo Lifecycle tests to their own file

    These tests are not yet functional in all runners, and this makes them
    easier to ignore.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    6603307 View commit details
    Browse the repository at this point in the history
  20. Exclude ParDoTest from Dataflow @RunnableOnService

    Until we implement it for Dataflow runner.
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    d6cf4f2 View commit details
    Browse the repository at this point in the history
  21. Exclude guava-testlib from shading relocation

    Previously, guava-testlib guava-testlib was being relocated as part of
    the shading process, but test-scope dependencies aren't bundled in the
    uber-jar. As a result, the output JAR was unusable without recreating the
    same shading rules in a consuming project.
    swegner authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    0f1f114 View commit details
    Browse the repository at this point in the history
  22. ByteKeyRangeTracker: synchronize toString

    It's the only unsynchronized function, and it could potentially produce bad data.
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    09cd1b7 View commit details
    Browse the repository at this point in the history
  23. Configuration menu
    Copy the full SHA
    bd53cdc View commit details
    Browse the repository at this point in the history
  24. Rewrites DoFnReflector to go via DoFnSignature

    DoFnSignature encapsulates type information about a DoFn,
    in particular which arguments/features its methods
    actually use.
    
    Before this commit, DoFnReflector would parse/verify/generate
    code in one go; after this commit, these stages are separated:
    DoFnSignature encapsulates all information needed to generate
    the code.
    
    Additionally, removes the unnecessary genericity in the
    implementation of DoFnReflector's code generation for the
    very different methods processElement and start/finishBundle.
    The code is simpler if decomposed into utility functions,
    rather than attempting a uniform representation for different
    methods.
    jkff authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    cc189b4 View commit details
    Browse the repository at this point in the history
  25. Replace ParDo with simpler transforms where possible

    There are a number of places in the Java SDK where we use
    ParDo.of(DoFn) when MapElements or other higher-level
    composites are applicable and readable. This change
    alters a number of those.
    kennknowles authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    da638b6 View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    235bf3b View commit details
    Browse the repository at this point in the history
  27. addressed feedback

    peihe authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    530b9c0 View commit details
    Browse the repository at this point in the history
  28. fix unused imports

    peihe authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    aa541e7 View commit details
    Browse the repository at this point in the history
  29. DatastoreIO Sink as ParDo

    vikkyrk authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    bbd0e6b View commit details
    Browse the repository at this point in the history
  30. Configuration menu
    Copy the full SHA
    245c3ce View commit details
    Browse the repository at this point in the history
  31. Fix NPE in BigQueryIO.TransformingReader

    peihe authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    16bcf78 View commit details
    Browse the repository at this point in the history
  32. Add inEarlyPanesInGlobalWindow as a PAssert Extractor

    This is for use in asserting the contents of speculative panes in the
    global window.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    178898f View commit details
    Browse the repository at this point in the history
  33. Add TestStream to the Testing package

    This is a source suitable for use with tests that have interesting
    triggering behavior. It is an Unbounded source that emits elements in
    bundles, and advances the watermark and processing time appropriately.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    6c82321 View commit details
    Browse the repository at this point in the history
  34. Implement TestStream in the DirectRunner

    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    f37dba8 View commit details
    Browse the repository at this point in the history
  35. Configuration menu
    Copy the full SHA
    dab9efc View commit details
    Browse the repository at this point in the history
  36. Datastore Sink support for writing Mutations

    This generalizes Write to Write and Delete cleanly.
    vikkyrk authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    a4053ac View commit details
    Browse the repository at this point in the history
  37. Mark JAXBContext as Volatile in JAXBCoder

    This corrects the double-checked locking and lazy intialization
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    c996c1e View commit details
    Browse the repository at this point in the history
  38. Modify example dependencies to only add runners as optional dependenc…

    …ies.
    
    Also support excluding runners by disabling the include-runners profile.
    
    Tested ITs
    BQ and WordCount pass on Dataflow
    WordCount passes on Flink and Spark
    lukecwik authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    4ad78b2 View commit details
    Browse the repository at this point in the history
  39. DatastoreIO v1beta3 to v1

    vikkyrk authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    5c1b9f1 View commit details
    Browse the repository at this point in the history
  40. Configuration menu
    Copy the full SHA
    c3c11b1 View commit details
    Browse the repository at this point in the history
  41. Remove extra timer firings in WatermarkManager

    These timers should not be fired - the windows should be expired via the
    GC timer, and any elements should be emitted if neccessary.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    64a2d51 View commit details
    Browse the repository at this point in the history
  42. Configuration menu
    Copy the full SHA
    a60806a View commit details
    Browse the repository at this point in the history
  43. Configuration menu
    Copy the full SHA
    780ffcb View commit details
    Browse the repository at this point in the history
  44. Write: Remove invalid import

    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    730e7b0 View commit details
    Browse the repository at this point in the history
  45. Configuration menu
    Copy the full SHA
    67efb17 View commit details
    Browse the repository at this point in the history
  46. Remove ParDoTest Suppression in Google Cloud Dataflow

    This reenables the lifecycle tests now that they are properly supported.
    
    Update the container image.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    438d8bd View commit details
    Browse the repository at this point in the history
  47. Configuration menu
    Copy the full SHA
    67e095d View commit details
    Browse the repository at this point in the history
  48. [BEAM-253] Unify Flink-Streaming Operator Wrappers

    This also replaces the custom Flink StateInternals by proper Flink
    Partitioned StateInternals.
    aljoscha authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    686a286 View commit details
    Browse the repository at this point in the history
  49. Configuration menu
    Copy the full SHA
    9eef8a6 View commit details
    Browse the repository at this point in the history
  50. [BEAM-102] Add Side Inputs in Flink Streaming Runner

    This adds a generic SideInputHandler in runners-core that is only used
    by the Flink runner right now but can be used by other runner
    implementations.
    aljoscha authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    9179e93 View commit details
    Browse the repository at this point in the history
  51. Configuration menu
    Copy the full SHA
    de744c5 View commit details
    Browse the repository at this point in the history
  52. Configuration menu
    Copy the full SHA
    b4a38c3 View commit details
    Browse the repository at this point in the history
  53. Configuration menu
    Copy the full SHA
    7a2cccd View commit details
    Browse the repository at this point in the history
  54. Configuration menu
    Copy the full SHA
    f2a992e View commit details
    Browse the repository at this point in the history
  55. Fix combine tests with Accumulation Mode

    These tests were not written in such a way as to succeed if the trigger
    fired multiple times.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    a07b29f View commit details
    Browse the repository at this point in the history
  56. Use AllPanes as the PaneExtractor in IterableAssert

    This ensures that tests with triggering in the global window which assert
    on the entire PCollection (not a singleton iterable) will succeed over the
    entire PCollection
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    97e093c View commit details
    Browse the repository at this point in the history
  57. Configuration menu
    Copy the full SHA
    b7ba1d6 View commit details
    Browse the repository at this point in the history
  58. Configuration menu
    Copy the full SHA
    7012a22 View commit details
    Browse the repository at this point in the history
  59. Update checkstyle.xml to put all imports in one group

    bchambers authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    93f7955 View commit details
    Browse the repository at this point in the history
  60. Optimize imports

    bchambers authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    00441f8 View commit details
    Browse the repository at this point in the history
  61. BigQueryIO.Write: raise size limit to 11 TiB

    BigQuery has changed their total size quota to 12 TiB.
    https://cloud.google.com/bigquery/quota-policy#import
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    8d32196 View commit details
    Browse the repository at this point in the history
  62. Configuration menu
    Copy the full SHA
    186fe28 View commit details
    Browse the repository at this point in the history
  63. Configuration menu
    Copy the full SHA
    433842b View commit details
    Browse the repository at this point in the history
  64. Configuration menu
    Copy the full SHA
    32928c3 View commit details
    Browse the repository at this point in the history
  65. Configuration menu
    Copy the full SHA
    f05fbe7 View commit details
    Browse the repository at this point in the history
  66. Make WriteTest more resilient to Randomness

    In the worst case scenario for random key assignment in
    Write.ApplyShardingKey, the chance of the number of records per output
    shard was too high. This makes the test significantly less likely to
    flake.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    4ec73d8 View commit details
    Browse the repository at this point in the history
  67. checkstyle: prohibit API client repackaged Guava

    Apparently the IllegalImport check only blocks packages,
    so we had to move to Regexp to get individual classes.
    
    As a bonus, this enforcement let us remove two bogus dependencies.
    Smaller JARs for the win!
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    b3be7b7 View commit details
    Browse the repository at this point in the history
  68. Modified BigtableIO to use DoFn setup/tearDown methods instead of sta…

    …rtBundle/finishBundle
    Ian Zhou authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    1f8b534 View commit details
    Browse the repository at this point in the history
  69. Configuration menu
    Copy the full SHA
    5b425ac View commit details
    Browse the repository at this point in the history
  70. Configuration menu
    Copy the full SHA
    ef312e9 View commit details
    Browse the repository at this point in the history
  71. travis.yml: disable updating snapshots

    Will still update releases with the --update-snapshots version (I know, bad name)
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    0fbd9c8 View commit details
    Browse the repository at this point in the history
  72. Query latest timestamp

    vikkyrk authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    79491eb View commit details
    Browse the repository at this point in the history
  73. [BEAM-589] Fixing IO.Read transformation

    gaurav gupta authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    4023167 View commit details
    Browse the repository at this point in the history
  74. Configuration menu
    Copy the full SHA
    bce9aef View commit details
    Browse the repository at this point in the history
  75. Organize imports in Kinesis

    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    aee5fbf View commit details
    Browse the repository at this point in the history
  76. Fix javadoc in Kinesis

    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    973081e View commit details
    Browse the repository at this point in the history
  77. [BEAM-592] Fix SparkRunner Dependency Problem in WordCount

    Mark Liu authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    1c1115e View commit details
    Browse the repository at this point in the history
  78. DataflowRunner: get PBegin from PInput

    Fixes an invalid cast that breaks some PCollectionList-related tests.
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    8454d5c View commit details
    Browse the repository at this point in the history
  79. Configuration menu
    Copy the full SHA
    07dd978 View commit details
    Browse the repository at this point in the history
  80. Update Dataflow Container Version

    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    435054b View commit details
    Browse the repository at this point in the history
  81. [BEAM-572] Remove Spark Reference in WordCount

    Mark Liu authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    cf9ce2f View commit details
    Browse the repository at this point in the history
  82. Configuration menu
    Copy the full SHA
    a58afd3 View commit details
    Browse the repository at this point in the history
  83. take advantage of setup/teardown for KafkaWriter

    Raghu Angadi authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    74d0195 View commit details
    Browse the repository at this point in the history
  84. Add LeaderBoardTest

    This test exercises the PTransforms that make up the LeaderBoard
    example. This includes speculative and late trigger firings to produce
    team and individual scores on a global and fixed window basis.
    
    Refactor LeaderBoard to expose the team and user score calculations as
    composite PTransforms to enable this testing.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    00b4e95 View commit details
    Browse the repository at this point in the history
  85. Configuration menu
    Copy the full SHA
    8007bdf View commit details
    Browse the repository at this point in the history
  86. Address comments of Flink Side-Input PR

    aljoscha authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    6ae4b6a View commit details
    Browse the repository at this point in the history
  87. Configuration menu
    Copy the full SHA
    1524494 View commit details
    Browse the repository at this point in the history
  88. Correct some accidental renames

    IDE over-eagerly replaced some occurrences of createAggregator with
    createAggregatorForDoFn. This corrects that.
    bchambers authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    798566c View commit details
    Browse the repository at this point in the history
  89. Test that multiple instances of TestStream are supported

    Add KeyedResourcePool
    
    This interface represents some shared pool of values that may be used by
    at most one caller at a time.
    
    Add LockedKeyedResourcePool which has at most one value per key and
    at most one user per value at a time.
    
    Use KeyedResourcePool in TestStream
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    4251761 View commit details
    Browse the repository at this point in the history
  90. Configuration menu
    Copy the full SHA
    28ad44d View commit details
    Browse the repository at this point in the history
  91. Add Latest CombineFn and PTransforms

    Add DoFnTester support for specifying input timestamps
    swegner authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    6ee7b62 View commit details
    Browse the repository at this point in the history
  92. Configuration menu
    Copy the full SHA
    0312f15 View commit details
    Browse the repository at this point in the history
  93. Cloud Datastore naming clean-up

    vikkyrk authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    f44fa2c View commit details
    Browse the repository at this point in the history
  94. Fixed Combine display data

    Ian Zhou authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    9943fd7 View commit details
    Browse the repository at this point in the history
  95. Configuration menu
    Copy the full SHA
    60d8cd9 View commit details
    Browse the repository at this point in the history
  96. Configuration menu
    Copy the full SHA
    4bf3a3b View commit details
    Browse the repository at this point in the history
  97. Configuration menu
    Copy the full SHA
    c92e45d View commit details
    Browse the repository at this point in the history
  98. FluentBackoff: a replacement for a variety of custom backoff implemen…

    …tations
    
    We have 3 different backoff classes, which don't really have that much
    different functionality. Add a single, flexible backoff implementation
    that can be used to replace all three classes. Additionally, this new
    backoff actually supports more functionality than any of the other three
    did -- you can limit retries, cap the exponential growth of an
    individual backoff, and cap the cumulative time spent in backoff; prior
    implementations did not allow all 3.
    
    This also makes the parameters self-obvious (Duration, not
    number-that-is-also-millis) where appropriate.
    
    This initial PR should have no functional changes.
    
    * Implement FluentBackoff
    * Replace other custom BackOff implementations with FluentBackoff
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    3f48566 View commit details
    Browse the repository at this point in the history
  99. [BEAM-456] Add MongoDbIO

    jbonofre authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    9ae5cc7 View commit details
    Browse the repository at this point in the history
  100. Configuration menu
    Copy the full SHA
    5eb44aa View commit details
    Browse the repository at this point in the history
  101. BigQuery: limit max job polling time to 1 minute

    Before the backoff would grow unboundedly, so we could in principle wait
    1.5x to 2x the actual job time. For long running jobs this is hours.
    Now, we just back off at most 1 minute between checking the job state.
    Note there should be no danger of QPS overload here because we should
    have very few concurrent outstanding jobs
    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    958f3fe View commit details
    Browse the repository at this point in the history
  102. Be more accepting in UnboundedReadDeduplicatorTest

    Don't depend on all the threads failing. Instead, assert that at most
    one success was encountered, and we saw at most numThreads - 1 failures.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    c8052b6 View commit details
    Browse the repository at this point in the history
  103. Remove timeout in JAXBCoderTest

    The infrastructure should kill the test if the await does not return.
    tgroh authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    8f68085 View commit details
    Browse the repository at this point in the history
  104. Configuration menu
    Copy the full SHA
    50c1c88 View commit details
    Browse the repository at this point in the history
  105. Configuration menu
    Copy the full SHA
    b235595 View commit details
    Browse the repository at this point in the history
  106. Configuration menu
    Copy the full SHA
    1b420db View commit details
    Browse the repository at this point in the history
  107. Configuration menu
    Copy the full SHA
    092a187 View commit details
    Browse the repository at this point in the history
  108. Configuration menu
    Copy the full SHA
    5084580 View commit details
    Browse the repository at this point in the history
  109. Configuration menu
    Copy the full SHA
    e5db1c7 View commit details
    Browse the repository at this point in the history
  110. Configuration menu
    Copy the full SHA
    34c731f View commit details
    Browse the repository at this point in the history
  111. !fixup Minor javadoc clean-up

    lukecwik authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    6cd48c4 View commit details
    Browse the repository at this point in the history
  112. fix import order

    manuzhang authored and dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    59ae94c View commit details
    Browse the repository at this point in the history
  113. Closes apache#943

    dhalperi committed Sep 13, 2016
    Configuration menu
    Copy the full SHA
    ed7c4aa View commit details
    Browse the repository at this point in the history

Commits on Sep 14, 2016

  1. Configuration menu
    Copy the full SHA
    272fe9f View commit details
    Browse the repository at this point in the history
  2. Closes apache#956

    dhalperi committed Sep 14, 2016
    Configuration menu
    Copy the full SHA
    8f4334c View commit details
    Browse the repository at this point in the history

Commits on Oct 26, 2016

  1. Configuration menu
    Copy the full SHA
    9dc9be9 View commit details
    Browse the repository at this point in the history
  2. post-merge fix

    manuzhang committed Oct 26, 2016
    Configuration menu
    Copy the full SHA
    8f013cb View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    94bd47c View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3f06382 View commit details
    Browse the repository at this point in the history
  5. Closes apache#1193

    dhalperi committed Oct 26, 2016
    Configuration menu
    Copy the full SHA
    3933b55 View commit details
    Browse the repository at this point in the history

Commits on Nov 3, 2016

  1. Configuration menu
    Copy the full SHA
    45570b9 View commit details
    Browse the repository at this point in the history
  2. This closes apache#1234

    kennknowles committed Nov 3, 2016
    Configuration menu
    Copy the full SHA
    323ec11 View commit details
    Browse the repository at this point in the history

Commits on Nov 8, 2016

  1. Merge branch 'master' into gearpump-runner

    # Conflicts:
    #	runners/pom.xml
    manuzhang committed Nov 8, 2016
    Configuration menu
    Copy the full SHA
    0c36228 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2a96a17 View commit details
    Browse the repository at this point in the history
  3. This closes apache#1306

    kennknowles committed Nov 8, 2016
    Configuration menu
    Copy the full SHA
    a14927f View commit details
    Browse the repository at this point in the history

Commits on Nov 12, 2016

  1. Configuration menu
    Copy the full SHA
    68363d0 View commit details
    Browse the repository at this point in the history

Commits on Nov 23, 2016

  1. Configuration menu
    Copy the full SHA
    2812405 View commit details
    Browse the repository at this point in the history

Commits on Dec 5, 2016

  1. Configuration menu
    Copy the full SHA
    86414c0 View commit details
    Browse the repository at this point in the history

Commits on Dec 6, 2016

  1. Configuration menu
    Copy the full SHA
    2afc0cd View commit details
    Browse the repository at this point in the history
  2. This closes apache#1507

    kennknowles committed Dec 6, 2016
    Configuration menu
    Copy the full SHA
    88de0cb View commit details
    Browse the repository at this point in the history

Commits on Dec 16, 2016

  1. Upgrade Gearpump version

    manuzhang committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    46d3563 View commit details
    Browse the repository at this point in the history
  2. Add Window.Bound translator

    manuzhang committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    85d54ab View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    c37de00 View commit details
    Browse the repository at this point in the history
  4. Remove print to stdout

    manuzhang committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    cb8c5e5 View commit details
    Browse the repository at this point in the history
  5. Fix NoOpAggregatorFactory

    manuzhang committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    8e0e819 View commit details
    Browse the repository at this point in the history
  6. This closes apache#1623: [BEAM-1086] Upgrade to latest Gearpump snapshot

      Fix NoOpAggregatorFactory
      Remove print to stdout
      Skip window assignment when windows don't change
      Add Window.Bound translator
      Upgrade Gearpump version
    kennknowles committed Dec 16, 2016
    Configuration menu
    Copy the full SHA
    b6e7bb6 View commit details
    Browse the repository at this point in the history

Commits on Dec 20, 2016

  1. Configuration menu
    Copy the full SHA
    81d94cf View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c2fb7c0 View commit details
    Browse the repository at this point in the history

Commits on Dec 21, 2016

  1. Configuration menu
    Copy the full SHA
    647034c View commit details
    Browse the repository at this point in the history
  2. This closes apache#1663: Merge master (b3de17b) into gearpump-runner

    Adjustments in gearpump-runner:
    
      [BEAM-79] Upgrade to beam-0.5.0-incubating-SNAPSHOT
      [BEAM-79] Update to latest Gearpump API
    
    From master:
    
      Disable automatic archiving of Maven builds
      [BEAM-59] initial interfaces and classes of Beam FileSystem.
      Change counter name in TestDataflowRunner
      More escaping in Jenkins timestamp spec
      Add RunnableOnService test for Metrics
      Fix seed job fetch spec
      Show timestamps on log lines in Jenkins
      [BEAM-1165] Fix unexpected file creation when checking dependencies
      [BEAM-1178] Make naming of logger objects consistent
      [BEAM-716] Fix javadoc on with* methods [BEAM-959] Improve check preconditions in JmsIO
      [BEAM-716] Use AutoValue in JmsIO
      Fix grammar error (repeated for)
      Empty TestPipeline need not be run
      [BEAM-85, BEAM-298] Make TestPipeline a JUnit Rule checking proper usage
      Change counter name in TestDataflowRunner
      BigQueryIO: fix streaming write, typo in API
      [BEAM-853] Force streaming execution on batch pipelines for testing. Expose the adapted source.
      Use empty SideInputReader, fixes NPE in SimpleDoFnRunnerTest
      Test that SimpleDoFnRunner wraps exceptions in startBundle and finishBundle
      Add timer support to DoFnRunner(s)
      Make TimerSpec and StateSpec fields accessible
      View.asMap: minor javadoc fixes
      Revert "Move InMemoryTimerInternals to runners-core"
      Revert "Moves DoFnAdapters to runners-core"
      Revert "Removes ArgumentProvider.windowingInternals"
      Revert "Removes code for wrapping DoFn as an OldDoFn"
      checkstyle: missed newline in DistributionCell
      Make {Metric,Counter,Distribution}Cell public
      Add PTransformOverrideFactory to the Core SDK
      Move ActiveWindowSet and implementations to runners-core
      Update Dataflow worker to beam-master-20161216
      [BEAM-1108] Remove outdated language about experimental autoscaling
      [BEAM-450] Shade modules to separate paths
      [BEAM-362] Port runners to runners-core AggregatoryFactory
      Move InMemoryTimerInternals to runners-core
      Delete deprecated TimerCallback
      Remove deprecated methods of InMemoryTimerInternals
      Don't incorrectly log error in MetricsEnvironment
      Renames ParDo.getNewFn to getFn
      Moves DoFnAdapters to runners-core
      Removes unused code from NoOpOldDoFn
      Removes ArgumentProvider.windowingInternals
      Removes code for wrapping DoFn as an OldDoFn
      Removes OldDoFn from ParDo
      Pushes uses of OldDoFn deeper inside Flink runner
      Remove ParDo.of(OldDoFn) from Apex runner
      Converts all easy OldDoFns to DoFn
      [BEAM-1022] Add testing coverage for BigQuery streaming writes
      Fix mvn command args in Apex postcommit Jenkins job
      [BEAM-932] Enable findbugs validation (and fix existing issues)
      Fail to split in FileBasedSource if filePattern expands to empty.
      [BEAM-1154] Get side input from proper window in ReduceFn
      [BEAM-1153] GcsUtil: use non-batch API for single file size requests.
      Fix NPE in StatefulParDoEvaluatorFactoryTest mocking
      [BEAM-1033] Retry Bigquery Verifier when Query Fails
      Implement GetDefaultOutputCoder in DirectGroupByKey
      SimpleDoFnRunner observes window if SideInputReader is nonempty
      Better comments and cleanup
      Allow empty string value for ValueProvider types.
      starter: fix typo in pom.xml
      Revert "Allow stateful DoFn in DataflowRunner"
      Re-exclude UsesStatefulParDo tests for Dataflow
      Some minor changes and fixes for sorter module
      [BEAM-1149] Explode windows when fn uses side inputs
      Add Jenkins postcommit for RunnableOnService in Apex runner
      Update version from 0.5.0-SNAPSHOT to 0.5.0-incubating-SNAPSHOT
      Update Maven Archetype versions after cutting the release branch
      Move PerKeyCombineFnRunner to runners-core
      Update Dataflow worker to beam-master-20161212
      [maven-release-plugin] prepare for next development iteration
      [maven-release-plugin] prepare branch release-0.4.0-incubating
      Fix version of Kryo in examples/java jenkins-precommit profile
      Revert 91cc606 "This closes apache#1586": Kryo + UBRFBS
      [BEAM-909] improve starter archetype
      Fix JDom malformed comment in Apex runner.
      [BEAM-927] Fix findbugs and re-enable Maven plugin in JmsIO
      [BEAM-807] Replace OldDoFn with DoFn.
      [BEAM-757] Use DoFnRunner in the implementation of DoFn via FlatMapFunction.
      FileBasedSinkTest: fix tests in Windows OS by using IOChannelUtils.resolve().
      FileBasedSink: ignore exceptions when removing temp output files for issues in Windows OS.
      [BEAM-1142] Upgrade maven-invoker to address maven bug ARCHETYPE-488.
      Add Tests for Kryo Serialization of URFBS
      Add no-arg constructor for UnboundedReadFromBoundedSource
      Revise WindowedWordCount for runner and execution mode portability
      Factor out ShardedFile from FileChecksumMatcher
      Add IntervalWindow coder to the standard registry
      Stop expanding PValues in DirectRunner visitors
      Migrate AppliedPTransform to use AutoValue
      Enable and fix DirectRunnerTest case missing @test
      [BEAM-1130] SparkRunner ResumeFromCheckpointStreamingTest Failing.
      [BEAM-1133] Add maxNumRecords per micro-batch for Spark runner options.
      BigQueryIO.Write: support runtime schema and table
      Fix handling of null ValueProviders in DisplayData
      [BEAM-551] Fix handling of default for VP
      [BEAM-1120] Move some DataflowRunner configurations from code to properties
      [BEAM-551] Fix toString for FileBasedSource
      [BEAM-921] spark-runner: register sources and coders to serialize with java serializer
      [BEAM-551] Fix handling of TextIO.Sink
      ...
    kennknowles committed Dec 21, 2016
    Configuration menu
    Copy the full SHA
    4c445dd View commit details
    Browse the repository at this point in the history

Commits on Jan 4, 2017

  1. Configuration menu
    Copy the full SHA
    2155476 View commit details
    Browse the repository at this point in the history

Commits on Jan 5, 2017

  1. update ROS configurations

    manuzhang committed Jan 5, 2017
    Configuration menu
    Copy the full SHA
    cfdc971 View commit details
    Browse the repository at this point in the history

Commits on Jan 7, 2017

  1. Configuration menu
    Copy the full SHA
    ea633d2 View commit details
    Browse the repository at this point in the history

Commits on Jan 14, 2017

  1. fix group by window

    manuzhang committed Jan 14, 2017
    Configuration menu
    Copy the full SHA
    e63d42d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3bf8263 View commit details
    Browse the repository at this point in the history
  3. support OutputTimeFn

    manuzhang committed Jan 14, 2017
    Configuration menu
    Copy the full SHA
    f6aaf0d View commit details
    Browse the repository at this point in the history

Commits on Jan 16, 2017

  1. Configuration menu
    Copy the full SHA
    364a3f0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b2d326f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    7613ec4 View commit details
    Browse the repository at this point in the history

Commits on Jan 20, 2017

  1. Configuration menu
    Copy the full SHA
    85dcfbd View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d814857 View commit details
    Browse the repository at this point in the history

Commits on Jan 24, 2017

  1. This closes apache#1661: Implement GearpumpPipelineResult

      note thread is interrupted on InterruptedException
      Remove cache for Gearpump on travis
      reduce timeout to wait for result
      fix ParDo.BoundMulti translation
      return encoded key for GroupByKey translation
      support OutputTimeFn
      update to latest gearpump dsl function interface
      fix group by window
      activate ROS on Gearpump by default
      update ROS configurations
      [BEAM-1180] Implement GearpumpPipelineResult
    kennknowles committed Jan 24, 2017
    Configuration menu
    Copy the full SHA
    1ed16f1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4fd216b View commit details
    Browse the repository at this point in the history

Commits on Jan 26, 2017

  1. Configuration menu
    Copy the full SHA
    4001aeb View commit details
    Browse the repository at this point in the history

Commits on Feb 15, 2017

  1. Configuration menu
    Copy the full SHA
    7af6472 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2d0aed9 View commit details
    Browse the repository at this point in the history

Commits on Mar 7, 2017

  1. Configuration menu
    Copy the full SHA
    4eb50d1 View commit details
    Browse the repository at this point in the history

Commits on Mar 10, 2017

  1. enable ParDoTest

    manuzhang committed Mar 10, 2017
    Configuration menu
    Copy the full SHA
    3dc8fc8 View commit details
    Browse the repository at this point in the history
  2. This closes apache#2150: Add SideInput support for GearpumpRunner

      enable ParDoTest
      [BEAM-79] Add SideInput support for GearpumpRunner
    kennknowles committed Mar 10, 2017
    Configuration menu
    Copy the full SHA
    15a8ad6 View commit details
    Browse the repository at this point in the history

Commits on Mar 12, 2017

  1. Configuration menu
    Copy the full SHA
    3f91798 View commit details
    Browse the repository at this point in the history

Commits on Mar 15, 2017

  1. Configuration menu
    Copy the full SHA
    3eab6a6 View commit details
    Browse the repository at this point in the history

Commits on Mar 17, 2017

  1. This closes apache#2241: merge master to gearpump-master and fixup

      [BEAM-79] Fix gearpump-runner merge conflicts and test failure
      Revert BigQueryIO bit of 'Make all uses of CountingOutputStream close their resources'
      [BEAM-1629] Init metrics/aggregators accumulators before traversing pipeline
      Generate zip distribution for pyhthon
      Ignore results from the tox clean up phase
      Add README to python tarball.
      Remove exception suppression from PAssert.SideInputCheckerDoFn
      Remove duplicated dependency from Dataflow runner pom.xml
      Added assertion failure tests for `PAssert#thatSingleton`
      Added a test of default PAssert failure reason
      Javadoc changes
      [BEAM-1551] Allow `PAssert`s to take a message
      add unicode type to the typeDict attribute in Python SDK
      Remove Pipeline.getRunner
      [BEAM-1686] Use random MQTT clientID when not defined to avoid NPE
      Properly deal with late processing-time timers
      [BEAM-1661] Shade guava in the JdbcIO
      [BEAM-797] A PipelineVisitor that creates a Spark-native pipeline.
      Introduce Flink-specific state GC implementations
      Move GC timer checking to StatefulDoFnRunner.CleanupTimer
      Move pipeline context and add more tests.
      Add license to new files.
      Runner API translation of triggers and windowing strategies.
      Runner API encoding of WindowFns.
      Runner API context helper classes.
      Auto-generated runner api proto bindings.
      HadoopInputFormatIO with junits
      Test runner to stop on EOT watermark, or timeout.
      [BEAM-1184] Add integration tests to ElasticsearchIO
      Jdbc k8 script: postgres data store only accessible inside test project
      Jdbc k8 & data loading: add teardown and update names/docs
      Bump Dataflow ROS timeout to 120 minutes
      Fixup typo in WindowingStrategies
      Update archetypes
      [BEAM-1649] Fix unresolved references in Python SDK
      Upgrade Dataflow container version to beam-master-20170307
      Explicitly GBK before stateful ParDo in Dataflow batch
      Add ServicesResourceTransformer to all shading configuration
      Fix typo in proto: widow -> window.
      [BEAM-1646] Remove duplicated bigquery dependency
      Update python SDK version to the next version.
      Change Json parsing from gson to jackson for ElasticsearchIO
      [maven-release-plugin] prepare for next development iteration
      [maven-release-plugin] prepare branch release-0.6.0
      [BEAM-1633] Move .tox/ directory under target/ in Python SDK
      Make all uses of CountingOutputStream close their resources
      Flink: register known IOChannelFactories
      [BEAM-1546] Specify exact version for Python in the SDK
      [BEAM-1635] TypeError in AfterWatermark class's __repr__ method
      [BEAM-1636] UnboundedDataset action() does not materialize RDD
      [BEAM-1556] Make PipelineOptions a lazy-singleton and init IOs as part of it.
      Add tests for serialization of BigQueryIO.TableRowInfoCoder
      Fix tox warning for non-whitelisted find command
      Revert "Implement Single-Output ParDo as a composite"
      [BEAM-1623] Transform Reshuffle directly in Spark runner
      [BEAM-1626] Remove cache of MapWithStateDStream on read.
      Revert "DataflowRunner: experimental support for issuing FnAPI based jobs"
      Java examples: move shade plugin into default lifecycle
      Java 8 examples: add shade configuration for producing bundled/fat jar
      Java 8 examples: add SNAPSHOT repository to the pom.xml file
      Update Guava version from 19.0 to 20.0 in example projects
      [BEAM-1625] BoundedDataset action() does not materialize RDD
      Revert "[maven-release-plugin] prepare branch release-0.6.0"
      Updating Dataflow API client protobufs
      Implement Single-Output ParDo as a composite
      Add a Test for windowed CombineGloballyAsSingletonView
      Remove SingletonCombine
      Only Override CreatePCollectionView in Streaming
      [BEAM-1310] Add running integration tests in JdbcIO on Spark and Dataflow runners
      DataflowRunner: experimental support for issuing FnAPI based jobs
      Updates Python SDK source API so that sources can report limited parallelism signals.
      [BEAM-1188] Python Bigquery Verifier For E2E Test
      Deprecate Pipeline.getOptions
      BEAM-1567 hashStream should be closed in PackageUtil#createPackageAttributes()
      [BEAM-1565] Update Spark runner PostCommit Jenkins job.
      [maven-release-plugin] prepare branch release-0.6.0
      Do not Reassign Windows when WindowFn is null
      Fix DataflowRunner message about uploaded vs cached files
      Update javadoc ant to include runners/ and exclude modules with a wildcard
      Adding per-stage matching to metrics filters
      Upgrade dill to 0.2.6 and pin it
      Remove PipelineRunner#apply
      [BEAM-111] Move WritableCoder to hadoop-common
      [BEAM-1297] Update maven shade plugin, fix typo and remove unneeded version
      [BEAM-351] Add DisplayData to KafkaIO
      Inline rather than reference FunctionSpecs.
      [BEAM-1517] Garbage collect user state in Flink Runner
      BEAM-1417 Count should comply with PTransform style guide
      BEAM-1419 Flatten should comply with PTransform style guide
      BEAM-1416 Write transform should comply with PTransform style guide
      BEAM-1426 SortValues should comply with PTransform style guide
      BEAM-1424 ToString should comply with PTransform style guide
      BEAM-1423 Sample should comply with PTransform style guide
      BEAM-1421 Latest should comply with PTransform style guide
      BEAM-1420 GroupByKey should comply with PTransform style guide
      Use UnsupportedSideInputReader in GroupAlsoByWindowEvaluatorFactory
      Include cython tests in presubmits for linux platform
      Update output stream cython declaration
      Make side inputs a map, rather than embedding the name in the message.
      Streaming tests, especially the ones using checkpoints, need a time buffer to finish.
      ...
    kennknowles committed Mar 17, 2017
    Configuration menu
    Copy the full SHA
    555842a View commit details
    Browse the repository at this point in the history

Commits on Mar 23, 2017

  1. Configuration menu
    Copy the full SHA
    eb0d333 View commit details
    Browse the repository at this point in the history

Commits on Mar 30, 2017

  1. Configuration menu
    Copy the full SHA
    f4f2333 View commit details
    Browse the repository at this point in the history

Commits on Apr 13, 2017

  1. Configuration menu
    Copy the full SHA
    f3138dd View commit details
    Browse the repository at this point in the history

Commits on Apr 18, 2017

  1. Configuration menu
    Copy the full SHA
    ebbb613 View commit details
    Browse the repository at this point in the history

Commits on Apr 20, 2017

  1. Merge branch 'master' of https://github.com/apache/incubator-beam int…

    …o gearpump-runner
    
    # Conflicts:
    #	.test-infra/jenkins/job_beam_PostCommit_Java_ValidatesRunner_Gearpump.groovy
    #	runners/pom.xml
    manuzhang committed Apr 20, 2017
    Configuration menu
    Copy the full SHA
    46c41fc View commit details
    Browse the repository at this point in the history

Commits on Apr 21, 2017

  1. Configuration menu
    Copy the full SHA
    44d21ac View commit details
    Browse the repository at this point in the history
  2. This closes apache#2610: Merge master into gearpump-runner branch

      Update gearpump-runner against master changes.
      add temp dataset location for non-query BigQuerySource
      added module option, use more common zero test, show module name in log
      Modify types for input PCollections of Flatten transform to that of the output PCollection
      [BEAM-1871] Remove another depedendency by moving TestCredential
      [BEAM-2017] Fix NPE in DataflowRunner when there are no metrics
      [BEAM-2013] Upgrade to Jackson 2.8.8
      [BEAM-2014] Upgrade to Google Auth 0.6.1
      [BEAM-2015] Remove shared profile in runners/pom.xml and fix Dataflow ValidatesRunner PostCommit
      Cache result of BigQuerySourceBase.split
      Ensure all Read outputs are consumed in Dataflow
      [BEAM-1441] Remove deprecated ChannelFactory
      [BEAM-1994] Remove Flink examples package
      Pin default commons-compress version to beam-parent pom
      [BEAM-1914] XmlIO now complies with PTransform style guide
      Separate streaming writes into two pluggable components - CreateTables, and StreamingWriteTables. Also address many code review comments. Also merge with master.
      Fix tests to properly fake out BigQueryService, and add tests for dynamic-table functionality.
      Refactor batch loads, and add support for windowed writes.
      Refactor batch load job path, and add support for data-dependent tables.
      Refactor streaming write branch into separate reusable components.
      Add PrepareWrite transform.
      Use tableRefFunction throughout BigQueryIO. Constant table writes use ConstantTableSpecFunction.
      Explodes windows before GBKIKWI
      Creates ProcessFnRunner and wires it through ParDoEvaluator
      Extracts interface from PushbackSideInputDoFnRunner
      Minor cleanups in ParDoEvaluator
      ProcessFn remembers more info about its application context
      Separates side input test and side output test
      Changed snappy version to 1.1.4-M3
      Upgrade worker to not depend on deprecated now deleted code
      Delete AppEngineEnvironment
      Delete IntervalBoundedExponentialBackoff
      Delete AttemptBoundedExponentialBackoff
      Remove deprecated/unused code from Pipeline
      Remove deprecated method in IOChannelUtils
      Delete deprecated AttemptAndTimeBoundedExponentialBackoff
      [BEAM-1871] Create new GCP core module package and move several GCP related classes from beam-sdks-java-core over.
      [BEAM-1964] Upgrade Pylint
      Remove options_id concept from templated runs.
      Revert "Revert "Throw specialized exception in value providers""
      Revert "Revert "Revert "Revert "Add ValueProvider class for FileBasedSource I/O Transforms""""
      Removes unused validation parameter
      Converts TFRecordIO.Write to AutoValue
      Gets rid of TFRecordIO.Write.Bound
      Converts TFRecordIO.Read to AutoValue
      Gets rid of TFRecordIO.Read.Bound
      runners-core-construction-java fix artifact name
      Rename SideOutputValue to OutputValue
      [BEAM-1990] Comment: Don't use Window.Assign
      [BEAM-1272] Align the naming of "generateInitialSplits" and "splitIntoBundles" to better reflect their intention
      Revert "Removes final minor usages of OldDoFn outside OldDoFn itself"
      Fix Hadoop pom.xml
      Making metrics usage in datastore_wordcount consistent
      Remove overloading of __call__ in DirectRunner
      Clean up DirectRunner Clock and TransformResult
      Translate PTransforms to and from Runner API Protos
      [BEAM-1993] Remove special unbounded Flink source/sink
      Remove flink-annotations dependency
      Fix Javadoc warnings on Flink Runner
      Enable flink dependency enforcement and make dependencies explicit
      [BEAM-59] Register standard FileSystems wherever we register IOChannelFactories
      [BEAM-1991] Sum.SumDoubleFn => Sum.ofDoubles
      clean up description for sdk_location
      Set the Project of a Table Reference at Runtime
      Only compile HIFIO ITs when compiling with java 8.
      Update assertions of source_test_utils from camelcase to underscore-separated.
      Add no-else return to pylintrc
      Remove getSideInputWindow
      Remove reference to the isStreaming flag
      Javadoc fixups after style guide changes
      Update Dataflow Worker Version
      [BEAM-1922] Close datasource in JdbcIO when possible
      Fix javadoc warnings
      Add javadoc to getCheckpointMark in UnboundedSource
      Removes final minor usages of OldDoFn outside OldDoFn itself
      [BEAM-1915] Removes use of OldDoFn from Apex
      Update Signature of PTransformOverrideFactory
      [BEAM-1964] Fix lint issues and pylint upgrade
      Rename DoFn.Context#sideOutput to output
      [BEAM-1964] Fix lint issues for linter upgrade -3
      [BEAM-1964] Fix lint issues for linter upgrade -2
      Avoi repackaging bigtable classes in dataflow runner.
      ApexRunner: register standard IOs when deserializing pipeline options
      Add PCollections Utilities
      Free PTransform Names if they are being Replaced
      [BEAM-1347] Update protos related to State API for prototyping purposes.
      Update java8 examples pom files to include maven-shade-plugin.
      fix the simplest typo
      [BEAM-1964] Fix lint issues for linter upgrade
      Merge PR#2423: Add Kubernetes scripts for clusters for Performance and Integration tests of Cassandra and ES for Hadoop Input Format IO
      Remove Triggers.java from SDK entirely
      [BEAM-1708] Improve error message when GCP not installed
      Improve gcloud logging message
      [BEAM-1101, BEAM-1068] Remove service account name credential pipeline options
      Update user_score.py
      Pin versions in tox script
      Improve Empty Create Default Coder Error Message
      Represent a Pipeline via a list of Top-level Transforms
      Test all Known Coders to ensure they Serialize via URN
      [BEAM-1950] Add missing 'static' keyword to MicrobatchSource#initReaderCache
      ...
    kennknowles committed Apr 21, 2017
    Configuration menu
    Copy the full SHA
    4078c22 View commit details
    Browse the repository at this point in the history

Commits on May 3, 2017

  1. Merge remote-tracking branch 'upstream/master' into gearpump-runner

    # Conflicts:
    #	.travis.yml
    manuzhang committed May 3, 2017
    Configuration menu
    Copy the full SHA
    9a59ea3 View commit details
    Browse the repository at this point in the history

Commits on May 4, 2017

  1. Configuration menu
    Copy the full SHA
    12b9719 View commit details
    Browse the repository at this point in the history
  2. This closes apache#2888: Merge master into gearpump-runner branch

      Update gearpump-runner against master changes
      Introduces read/writePubsubMessages
      Removed coder and formatFn from PubsubIO.Write
      Removed coder and parseFn from PubsubIO.Read
      Add Additional CloudObjectTranslators
      Updating Dataflow API protos and client
      Remove Coder.asCloudObject and related methods
      Deprecate <pipeline>.options usage
      Include LICENSE and NOTICE in python dist files
      Full removal of Aggregators in Java SDK and Runners
      [BEAM-59] Move GcsFileSystem to gcp-core
      [BEAM-2139] Disable SplittableDoFn ValidatesRunner tests for Streaming Flink Runner
      [BEAM-59] Delete old restrictions on output file paths
      [BEAM-2093] Use the jackson version from the maven property in maven archetypes
      [BEAM-2135] Fix pointers to sdks/java/io/hadoop-file-system
      Instantiate runner briefly in Pipeline
      Skip null options when converting back to argv
      [BEAM-2135] Fix pointers to GCP-core
      [BEAM-1316] Remove the usage of mock from ptransform tests
      [BEAM-539] Fixes several issues of FileSink.
      [BEAM-2020] Convert all unknown Coders into CustomCoder CloudObjects
      BigQuery: swap from asSingleton to asIterable for Cleanup
      [BEAM-2141] Disable JDBC tests
      [BEAM-2135] Move hdfs to hadoop-file-system
      Moves TextSource and TextSink to top level
      Converts TextIO.Write to AutoValue
      Removes TextIO.Write.Bound
      Converts TextIO.Read to AutoValue
      Removes TextIO.Read.Bound
      Removes leftover no-op withoutValidation methods
      Make DoFnSignatures robust to StateSpec subclasses
      Simplify type parameters of StateSpec and related
      [BEAM-2135] Move gcp-core to google-cloud-platform-core
      Revert clean else-return lint changes.
      Add an @internal Annotation
      Add retry logic for job creation
      Update github PR template after travis removal.
      [BEAM-1764] Remove aggregators from Flink Runner
      Use processing time for synchronized processing time in Flink runner
      Remove unneeded elements from the POM files
      Remove accumulators from DoFn tester.
      Removing Aggregator from core runner code
      Remove Aggregators from StatefulDoFn runner
      [BEAM-59] AvroIOTest: use absolute paths for display data
      [BEAM-59] DataflowRunnerTests: configure FileSystems in test
      [BEAM-59] TFRecordIOTest: cleanup
      [BEAM-59] Core tests: stop using gs:// paths
      Remove aggregators from DoFn contexts and internal SDK usage
      [BEAM-2005] Fix build break, ignore test due to change in TestPipeline/FileSystems interaction
      [BEAM-2005, BEAM-2030, BEAM-2031, BEAM-2032, BEAM-2033, BEAM-2070] Base implementation of HadoopFileSystem.
      maptask_executor_runner_test: build fix
      Scattered minor improvements per review comments
      Moves AvroIO.write().withSchema into write()
      Converts AvroIO.Write to AutoValue; adds writeGenericRecords()
      Moves AvroIO.Read.withSchema into read()
      Removes AvroIO.Write.Bound
      Converts AvroIO.Read to AutoValue
      Adds AvroIO.readGenericRecords()
      Removes AvroIO.Read.Bound
      Moves AvroSink to upper level
      Fixes javadoc of TextIO to not point to AvroIO
      [BEAM-1871] Remove deprecated org.apache.beam.sdk.options.GcsOptions after updating Dataflow worker
      Copy CloudObject to the Dataflow Module
      Rename OutputValue to TaggedOutput.
      Fn API support for Python.
      [BEAM-1441] remove manual join from apiclient
      Add CloudObjectTranslators for Avro, Serializable
      [BEAM-59] Switch mimeType from mutable protected field to constructor
      [BEAM-1908] Allow setting CREATE_NEVER when using a tablespec in BigQueryIO
      Remove IOChannelUtils from BigQuery TableRowWriter
      [BEAM-1925] Remove deprecated context param from DoFn
      [BEAM-1871] Hide CloudResourceManager client in sdks/java/extensions/gcp-core
      [BEAM-1676] Internalize ClassPath from Guava 21.0 to fix the failures related to having whitespaces in resource names.
      Add Cloud Object Translators for Coders
      Re-enable WindowedWordCountIT on Flink runner in precommit
      Add A CoderTranslator Interface
      [BEAM-2130] Ensure the options id is never null.
      Use a new ReflectData for each AvroCoder instance
      [BEAM-2074,BEAM-2073] Fixed SourceDStream's rate control usage.
      [BEAM-2072] Fixed MicrobatchSource.reader stops reading after reaching maxNumRecords for the first time.
      [BEAM-2129] Fix flaky KafkaIOTest#testUnboundedSourceMetrics
      [BEAM-59] DataflowRunner: Sink is always a FileBasedSink now
      Do not repeat log messages in DataflowPipelineJob
      Update Dataflow worker version to beam-master-20170430
      Remove KeyedCombineFn
      [BEAM-59] Minor style cleanups to WriteOneWindowPerFile
      Do not prune branches in Jenkins
      Fix hamcrest-core version in parent pom
      [BEAM-2114] Tests for KafkaIO: use ExpectedException rule
      [BEAM-2114] Throw instead of warning when KafkaIO cannot infer coder
      [BEAM-2114] Fixed display data for Kafka read/write with coders
      Remove useless continue statements
      Remove redundant private on enum constructors
      Remove useless return statement
      Renames {id,timestamp}Label to {id,timestamp}Attribute throughout SDK
      Adds PubsubIO.writeStrings(), writeProtos(), writeAvros()
      Adds PubsubIO.readStrings(), readProtos(), readAvros()
      Renames PubsubIO.Write builder methods to be style guide compliant
      Converts PubsubIO.Write to AutoValue
      Remove override of topic by subscription and vice versa
      ...
    kennknowles committed May 4, 2017
    Configuration menu
    Copy the full SHA
    58546ac View commit details
    Browse the repository at this point in the history

Commits on May 18, 2017

  1. Configuration menu
    Copy the full SHA
    bc8da29 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6c06967 View commit details
    Browse the repository at this point in the history
  3. This closes apache#3172: Sync gearpump-runner with master

      Update gearpump-runner against master changes
      Use stable naming strategy for ByteBuddy invokers
      Translate a Pipeline in SdkComponents
      [TRIVIAL] runners-core: delete placeholder
      Fixes an accidentally found bug in SimpleDoFnRunner
      Removes OldDoFn and its kin from runners-core
      Bump Dataflow containers to 0512
      Improve Pruning performed by the DirectRunnerApiSurfaceTest
      Adding support for subnetwork in Python Pipelineoptions
      Use built-in cmp python function in comparing datastore paths
      ApexRunner SDF support
      Fix documentation for the shard_template_name
      [BEAM-2299] Run maven install on Windows machine for build/test coverage on Windows
      Remove "Dataflow" from apache_beam __init__.py file
      Moving the data file for trigger tests to testing/data
      Fix GcsResourceIdTest in postcommits
      readAvros should't have proto Message upper bound
      Reduce Log Level of PubsubUnboundedSource
      [BEAM-2290] Fix issue where timestamps weren't set when using CompressedSource
      [BEAM-2279] Fix archetype breakages
      internal comments
      Fix shading of guava testlib
      Rename FileSystems.setDefaultConfigInWorkers
      [BEAM-2277] HadoopFileSystem: normalize implementation
      Mark FileSystem and related as Experimental
      [BEAM-2277] Add ResourceIdTester and test existing ResourceId implementations
      Remove '/' entirely from determining FileSystem scheme
      [BEAM-2279] Add HDFS support to Spark runner profiles in archetypes and examples
      [BEAM-2277] Fix URI_SCHEME_PATTERN in FileSystems
      BigtableIO should use AutoValue for read and write
      [BEAM-2153] Move connection management in JmsIO.write() to setup/teardown methods
      Mark More values methods Internal
      Rename filesink to filebasedsink
      Enable SerializableCoder to Serialize with Generic Types
      Remove unused test data
      Fix due to GBKO name change.
      Don't deploy jdk1.8-tests module
      Remove some internal details from the public API.
      Move assert_that, equal_to, is_empty to apache_beam.testing.util
      [BEAM-1345] Clearly delineate public api in apache_beam/typehints.
      [BEAM-1345] Mark apache_beam/internal as internal.
      [BEAM-1345] Annotate public members of pvalue.
      Add internal comments to metrics
      [BEAM-1340] Add __all__ tags to modules in package apache_beam/transforms
      [BEAM-2256] Add the last previous range filter
      Use a consistent calculation for GC Time
      fix lint error in fake_datastore.py
      Add __all__ tags to modules in package apache_beam/testing
      [BEAM-1340] Adds __all__ tags to classes in package apache_beam/io.
      [BEAM-1345] Clearly delineate public api in apache_beam/coders.
      [BEAM-1345] Clearly delineate public api in runners package.
      [BEAM-1345] Mark Pipeline as public.
      [BEAM-1345] Clearly delineate public API in apache_beam/options
      Mark internal modules in python datastoreio
      [BEAM-2260] Improve construction-time errors for Text and AvroIO
      [BEAM-2179] Archetype generate-sources.sh cleanup the existing sources before rsync
      [BEAM-1345] Mark windowed value as experimental
      Add internal usage only comments to util/
      Remove protobuf and http-client dependency from runners/google-cloud-dataflow
      minor typo fix in comment
      Add support for local execution to PubsubIO using the google cloud emulator
      [BEAM-2150] Relax regex to support wildcard globbing for GCS
      bump time of precommits
      [BEAM-2244] Move details of Metrics to Runners Core
      Correct javadoc for mobile gaming examples
      Update SDK Coders to return the Empty List from getCoderArguments
      Skip generating empty jars for parent poms
      Fix a typo in TestDataflowRunnerTest
      Re-enable UsesTimersInParDo tests in Dataflow runner
      TestDataflowRunner: throw AssertionError only when assertion known failed
      Allow any throwable in PAssert to constitute adequate failure
      [BEAM-2242] Ensure that jars are shaded correctly by running the jar plugin before the shade plugin
      [BEAM-2240] Always augment exception with step name.
      Adds dependency on findbugs to examples/java
      Splits WriteBundles into windowed/unwindowed versions
      Simpler code for setting shard numbers on results in FileBasedSink
      Implement dynamic-sharding for windowed file outputs, and add an integration test.
      Renames FileBasedSink inner classes
      [BEAM-2250] remove experimental and internal things from pydoc
      [BEAM-2249] Correctly handle partial reads in AvroSource
      Use text output for first two mobile gaming examples
      Remove verifyDeterministic from StructuredCoder
      Update Coder Documentation
      Improve DirectRunner Javadoc
      [BEAM-2211] Delete deprecated NoopPathValidator
      Remove Timer.cancel() from user-facing API
      Remove Readme files.
      Renames some python classes and functions that were unnecessarily public.
      Mark PipelineVisitor and AppliedPTransform as internal.
      Mark PValue and PValueBase Internal
      [BEAM-2236] Move test utilities out of python core
      Include 'sun.reflect' in GcpCoreApiSurfaceTest
      Fix checkstyle error
      Shade dependencies in sdks/core
      Remove trailing whitespace
      Add per-runner profile to Java 8 examples
      Register TestSparkPipelineOptions only in src/test to avoid hard hamcrest dep
      Update Apache Beam Python version to 2.1.0.dev
      Shade JSR305 in the DirectRunner
      Remove hadoop io readme
      ...
    kennknowles committed May 18, 2017
    Configuration menu
    Copy the full SHA
    99221e7 View commit details
    Browse the repository at this point in the history

Commits on Jun 5, 2017

  1. Configuration menu
    Copy the full SHA
    9e6c906 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c9aac96 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3c7e3e6 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    98854d4 View commit details
    Browse the repository at this point in the history

Commits on Jun 12, 2017

  1. Configuration menu
    Copy the full SHA
    7653e7e View commit details
    Browse the repository at this point in the history
  2. This closes apache#3292: Merge master into gearpump-runner

    Adjustments to gearpump-runner:
    
      Fix side input handling in DoFnFunction
      Respect WindowFn#getOutputTime in gearpump-runner
      Activate Gearpump local-validates-runner-tests in precommit
      Update against master changes
    
    Merged from master:
    
      [BEAM-972] Add more unit test to Gearpump runner
      [BEAM-972] Add unit tests to Gearpump runner
      [BEAM-79] Fix gearpump-runner merge conflicts and test failure
      enable ParDoTest
      [BEAM-79] Add SideInput support for GearpumpRunner
      [BEAM-79] Support merging windows in GearpumpRunner
      [BEAM-79] Fix PostCommit test confs for Gearpump runner
      note thread is interrupted on InterruptedException
      Remove cache for Gearpump on travis
      reduce timeout to wait for result
      fix ParDo.BoundMulti translation
      return encoded key for GroupByKey translation
      support OutputTimeFn
      update to latest gearpump dsl function interface
      fix group by window
      activate ROS on Gearpump by default
      update ROS configurations
      [BEAM-1180] Implement GearpumpPipelineResult
      [BEAM-79] Upgrade to beam-0.5.0-incubating-SNAPSHOT
      [BEAM-79] Update to latest Gearpump API
      Fix NoOpAggregatorFactory
      Remove print to stdout
      Skip window assignment when windows don't change
      Add Window.Bound translator
      Upgrade Gearpump version
      [BEAM-79] fix gearpump runner build failure
      [BEAM-79] update GearpumpPipelineResult
      [BEAM-79] Port Gearpump runner from OldDoFn to new DoFn
      upgrade gearpump-runner to 0.4.0-incubating-SNAPSHOT
      remove "pipeline" in runner name
      post-merge fix
      [BEAM-79] fix integration-test failure
      fix import order
      !fixup Minor javadoc clean-up
      Added even more javadoc to TextIO#withHeader and TextIO#withFooter (2).
      Added even more javadoc to TextIO#withHeader and TextIO#withFooter.
      Added javadoc to TextIO#withHeader and TextIO#withFooter.
      Reverted header and footer to be of type String.
      Revised according to comments following a code review.
      Add header/footer support to TextIO.Write
      [BEAM-242] Enable and fix checkstyle in Flink runner examples
      Remove timeout in JAXBCoderTest
      Be more accepting in UnboundedReadDeduplicatorTest
      BigQuery: limit max job polling time to 1 minute
      [BEAM-242] Enable checkstyle and fix checkstyle errors in Flink runner
      [BEAM-456] Add MongoDbIO
      FluentBackoff: a replacement for a variety of custom backoff implementations
      Remove the DataflowRunner instructions from examples
      Put classes in runners-core package into runners.core namespace
      Delegate populateDipslayData to wrapped combineFn's
      Fixed Combine display data
      Cloud Datastore naming clean-up
      DatastoreIO SplitQueryFn integration test
      Add Latest CombineFn and PTransforms
      Remove empty unused method in TestStreamEvaluatorFactory
      Test that multiple instances of TestStream are supported
      Correct some accidental renames
      Fix condition in FlinkStreamingPipelineTranslator
      Address comments of Flink Side-Input PR
      [BEAM-569] Define maxNumRecords default value to Long.MAX_VALUE in JmsIO
      Add LeaderBoardTest
      take advantage of setup/teardown for KafkaWriter
      Returned KafkaIO getWatermark log line in debug mode
      [BEAM-572] Remove Spark Reference in WordCount
      Update Dataflow Container Version
      [BEAM-313] Provide a context for SparkRunner
      DataflowRunner: get PBegin from PInput
      [BEAM-592] Fix SparkRunner Dependency Problem in WordCount
      Fix javadoc in Kinesis
      Organize imports in Kinesis
      kinesis: a connector for Amazon Kinesis
      [BEAM-589] Fixing IO.Read transformation
      Query latest timestamp
      travis.yml: disable updating snapshots
      Added support for reporting aggregator values to Spark sinks
      [BEAM-294] Rename dataflow references to beam
      Modified BigtableIO to use DoFn setup/tearDown methods instead of startBundle/finishBundle
      checkstyle: prohibit API client repackaged Guava
      Make WriteTest more resilient to Randomness
      Update DoFn javadocs to remove references to OldDoFn and Dataflow
      [BEAM-545] Promote JobName to PipelineOptions
      Move the samples data to gs://apache-beam-samples/
      Cleanup some javadoc that referring Dataflow
      BigQueryIO.Write: raise size limit to 11 TiB
      Optimize imports
      Update checkstyle.xml to put all imports in one group
      Fix Exception Unwrapping in TestFlinkRunner
      Make ParDoLifecycleTest Serializable to Fix Test with TupleTag
      Use AllPanes as the PaneExtractor in IterableAssert
      Fix combine tests with Accumulation Mode
      Enable Flink Streaming Runner RunnableOnService tests
      Fix Flink Runner Pom for Batch RunnableOnService tests
      Don't Suppress Throwable in PAssert in Streaming Mode
      ...
    kennknowles committed Jun 12, 2017
    Configuration menu
    Copy the full SHA
    559e3c3 View commit details
    Browse the repository at this point in the history

Commits on Jun 17, 2017

  1. Configuration menu
    Copy the full SHA
    fed98c8 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f61822d View commit details
    Browse the repository at this point in the history

Commits on Jun 19, 2017

  1. Configuration menu
    Copy the full SHA
    a7b5d98 View commit details
    Browse the repository at this point in the history
  2. Fix kryo exception

    manuzhang committed Jun 19, 2017
    Configuration menu
    Copy the full SHA
    11caa97 View commit details
    Browse the repository at this point in the history

Commits on Jun 23, 2017

  1. Remove unused codes

    manuzhang committed Jun 23, 2017
    Configuration menu
    Copy the full SHA
    b21fa04 View commit details
    Browse the repository at this point in the history
  2. This closes apache#3388: Sync gearpump-runner branch with master

      Remove unused codes
      Fix kryo exception
      Fix PCollectionView translation
      upgrade to gearpump 0.8.4-SNAPSHOT
      Fix side input handling in DoFnFunction
      Respect WindowFn#getOutputTime in gearpump-runner
      Activate Gearpump local-validates-runner-tests in precommit
      Update against master changes
      Update gearpump-runner against master changes
      Update gearpump-runner against master changes
      Update gearpump-runner against master changes.
      [BEAM-972] Add more unit test to Gearpump runner
      [BEAM-972] Add unit tests to Gearpump runner
      [BEAM-79] Fix gearpump-runner merge conflicts and test failure
      enable ParDoTest
      [BEAM-79] Add SideInput support for GearpumpRunner
      [BEAM-79] Support merging windows in GearpumpRunner
      [BEAM-79] Fix PostCommit test confs for Gearpump runner
      note thread is interrupted on InterruptedException
      Remove cache for Gearpump on travis
      reduce timeout to wait for result
      fix ParDo.BoundMulti translation
      return encoded key for GroupByKey translation
      support OutputTimeFn
      update to latest gearpump dsl function interface
      fix group by window
      activate ROS on Gearpump by default
      update ROS configurations
      [BEAM-1180] Implement GearpumpPipelineResult
      [BEAM-79] Upgrade to beam-0.5.0-incubating-SNAPSHOT
      [BEAM-79] Update to latest Gearpump API
      Fix NoOpAggregatorFactory
      Remove print to stdout
      Skip window assignment when windows don't change
      Add Window.Bound translator
      Upgrade Gearpump version
      [BEAM-79] fix gearpump runner build failure
      [BEAM-79] update GearpumpPipelineResult
      [BEAM-79] Port Gearpump runner from OldDoFn to new DoFn
      upgrade gearpump-runner to 0.4.0-incubating-SNAPSHOT
      remove "pipeline" in runner name
      post-merge fix
      [BEAM-79] fix integration-test failure
      fix import order
      !fixup Minor javadoc clean-up
      Added even more javadoc to TextIO#withHeader and TextIO#withFooter (2).
      Added even more javadoc to TextIO#withHeader and TextIO#withFooter.
      Added javadoc to TextIO#withHeader and TextIO#withFooter.
      Reverted header and footer to be of type String.
      Revised according to comments following a code review.
      Add header/footer support to TextIO.Write
      [BEAM-242] Enable and fix checkstyle in Flink runner examples
      Remove timeout in JAXBCoderTest
      Be more accepting in UnboundedReadDeduplicatorTest
      BigQuery: limit max job polling time to 1 minute
      [BEAM-242] Enable checkstyle and fix checkstyle errors in Flink runner
      [BEAM-456] Add MongoDbIO
      FluentBackoff: a replacement for a variety of custom backoff implementations
      Remove the DataflowRunner instructions from examples
      Put classes in runners-core package into runners.core namespace
      Delegate populateDipslayData to wrapped combineFn's
      Fixed Combine display data
      Cloud Datastore naming clean-up
      DatastoreIO SplitQueryFn integration test
      Add Latest CombineFn and PTransforms
      Remove empty unused method in TestStreamEvaluatorFactory
      Test that multiple instances of TestStream are supported
      Correct some accidental renames
      Fix condition in FlinkStreamingPipelineTranslator
      Address comments of Flink Side-Input PR
      [BEAM-569] Define maxNumRecords default value to Long.MAX_VALUE in JmsIO
      Add LeaderBoardTest
      take advantage of setup/teardown for KafkaWriter
      Returned KafkaIO getWatermark log line in debug mode
      [BEAM-572] Remove Spark Reference in WordCount
      Update Dataflow Container Version
      [BEAM-313] Provide a context for SparkRunner
      DataflowRunner: get PBegin from PInput
      [BEAM-592] Fix SparkRunner Dependency Problem in WordCount
      Fix javadoc in Kinesis
      Organize imports in Kinesis
      kinesis: a connector for Amazon Kinesis
      [BEAM-589] Fixing IO.Read transformation
      Query latest timestamp
      travis.yml: disable updating snapshots
      Added support for reporting aggregator values to Spark sinks
      [BEAM-294] Rename dataflow references to beam
      Modified BigtableIO to use DoFn setup/tearDown methods instead of startBundle/finishBundle
      checkstyle: prohibit API client repackaged Guava
      Make WriteTest more resilient to Randomness
      Update DoFn javadocs to remove references to OldDoFn and Dataflow
      [BEAM-545] Promote JobName to PipelineOptions
      Move the samples data to gs://apache-beam-samples/
      Cleanup some javadoc that referring Dataflow
      BigQueryIO.Write: raise size limit to 11 TiB
      Optimize imports
      Update checkstyle.xml to put all imports in one group
      Fix Exception Unwrapping in TestFlinkRunner
      Make ParDoLifecycleTest Serializable to Fix Test with TupleTag
      Use AllPanes as the PaneExtractor in IterableAssert
      ...
    kennknowles committed Jun 23, 2017
    Configuration menu
    Copy the full SHA
    99f4f8b View commit details
    Browse the repository at this point in the history

Commits on Jun 30, 2017

  1. Configuration menu
    Copy the full SHA
    f158257 View commit details
    Browse the repository at this point in the history
  2. This closes apache#3479: [BEAM-79] Merge master into gearpump-runner …

    …branch
    
      Don't call .testingPipelineOptions() a second time
      GCP IO ITs now all use --project option
      Select SDK distribution based on the selected SDK name
      [BEAM-2373] Upgrade commons-compress dependency version to 1.14
      Define the projectId in the SpannerIO Read Test (utest, not itest)
      Use SDK harness container for FnAPI jobs when worker_harness_container_image is not specified. Add a separate image tag to use with the SDK harness container.
      Ditch apache commons
      Add PubSub I/O support to Python DirectRunner
      Only use ASCII 'a' through 'z' for temporary Spanner tables
      ReduceFnRunner.onTrigger: add short circuit for empty pane, and move inputWM and pane after the short circuit.
      WindowingStrategy: add OnTimeBehavior to control whether to emit empty ON_TIME pane.
      Removed OnceTriggerStateMachine
      Visit composite nodes when checking for picklability.
      Upgrade beam bigtable client dependency to 0.9.7.1
      Add a Combine Test for Sliding Windows without Context
      [BEAM-2389] moved GcpCoreApiSurfaceTest to corresponding module, adapted exposed packagees
      Add Experimental annotation to AMQP and refine Kind for the Experimental IOs
      [BEAM-2488] Elasticsearch IO should read also in replica shards
      Use PCollectionViews.toAdditionalInputs in Combine
      Use PCollectionViews.toAdditionalInputs in ParDo
      Use PCollectionViews.toAdditionalInputs in ParDoMultiOverrideFactory
      Fix getAdditionalInputs for SplittableParDo transforms
      Add utility to expand list of PCollectionViews
      Read api with naive implementation
      Pre read api refactoring. Extract `SpannerConfig` and `AbstractSpannerFn`
      Bump spanner version
      [BEAM-1187] Improve logging to contain the number of retries done due to IOException and unsuccessful response codes.
      Add WindowFn#assignsToOneWindow
      Use installed distribution name for sdk name
      [BEAM-2522] upgrading jackson to 2.8.9 (mitigating apache#1599)
      Enable grpc controller in fn_api_runner
      Removed uses of proto builder clone method
      [BEAM-2514] Improve error message on missing required value
      [BEAM-1237] Create AmqpIO
      Implement streaming GroupByKey in Python DirectRunner
      Bump Dataflow worker to 0623
      Reintroduces DoFn.ProcessContinuation (Dataflow worker compatibility part)
      Remove old deprecated PubSub code
      Fix a typo in function args
      Avoid pickling the entire pipeline per-transform.
      Fix python fn API data plane remote grpc port access
      [BEAM-2745] Add Jenkins Suite for Python Performance Test
      [BEAM-2489] Use dynamic ES port in HIFIOWithElasticTest
      [BEAM-2497] Fix the reading of concat gzip files
      Allow output from FinishBundle in DoFnTester
      DataflowRunner: Reject merging windowing for stateful ParDo
    kennknowles committed Jun 30, 2017
    Configuration menu
    Copy the full SHA
    627ae0b View commit details
    Browse the repository at this point in the history

Commits on Jul 7, 2017

  1. Configuration menu
    Copy the full SHA
    c2d3fbc View commit details
    Browse the repository at this point in the history
  2. Upgrade to gearpump 0.8.4

    manuzhang committed Jul 7, 2017
    Configuration menu
    Copy the full SHA
    2206827 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    725f547 View commit details
    Browse the repository at this point in the history

Commits on Jul 10, 2017

  1. This closes apache#3515: Sync gearpump-runner with master and upgrade…

    … to gearpump 0.8.4
    
      Fix ParDoTest#testPipelineOptionsParameter
      Upgrade to gearpump 0.8.4
      Fix javadoc generation for AmqpIO, CassandraIO and HCatalogIO
      Simplified ByteBuddyOnTimerInvokerFactory
      Fix bad merge
      Made DataflowRunner TransformTranslator public
      Process timer firings for a window together
      Ignore processing time timers in expired windows
      Add timeout to initialization of partition in KafkaIO
      [BEAM-2534] Handle offset gaps in Kafka messages.
      Fix PValue input in _PubSubReadEvaluator
      Update SDK dependencies
      Disallow Combiner Lifting for multi-window WindowFns
      [BEAM-2553] Update Maven exec plugin to 1.6.0 to incorporate messaging improvements
      Website Mergebot Job
      Update Python SDK version
      [maven-release-plugin] prepare for next development iteration
      [maven-release-plugin] prepare branch release-2.1.0
      For GCS operations use an http client with a default timeout value.
      [BEAM-2530] Fix compilation of modules with Java 9 that depend on jdk.tools
      Make modules that depend on Hadoop and Spark use the same version property
      Fix DoFn javadoc: StateSpec does not require a key
      Add support for PipelineOptions parameters
      Properly convert milliseconds whether there's less than 3/more than 9 digits. TimeUtil did not properly convert (and returned null) when the number of digits for fractions of seconds was less than 3 digits or more than 9 digits. The solution is to pad with zeros when there is less than 3 digits and to truncate when there is more than 3.
    kennknowles committed Jul 10, 2017
    Configuration menu
    Copy the full SHA
    1ce60b4 View commit details
    Browse the repository at this point in the history

Commits on Jul 21, 2017

  1. Configuration menu
    Copy the full SHA
    e655f53 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    daa7566 View commit details
    Browse the repository at this point in the history

Commits on Jul 22, 2017

  1. Configuration menu
    Copy the full SHA
    49d4ed5 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2017

  1. Configuration menu
    Copy the full SHA
    b0ed584 View commit details
    Browse the repository at this point in the history