Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rw beam 9557 #11253

Closed
wants to merge 394 commits into from
Closed

Rw beam 9557 #11253

wants to merge 394 commits into from

Conversation

amaliujia
Copy link
Contributor

R @aaltay

cherry-pick #11252


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Choose reviewer(s) and mention them in a comment (R: @username).
  • Format the pull request title like [BEAM-XXX] Fixes bug in ApproximateQuantiles, where you replace BEAM-XXX with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

Post-Commit Tests Status (on master branch)

Lang SDK Apex Dataflow Flink Gearpump Samza Spark
Go Build Status --- --- Build Status --- --- Build Status
Java Build Status Build Status Build Status
Build Status
Build Status
Build Status
Build Status
Build Status Build Status Build Status
Build Status
Build Status
Python Build Status
Build Status
Build Status
Build Status
--- Build Status
Build Status
Build Status
Build Status
Build Status
--- --- Build Status
XLang --- --- --- Build Status --- --- Build Status

Pre-Commit Tests Status (on master branch)

--- Java Python Go Website
Non-portable Build Status Build Status
Build Status
Build Status Build Status
Portable --- Build Status --- ---

See .test-infra/jenkins/README for trigger phrase, status and link of all Jenkins jobs.

henryken and others added 30 commits March 9, 2020 22:37
* Add WithKeys lesson in Python Katas

* Add Windowing Adding Timestamp section using ParDo lesson

* Fix window import statement

* Add Windowing Fixed Time Window lesson

* Add WithKeys lesson in Python Katas

* Add Windowing Adding Timestamp section using ParDo lesson

* Fix window import statement

* Add Windowing Fixed Time Window lesson

* Update course on Stepik

* Add missing license header
[BEAM-5898] Upgrading gRPC to 1.27
…tegration with DirectRunner

[BEAM-8335] TeststreamService integration with DirectRunner
…dencies (apache#11093)

* Exclude signature files from test dependencies

* Apply suggestions from code review

Thanks Luke!

Co-Authored-By: Lukasz Cwik <lcwik@google.com>

Co-authored-by: Lukasz Cwik <lcwik@google.com>
…get and Make FlinkRunner compatible with Flink 1.10
[BEAM-8681] Upgrading grpc-google-cloud-pubsub-v1 to 1.85.1
The cleanup code in DefaultJobBundleFactory and its RemoteEnvironments may leak
resources. This is especially a concern when the execution engines reuses the
same JVM or underlying machines for multiple runs of a pipeline.

Exceptions encountered during cleanup should not lead to aborting the cleanup
procedure. Not all code handles this correctly. We should also ensure that the
cleanup succeeds even if the runner does not properly close the bundle,
e.g. when a exception occurs during closing the bundle.
[BEAM-9074] Update Python roadmap for 2.7 eol
…d_gbk_side_input

[BEAM-9402] Remove options overwrite
Change-Id: Ib61aa3fac53d9109178744e11eeebe5c5da0929c
…Cache to subclass the CacheManager

[BEAM-8335] Modify the StreamingCache to subclass the CacheManager
…ache#11106)

* Fixing apache_beam.io.gcp.bigquery_test:PubSubBigQueryIT. at head (apache#11106)
* Adding flink 1.10 to pipelineoptions
mwalenia and others added 21 commits March 26, 2020 08:11
[BEAM-9731] Add SideInputLoadTest implementation
… holds

Benchmarks have shown that the minimum output timestamp computation over all
timers is very expensive because we have to iterate over all keys in the state
backend. This is especially costly for RocksDB.

This change introduces a cache which stores and updates the timer output
timestamps such that the minimum can be retrieved efficiently.

Only at startup the cache is populated from the state backend. This should
give us roughly the same performance as before introducing output timestamps for
timers.

The following have been run with RocksDB:

Nexmark Query 12 before:
```
  Conf  Runtime(sec)    (Baseline)  Events(/sec)    (Baseline)       Results    (Baseline)
  0000         105.3                       950.0                        9039
  0001           2.2                      4466.3                         196
```

Nexmark query 12 with this change:
```
Performance:
  Conf  Runtime(sec)    (Baseline)  Events(/sec)    (Baseline)       Results    (Baseline)
  0000           6.0                     16730.8                        1919
  0001           0.8                     12391.6                         196
```

For more details please see BEAM-9566.
Reenable test_streaming_wordcount_it
Rename the test in streaming_wordcount_debugging_it_test.py
[BEAM-9340] Validate pipeline requirements in PipelineValidator.
pyarrow was released on Feb 2020.
[BEAM-9562] Remove TimerSpec from Proto
…Option to setOption on Option builder

[BEAM-9605] BIP-1: Rename setRowOption to setOption on Option builder
…Option to setOption on Option builder

[BEAM-9605] BIP-1: Rename setRowOption to setOption on Option builder
@amaliujia
Copy link
Contributor Author

O actually this is not right way to create cherry-pick PR. closing now.

@amaliujia amaliujia closed this Mar 27, 2020
@amaliujia amaliujia deleted the rw-beam-9557 branch March 29, 2020 00:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet