<a href="https://colab.research.google.com/github/YoheiShinozaki/BeamKatasColab/blob/master/Beam_Katas_07_Core_Transforms_Combine_Simple_Function.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Beam Katas on Colab

<!--
  ~  Licensed to the Apache Software Foundation (ASF) under one
  ~  or more contributor license agreements.  See the NOTICE file
  ~  distributed with this work for additional information
  ~  regarding copyright ownership.  The ASF licenses this file
  ~  to you under the Apache License, Version 2.0 (the
  ~  "License"); you may not use this file except in compliance
  ~  with the License.  You may obtain a copy of the License at
  ~
  ~      http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~  Unless required by applicable law or agreed to in writing, software
  ~  distributed under the License is distributed on an "AS IS" BASIS,
  ~  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  ~  See the License for the specific language governing permissions and
  ~  limitations under the License.
  -->

<html>
<h2>Combine - Simple Function</h2>
<p>
  Combine is a Beam transform for combining collections of elements or values in your data.
  When you apply a Combine transform, you must provide the function that contains the
  combining logic. That function should be commutative and associative, because it is not
  necessarily invoked exactly once on all values with a given key. The input data (including
  the value collection) may be distributed across multiple workers, so the combining function
  might be called several times to perform partial combining on subsets of the value
  collection, and then again on those partial results.
</p>
<p>
  Simple combine operations, such as sums, can usually be implemented as a simple function.
</p>
<p>
  <b>Kata:</b> Implement the summation of numbers using
  <a href="https://beam.apache.org/releases/pydoc/current/apache_beam.transforms.core.html#apache_beam.transforms.core.CombineGlobally">
    CombineGlobally</a>.
</p>
<br>
<div class="hint">
  Implement a simple Python function that performs the summation of the values.
</div>
<div class="hint">
  Refer to the Beam Programming Guide
  <a href="https://beam.apache.org/documentation/programming-guide/#simple-combines">
    "Simple combinations using simple functions"</a> section for more information.
</div>
</html>
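
The partial combining described above can be illustrated without Beam. The following is a minimal sketch in plain Python and is not how a Beam runner is actually implemented: the helper `combine_in_bundles` and the bundle size of 2 are hypothetical, chosen only to mimic values being split across workers. It contrasts `sum`, which is commutative and associative, with `statistics.mean`, which is not associative.

```python
from statistics import mean


def combine_in_bundles(combine_fn, values, bundle_size):
    """Hypothetical helper: combine each bundle, then combine the partial results."""
    bundles = [values[i:i + bundle_size] for i in range(0, len(values), bundle_size)]
    partials = [combine_fn(bundle) for bundle in bundles]
    return combine_fn(partials)


numbers = [1, 2, 3, 4, 5]

# sum is commutative and associative, so the bundling does not change the result.
sum_direct = sum(numbers)
sum_bundled = combine_in_bundles(sum, numbers, bundle_size=2)

# mean is not associative: averaging per-bundle averages weights the bundles unevenly.
mean_direct = mean(numbers)
mean_bundled = combine_in_bundles(mean, numbers, bundle_size=2)

print(sum_direct, sum_bundled)
print(mean_direct, mean_bundled)
```

Under these assumptions the two sums agree, while the bundled mean differs from the direct mean. This is why a plain function such as `sum` is a safe choice to pass to `CombineGlobally`.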

In [0]:
# Install Apache Beam quietly (notebook shell escape).
!pip install apache-beam -qqq

import apache_beam as beam
from apache_beam.runners.interactive import interactive_runner

## Python Collection

In [0]:
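# A Beam transform can be applied directly to a Python iterable;
# the result comes back as a plain list.
[1, 2, 3, 4, 5] | beam.CombineGlobally(sum)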

[15]

## Beam PCollection

In [0]:
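# Build the pipeline on the interactive runner so each step's output is displayed.
p = beam.Pipeline(interactive_runner.InteractiveRunner())

# Create a PCollection from an in-memory list, then sum all of its elements.
(p | 'Create' >> beam.Create([1, 2, 3, 4, 5])
   | 'Sum' >> beam.CombineGlobally(sum))

p.run()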

Running...

Using 0 cached PCollections
Executing 2 of 2 transforms.

Create produced {3, 4, 5, 2, 1}

Sum produced {15}

<apache_beam.runners.interactive.interactive_runner.PipelineResult at 0x7f667cbf7690>
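
In the output above, `Create` lists its elements in a different order from the input list, which is consistent with a PCollection having no guaranteed element order. The sketch below uses plain Python, with no Beam API involved, to suggest why the combining function should be commutative. `subtract_all` is a hypothetical order-sensitive function included only for contrast.

```python
from functools import reduce
from itertools import permutations


def subtract_all(values):
    """Hypothetical order-sensitive combiner: left-to-right subtraction."""
    return reduce(lambda a, b: a - b, values)


numbers = [1, 2, 3, 4, 5]

# Collect the distinct results each function yields over every possible ordering.
sum_results = {sum(order) for order in permutations(numbers)}
subtract_results = {subtract_all(order) for order in permutations(numbers)}

print(sum_results)
print(len(subtract_results))
```

`sum` yields a single result no matter how the elements are ordered, whereas the subtraction yields several different results, so its output would depend on the order in which elements happen to arrive.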