Resolve: #4 + Spark RDD example #13

icharo-tb · 2022-11-03T23:32:17Z

I deleted the whole repository and cloned it again, therefore, I have no Hadoop commits saved anymore, but since they are already on the original repo, its ok. I changed this 2 Spark files again, set them up in a new branch called "spark_rdd".

Hope that everything goes neat now. I will update the repo everytime from now on to avoid having this kind of problems again.
If there is something you'd like me to change, just tell me! :3

Tools/Apache Spark.md

Tutorials/Apache Spark RDD example.md

JPHaus · 2022-11-04T00:04:05Z

Tutorials/Apache Spark RDD example.md

+nums2 = sc.parallelize([3,2,1,4,5])
+evens = nums2.filter(lambda elem: elem%2==0)
+odds = nums2.filter(lambda elem: elem%2!=0)
+
+order = pairs.union(impairs)
+order.takeOrdered(5)
+```
+```
+[1, 2, 3, 4, 5]


Great example and explantation!

JPHaus

Thank you again for the contributions - I hope you don't mind the minor formatting edit I made.

icharo-tb · 2022-11-04T00:06:36Z

No problem at all! I'm okay with the changes so far :) it's my pleasure to contribute and also being able to learn while contributing.

Resolve: data-engineering-community#4 + Spark RDD example

9698158

JPHaus reviewed Nov 3, 2022

View reviewed changes

Tools/Apache Spark.md Outdated Show resolved Hide resolved

JPHaus reviewed Nov 3, 2022

View reviewed changes

Tools/Apache Spark.md Outdated Show resolved Hide resolved

JPHaus reviewed Nov 3, 2022

View reviewed changes

Tools/Apache Spark.md Outdated Show resolved Hide resolved

Make headers h2

ee266fa

JPHaus reviewed Nov 3, 2022

View reviewed changes

Tools/Apache Spark.md Outdated Show resolved Hide resolved