tushargoyal02/Celebal-Task

Celebal-Task

This repository contains several tasks related to data engineering.

Window functions with PySpark - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/2685669578609160/5764242340617371/latest.html

Distinct users within a given date - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/1155399167638853/5764242340617371/latest.html

Distinct users with date mapping (finalized file) - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/524104771162516/5764242340617371/latest.html

App status for total distinct users - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/524104771162535/5764242340617371/latest.html

Delimited text file in PySpark with a custom delimiter such as the dollar sign ("$") - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/3527025572877450/5764242340617371/latest.html

SQLZOO implementation in Scala/Python Spark:

SQLZOO Chapters 1 & 2 - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/1227106145213404/5764242340617371/latest.html

SQLZOO Chapter 3 - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/3882546477227230/5764242340617371/latest.html

SQLZOO Chapter 4 - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/3882546477227250/5764242340617371/latest.html

SQLZOO Chapter 5 - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/1832172108127233/5764242340617371/latest.html

SQLZOO Chapter 6 - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/3859837373276120/5764242340617371/latest.html

SQLZOO Chapter 8 (NULL use cases) - https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/1977730164728891/5764242340617371/latest.html


Scala programs and tasks

GroupBy iteration in pandas and DataFrames in Spark: https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/2185313371049953/5764242340617371/latest.html

Learn Scala files: https://books.google.co.in/books?id=QOzSBQAAQBAJ&lpg=PP1&pg=PP1#v=onepage&q&f=true

Hive internal and external tables, all cases: https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/750140149301887/5764242340617371/latest.html

Hive partitions and buckets: https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/3504377729219686/5764242340617371/latest.html

Partitioning and bucketing with Spark: https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/931328135249323/1973285828819995/5764242340617371/latest.html
