- 🔭 I am a data enthusiast with extensive experience in big data tools and cloud technologies. I have worked on various projects involving data ingestion, processing, and analysis, and I am proficient in tools like Hadoop, Spark, and Hive. My experience with Amazon Web Services (AWS) includes working with S3, EC2, and EMR, and I have implemented scalable and fault-tolerant data pipelines using these services.
Pinned
- complex-data-processing-spark (Public)
  This project demonstrates how to use Apache Spark to process JSON and Avro data. The code includes operations such as flattening complex JSON data, selecting specific columns, joining data, and fil…
  Scala 1
- movie-rating-spark (Public)
  This is a simple project that demonstrates how to use Apache Spark to analyze movie ratings data.
  Scala
- pyspark-transformations (Public)
  This project demonstrates various data manipulation techniques on Spark dataframes such as reading and processing data from different file formats, applying filters and maps, and creating unified c…
  Python
- spark-incremental-batchprocessing (Public)
  This project showcases how to efficiently process and write incremental data in batches to an RDBMS (MySQL) using Spark. The code provides a comprehensive solution for processing large volumes of d…
  Scala
- SparkSQL (Public)
  This project demonstrates how to use Spark SQL to execute SQL queries on structured data in Spark, and display the results in a tabular format using the show() method.
  Scala
- pyspark-currency-conversion (Public)
  This project demonstrates how to perform linear regression using Spark, convert currency rates using data from a MySQL table, and integrate the results into a MySQL database.
  Python
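
The incremental batch idea described for spark-incremental-batchprocessing can be illustrated with a small sketch. This is not the repository's code: it assumes a high-water-mark approach (copy only rows whose id is greater than the last one already written), and it swaps in Python's built-in sqlite3 for both Spark and MySQL so that it runs anywhere. The `orders` table, its columns, and the names `make_db` and `load_increment` are hypothetical.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a hypothetical `orders` table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")
    return conn


def load_increment(source, target, last_id, batch_size=2):
    """Copy rows with id > last_id from source to target, one batch at a time.

    Returns the new watermark (the highest id written so far).
    """
    while True:
        rows = source.execute(
            "SELECT id, amount FROM orders WHERE id > ? ORDER BY id LIMIT ?",
            (last_id, batch_size),
        ).fetchall()
        if not rows:
            return last_id  # nothing new: the watermark is unchanged
        target.executemany("INSERT INTO orders (id, amount) VALUES (?, ?)", rows)
        target.commit()  # commit per batch so a failure loses at most one batch
        last_id = rows[-1][0]  # advance the watermark to the last id written


source, target = make_db(), make_db()

# First run: the source holds ids 1..5 and the target is empty.
source.executemany(
    "INSERT INTO orders VALUES (?, ?)", [(i, i * 10.0) for i in range(1, 6)]
)
watermark = load_increment(source, target, last_id=0)

# Second run: only the two newly arrived rows are read and written.
source.executemany("INSERT INTO orders VALUES (?, ?)", [(6, 60.0), (7, 70.0)])
watermark = load_increment(source, target, last_id=watermark)
```

In a real pipeline the watermark would be persisted between runs (for example in a small control table), and the read and write steps would go through Spark's JDBC data source rather than direct database connections.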