✨🎉 This repository explains Apache Spark with practical examples 🎉✨
- It is a data processing engine developed to address the problem of working with real-time data.
- It is maintained by the Apache Software Foundation.
- It is lightning fast and returns results almost instantly.
- It uses lazy evaluation, i.e. transformations are not executed until an action requires their result.
- Because MapReduce could not handle real-time data, Spark came into the picture to help.
- It is now used by many large companies, such as Oracle, Amazon, Microsoft, Visa, Cisco, Verizon, and Hortonworks.
- Along with those listed above, around 3000 companies use Apache Spark.
- To download Apache Spark, visit: Download Apache Spark
- For Mac: Apache Spark Installation
- Python
- Scala
- Copy the code from the respective script and paste it into the corresponding Spark shell to run it.
- Open a terminal (in a Linux environment) or Command Prompt (on Windows).
- Run "python <script name>"
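
The lazy evaluation mentioned above can be illustrated without a Spark installation. The sketch below is a plain-Python analogy, not the Spark API: `LazyDataset`, `square`, and the other names are hypothetical and exist only for this example. Transformations (`map`, `filter`) merely record what should happen, and the work is done only when the action (`collect`) is called.

```python
# Plain-Python analogy of lazy evaluation; this is NOT the Spark API.
# LazyDataset and all names below are hypothetical, for illustration only.


class LazyDataset:
    def __init__(self, source, steps=()):
        self._source = source        # any iterable of input records
        self._steps = tuple(steps)   # recorded transformations, not yet run

    def map(self, fn):
        # Transformation: record the step and return a new dataset.
        return LazyDataset(self._source, self._steps + (("map", fn),))

    def filter(self, predicate):
        # Transformation: record the step and return a new dataset.
        return LazyDataset(self._source, self._steps + (("filter", predicate),))

    def collect(self):
        # Action: run every recorded step and return the results as a list.
        results = []
        for item in self._source:
            keep = True
            for kind, fn in self._steps:
                if kind == "map":
                    item = fn(item)
                elif not fn(item):
                    keep = False
                    break
            if keep:
                results.append(item)
        return results


calls = []  # records every time the mapping function actually runs


def square(x):
    calls.append(x)
    return x * x


# Building the pipeline does no work yet.
pipeline = LazyDataset(range(1, 6)).map(square).filter(lambda x: x % 2 == 1)
calls_before_action = len(calls)

# The action triggers the computation.
result = pipeline.collect()

print("calls before action:", calls_before_action)
print("result:", result)
```

Saved to a file, this sketch can be run with `python <script name>` like the other Python scripts. Spark's own transformations and actions work on distributed data, which this single-process analogy does not attempt to model.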
To check the license visit LICENSE