Sample program for demo of spark program with cucumber framework
- Employee dataframe with data
employee_id | employee_name | dept_id | salary |
---|---|---|---|
101 | "Rohit P" | 10 | 1000 |
102 | "Pooja P" | 10 | 1000 |
103 | "Rutu M" | 10 | 400 |
104 | "Rushi M" | 20 | 4000 |
105 | "Prithvi D" | 20 | 6000 |
106 | "Rajani D" | 30 | 10000 |
107 | "Shrikant D" | 30 | 5000 |
108 | "Rahul S" | 30 | 3000 |
- Calculated average salary per department dataframe
dept_id | avg_sal_per_dept |
---|---|
30 | 6000.0 |
20 | 5000.0 |
10 | 800.0 |
- Expected average salary per department dataframe
dept_id | avg_sal_per_dept |
---|---|
10 | 800.0 |
20 | 5000.0 |
30 | 6000.0 |