Commit
[HOTFIX] Fixed join query performance issue
Problem:
Some join queries take more time in Carbon because the exchange plan (scan + shuffle) is not reused. ReuseExchange checks whether two plans produce the same result; for Carbon this check always returns false because the equals method of SparkCarbonTableFormat is not overridden, so the comparison fails.

Solution:
Added an equals method in SparkCarbonTableFormat.

Tested with TPCH query:
select c_name, c_custkey, o_orderkey, o_orderdate, o_totalprice, sum(l_quantity)
from customer, orders, lineitem
where o_orderkey in (
    select l_orderkey from lineitem
    group by l_orderkey
    having sum(l_quantity) > 300
  )
  and c_custkey = o_custkey
  and o_orderkey = l_orderkey
group by c_name, c_custkey, o_orderkey, o_orderdate, o_totalprice
order by o_totalprice desc, o_orderdate;

This closes #2650
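
The effect described above can be illustrated with a minimal, self-contained sketch. It is not the CarbonData or Spark code: `FormatWithoutEquals`, `FormatWithEquals`, `ScanPlan`, and `canReuse` are hypothetical stand-ins, and `sameResult` here is a simplified comparison that only assumes plan equality depends on the equality of the file format object. Without an overridden equals, two separately created format instances compare by reference and never match, so a reuse check of this shape would always fail.

```java
import java.util.Objects;

public class ReuseExchangeSketch {

    // Hypothetical format class that inherits Object.equals (reference equality).
    static class FormatWithoutEquals {
    }

    // Hypothetical format class with value-based equality: any two instances are equal.
    static class FormatWithEquals {
        @Override
        public boolean equals(Object other) {
            return other instanceof FormatWithEquals;
        }

        @Override
        public int hashCode() {
            // Must be consistent with equals: all instances share one hash code.
            return FormatWithEquals.class.hashCode();
        }
    }

    // Simplified scan node: a table name plus the format object used to read it.
    static class ScanPlan {
        final String table;
        final Object format;

        ScanPlan(String table, Object format) {
            this.table = table;
            this.format = format;
        }

        // Assumed, simplified stand-in for a "same result" check between two plans.
        boolean sameResult(ScanPlan other) {
            return table.equals(other.table) && Objects.equals(format, other.format);
        }
    }

    // An exchange can be reused only when both plans are judged to give the same result.
    static boolean canReuse(ScanPlan first, ScanPlan second) {
        return first.sameResult(second);
    }

    public static void main(String[] args) {
        ScanPlan a = new ScanPlan("lineitem", new FormatWithoutEquals());
        ScanPlan b = new ScanPlan("lineitem", new FormatWithoutEquals());
        System.out.println("without equals: " + canReuse(a, b));

        ScanPlan c = new ScanPlan("lineitem", new FormatWithEquals());
        ScanPlan d = new ScanPlan("lineitem", new FormatWithEquals());
        System.out.println("with equals: " + canReuse(c, d));
    }
}
```

In the tested query, lineitem is scanned both in the outer join and in the subquery, which is the kind of repeated scan where such a reuse check matters.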