Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can I join two tables using SQL workloads? Also, can I query data from Hive table? #138

Closed
akasaki opened this issue Jan 5, 2018 · 4 comments

Comments

@akasaki
Copy link

akasaki commented Jan 5, 2018

Spark-Bench Version: spark-bench_2.1.1_0.2.2

Spark Version on Your Cluster: 2.1.1.2.6.1.0

Scala Version on Your Spark Cluster: 2.11.8

Can I join two tables with SQL workload? Also, can I query data from Hive table?

@ecurtin
Copy link
Contributor

ecurtin commented Jan 10, 2018

Currently the SQL workload only loads one table, but I can imagine a fix for that. In the config file could be something like:

{
// NOT REAL SYNTAX!!
  name = "sql"
  input = { 
                  "FIRST"  -> "hdfs://location/of/first/table.csv",
                 "SECOND" -> "hdfs://location/of/second/table.parquet"
               }
  output = "/tmp/sql-query-results.parquet"
  query = "SELECT FIRST.OrderID, FIRST.CustomerName, SECOND.OrderDate
                  FROM $(FIRST)
                  INNER JOIN $(SECOND) ON SECOND.CustomerID=FIRST.CustomerID;"
}

@akasaki
Copy link
Author

akasaki commented Jan 11, 2018

Thank you very much, @ecurtin ! That will be perfect. Is it able to be easily implemented?

Btw, happy New Year!!

@ecurtin
Copy link
Contributor

ecurtin commented Jan 11, 2018

@akasaki Happy new year to you too! I think this would be relatively straightforward to implement in the SQL workload. I'll make a ticket for it but it might be awhile until I can tackle it.

@ecurtin
Copy link
Contributor

ecurtin commented Jan 19, 2018

I'm going to close this ticket,but feel free to reopen it or create a new one if you have questions!

@ecurtin ecurtin closed this as completed Jan 19, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants