hyperj/note.sql-engine

Introduction

The current version is organized around Spark SQL 2.x and draws on open-source projects for the mainstream distributed SQL compute engines.

Spark SQL

  • Spark Core (RDD APIs), Data Source Connectors
  • Catalyst Optimization, Tungsten Execution
  • SparkSession, Dataset/DataFrame APIs, SQL
  • Structured Streaming, MLlib, GraphFrames, TensorFrames

Reference

  • Spark SQL: Spark SQL is Apache Spark's module for working with structured data.
  • Hive: The Apache Hive™ data warehouse software facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage. A command line tool and JDBC driver are provided to connect users to Hive.
  • Presto: Distributed SQL Query Engine for Big Data.
