
[Presto-integration-Technical-note] created documentation for presto integration #2568

Open · wants to merge 1 commit into master from presto-documentation
Conversation

@vandana7
No description provided.

@vandana7 force-pushed the presto-documentation branch 4 times, most recently from d4bcce8 to cb76f68 on July 27, 2018 at 07:12
@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7550/

@CarbonDataQA

Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6308/

@CarbonDataQA

Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6304/

@CarbonDataQA

Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7557/

@CarbonDataQA

Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6312/

* ConnectorHandleResolver

1. **CarbonDataConnector:** Implements the Connector interface of Presto.
1. **CarbonDataMetadata:** Implements the ConnectorMetadata interface of Presto. The connector metadata interface has a large number of important methods that allow Presto to look at lists of schemas, lists of tables, lists of columns, and other metadata about a particular data source.
Contributor

It would be better to add these descriptions to the code as annotations.

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7682/

@ravipesala

SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/6081/

@CarbonDataQA

Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6413/


On the other side, Apache Spark is a lightning-fast cluster computing technology designed for fast computation. It is based on Hadoop MapReduce and extends the MapReduce model to use it efficiently for more types of computations, including interactive queries and stream processing. The main feature of Spark is its in-memory cluster computing, which increases the processing speed of an application.

When dealing with CarbonData, both have their own advantages, but Presto is far better than Spark at executing 90% of the queries: the Presto-CarbonData vector readers are highly optimized and reduce table scan time when dealing with large tables. Even in the case of dictionary aggregation and multiple-table joins, Presto performs much better due to its own optimized way of handling these operations.
Contributor

Remove the words "far better".

Contributor Author

Done, removed the words "far better".

* Provide the link between the Functional Requirement and the detailed Technical Design documents.
* Detail the functionality which will be provided by each component or group of components and show how the various components interact in the design.

This document is not intended to address installation and configuration details of the actual implementation. Installation and configuration details are provided in the technology guides on the CarbonData wiki page. As is true with any high-level design, this document will be updated and refined based on changing requirements.
Contributor

What do you mean by "installation and configuration details of the actual implementation"?

@vandana7 (Contributor Author), Aug 3, 2018

"Installation and configuration details of the actual implementation" means that we are not providing documentation for installation and configuration of the Presto integration with CarbonData here; we have a separate document for that.

Contributor Author

To make it clearer, I have linked the installation and configuration guide for integrating CarbonData with Presto from this document. Anyone who wants to know about installation and configuration can easily visit that document page.


* #### _Scope_
Presto integration with CarbonData will allow execution of CarbonData queries on the Presto CLI. CarbonData can easily be added as a data source among the multiple heterogeneous data sources that Presto supports.
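As a minimal sketch of what adding CarbonData as a data source could look like, assuming the `carbondata` connector name and the `carbondata-store` property from the integration guide (the file path and store location below are purely illustrative):

```
# etc/catalog/carbondata.properties: registers CarbonData as a Presto catalog
connector.name=carbondata
# hypothetical CarbonData store location; adjust to your cluster
carbondata-store=hdfs://namenode:8020/user/hive/warehouse/carbon.store
```

Once the catalog is registered, tables become queryable from the Presto CLI as `carbondata.<schema>.<table>`.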
Contributor

This should be "CarbonData integration with Presto"; CarbonData is not an execution engine.

Contributor Author

done.


1. **Presto**

Integration of Presto with CarbonData includes an implementation of the Connector API of Presto.
Contributor

This should be "CarbonData with Presto".

Contributor Author

done

* Support for Apache CarbonData as a Data Source in Presto.
* Execution of Apache CarbonData Queries on Presto.

## Design Considerations
Contributor

Can we add a design diagram from Presto that talks about the integration of data sources?

Contributor Author

Done


**Performance Optimization by changing Queries:**

- There’s a probability that GROUP BY becomes a little faster if you carefully order the list of fields within GROUP BY from highest to lowest cardinality.
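For illustration, a hypothetical pair of queries where `uid` is assumed to have far higher cardinality than `gender`:

```
-- Faster: high-cardinality column listed first in GROUP BY
SELECT uid, gender, COUNT(*) FROM users GROUP BY uid, gender;

-- Slower: low-cardinality column listed first
SELECT uid, gender, COUNT(*) FROM users GROUP BY gender, uid;
```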
Contributor

Change the sentence. The word "probability" need not be used.

Contributor Author

done

- Specify large tables first in the join clause


The default join algorithm of Presto is the broadcast join, which partitions the left-hand side table of a join and sends (broadcasts) a copy of the entire right-hand side table to all of the worker nodes that hold the partitions. This works when the right-hand table is small enough to fit within one node (usually less than 2 GB). If you observe an ‘Exceeded max memory xxGB’ error, it usually means the right-hand side table is too large. Presto does not perform automatic join reordering, so make sure your large table precedes small tables in any join clause.
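For example, with hypothetical `large_table` (fact) and `small_table` (dimension) tables:

```
-- Good: the large table comes first; only small_table is broadcast
SELECT COUNT(*) FROM large_table l JOIN small_table s ON l.id = s.id;

-- Risky: the large table ends up on the right-hand side and is broadcast to every worker
SELECT COUNT(*) FROM small_table s JOIN large_table l ON s.id = l.id;
```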
Contributor

Check whether we can use wording like "fact table" for the left-hand side and "dimension table" for the right-hand side.

Contributor Author

done.



**Note:** If you still see the memory issue, try a distributed hash join. This algorithm partitions both the left and right tables using the hash values of the join keys, so it works even if the right-hand side table is large, but performance can be slower because it increases the number of network data transfers. To turn on the distributed join, embed the following session property as an SQL comment:
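As a sketch of what this could look like: Presto releases of that era exposed a `distributed_join` session property (newer releases use `join_distribution_type` instead), so the embedded comment might be:

```
-- set session distributed_join = 'true'
SELECT ... FROM large_table l, small_table s WHERE l.id = s.id
```

In a plain Presto CLI session the equivalent would be `SET SESSION distributed_join = true;`.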
Contributor

Can we specify how to use the distributed hash join?

Contributor Author

done

```
query.max-memory=210GB
```
This property value should be set according to the RAM available across all cluster worker nodes.
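A purely illustrative sizing sketch, assuming a hypothetical cluster of 5 workers with 48 GB of JVM heap each, alongside Presto's companion `query.max-memory-per-node` limit:

```
# 5 workers x 48 GB heap = 240 GB total; cap query memory below that total
query.max-memory=210GB
# per-node cap, kept below a single worker's heap
query.max-memory-per-node=42GB
```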
Contributor

Total RAM available in the cluster (sum of all nodes' RAM).

Contributor Author

Done

```
SELECT ... FROM large_table l, small_table s WHERE l.id = s.id
```

**Performance Optimization by using certain Configuration Properties:**
Contributor

Put lower values as defaults if values need to be specified. Better not to give values at all, as they will be copy-pasted directly.

Contributor Author

@sraghunandan can you please provide some more clarity on this point? I am not able to understand it; I have not provided any values, only the columns used here.

@@ -0,0 +1,27 @@
<!--
Member

Please remove this section for now, as the report is not fair.

Reasons:

  1. The Spark-CarbonData TPC-H results mentioned on the website do not match these Spark-CarbonData results; a few queries show huge differences due to a machine problem [these machines are not in the same rack].
  2. A comparison report should also include machine details [RAM, VM/bare metal]; this was not mentioned.

Contributor Author

done

@ravipesala

SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/6174/

@ravipesala

SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/6175/

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7795/

@CarbonDataQA

Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6519/

@sraghunandan

LGTM

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/37/

@CarbonDataQA

Build Success with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/8786/

@CarbonDataQA

Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/719/

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/541/

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/1990/

@CarbonDataQA

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2167/

@CarbonDataQA

Build Failed with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10242/

@CarbonDataQA

Build Failed with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/66/

@CarbonDataQA

Build Failed with Spark 2.3.2, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/65/

@CarbonDataQA1

Build Failed with Spark 2.4.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.4/66/
