Mirror of Apache Hive

HIVE-6956 : Duplicate partitioning column for union when dynamic partition sort optimization is enabled (Prasanth J via Ashutosh Chauhan)

git-svn-id: 13f79535-47bb-0310-9956-ffa450edef68
latest commit 2f74bcedd2
ashutoshc authored April 24, 2014
ant HIVE-6752: Vectorized Between and IN expressions don't work with deci… March 28, 2014
beeline HIVE-6927 : Add support for MSSQL in schematool (Deepesh Khandelwal v… April 22, 2014
bin HIVE-6709 : HiveServer2 help command is not recognizing properly. (Y.… March 20, 2014
checkstyle HIVE-1198. When checkstyle is activated for Hive in Eclipse environme… May 29, 2010
cli HIVE-6779 : Hive cli may get into inconsistent state when Ctrl-C is h… March 31, 2014
common HIVE-6916: Export/import inherit permissions from parent directory (S… April 21, 2014
conf HIVE-6887 Add missing params to hive-default.xml.template (Harish But… April 11, 2014
contrib HIVE-6808 : sql std auth - describe table, show partitions are not be… April 03, 2014
data HIVE-6883 : Dynamic partitioning optimization does not honor sort ord… April 12, 2014
docs Reverting HIVE-4508 May 10, 2013
findbugs HIVE-3099. add findbugs in build.xml (Ransom Hezhiqiang via egc) June 10, 2012
hbase-handler HIVE-6817: Some hadoop2-only tests need diffs to be updated (jdere, r… April 04, 2014
hcatalog HIVE-6944 : WebHCat e2e tests broken by HIVE-6432 (Eugene Koifman via… April 23, 2014
hwi HIVE-6880 : TestHWISessionManager fails with -Phadoop-2 (Jason Dere v… April 10, 2014
itests HIVE-6916: Export/import inherit permissions from parent directory (S… April 21, 2014
jdbc HIVE-5847: DatabaseMetadata.getColumns() doesn't show correct column … April 12, 2014
lib HIVE-2761: Remove lib/javaewah-0.3.jar (ecapriolo via hashutosh) February 25, 2012
metastore HIVE-6862 : add DB schema DDL and upgrade 12to13 scripts for MS SQL S… April 17, 2014
odbc Preparing for 0.14 development March 05, 2014
packaging HIVE-6906 Fix assembly/src.xml so that sr tar ball contains top level… April 15, 2014
ql HIVE-6956 : Duplicate partitioning column for union when dynamic part… April 24, 2014
serde HIVE-6822 : TestAvroSerdeUtils fails with -Phadoop-2 (Jason Dere via … April 08, 2014
service HIVE-6907 HiveServer2 - wrong user gets used for metastore operation … April 15, 2014
shims HIVE-6745 : HCat MultiOutputFormat hardcodes DistributedCache keyname… April 14, 2014
testlibs HIVE-2518 pull junit jar from maven repos via ivy March 28, 2012
testutils HIVE-6773 : Update readme for ptest2 framework (Szehon Ho via Brock N… April 08, 2014
.arcconfig HIVE-2588 [jira] Update arcconfig to include commit listener November 17, 2011
.checkstyle HIVE-2930 [jira] Add license to the Hive files April 17, 2012
.gitattributes Wincompat : Add .cmd/text/crlf to .gitattributes (Sushanth Sowmyan vi… September 16, 2013
.gitignore HBASE-4388 - Upgrade HBase to 0.96 (Brock Noland, Sushanth Sowmyan, G… November 13, 2013
.reviewboardrc HIVE-6481. Add .reviewboardrc file (Carl Steinbach via cws) February 28, 2014
LICENSE HIVE-3100. Add HiveCLI that runs over JDBC (Prasad Mujumdar via cws) July 02, 2012
NOTICE HIVE-6482 Fix NOTICE file: pre release task (Harish Butani via Thejas… March 04, 2014
README.txt HIVE-5489 : NOTICE copyright dates are out of date, README needs upda… October 09, 2013
RELEASE_NOTES.txt HIVE-6917 Update Release Notes for Hive 0.13 RC2 April 15, 2014
doap_Hive.rdf HIVE-2433. add DOAP file for Hive December 19, 2011
pom.xml HIVE-6870 : Fix maven.repo.local setting in Hive build (Jason Dere vi… April 10, 2014
Apache Hive (TM) @VERSION@

The Apache Hive (TM) data warehouse software facilitates querying and
managing large datasets residing in distributed storage. Built on top
of Apache Hadoop (TM), it provides:

* Tools to enable easy data extract/transform/load (ETL)

* A mechanism to impose structure on a variety of data formats

* Access to files stored either directly in Apache HDFS (TM) or in other
  data storage systems such as Apache HBase (TM)

* Query execution via MapReduce
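
As a rough illustration of the last point, a query such as
"SELECT page, COUNT(*) FROM logs GROUP BY page" can be thought of as a
map phase that emits (page, 1) pairs, a shuffle that groups the pairs
by key, and a reduce phase that sums each group. The plain-Java sketch
below imitates those three phases in memory. It is only a conceptual
model: it uses no Hive or Hadoop classes, the class name, method names,
query, and sample rows are all hypothetical, and the plans Hive actually
generates are more involved.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Conceptual sketch only: not Hive or Hadoop code.
public class GroupByAsMapReduce {

    // Map phase: emit one (page, 1) pair per input row.
    static List<Map.Entry<String, Integer>> map(List<String> pages) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String page : pages) {
            pairs.add(new AbstractMap.SimpleEntry<>(page, 1));
        }
        return pairs;
    }

    // Shuffle phase: collect all values that share a key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: sum the values of each group to get COUNT(*).
    static Map<String, Integer> reduce(Map<String, List<Integer>> grouped) {
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : grouped.entrySet()) {
            int sum = 0;
            for (int value : group.getValue()) {
                sum += value;
            }
            counts.put(group.getKey(), sum);
        }
        return counts;
    }

    // Chains the three phases, as a GROUP BY with COUNT(*) would.
    static Map<String, Integer> countByPage(List<String> pages) {
        return reduce(shuffle(map(pages)));
    }

    public static void main(String[] args) {
        // Hypothetical sample rows standing in for a web log table.
        List<String> pages = Arrays.asList("/home", "/about", "/home");
        System.out.println(countByPage(pages)); // prints {/about=1, /home=2}
    }
}
```

In a real cluster each phase runs as distributed tasks rather than
in-memory loops, which is where the job submission and scheduling
overhead comes from.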

Hive defines a simple SQL-like query language, called QL, that enables
users familiar with SQL to query the data. At the same time, this
language allows programmers who are familiar with the MapReduce
framework to plug in their custom mappers and reducers to
perform more sophisticated analysis that may not be supported by the
built-in capabilities of the language. QL can also be extended with
custom scalar functions (UDFs), aggregations (UDAFs), and table
functions (UDTFs).
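
The three kinds of extension function differ mainly in shape: a scalar
function maps one input value to one output value, an aggregation maps
many input values to one, and a table function maps one input to zero
or more output rows. The plain-Java sketch below shows only those
shapes. It does not use Hive's extension API, and the class and method
names are hypothetical; real UDFs, UDAFs, and UDTFs are written against
Hive's own Java classes.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Conceptual sketch only: not written against Hive's extension API.
public class FunctionKinds {

    // Scalar (UDF-like): one input value in, one output value out.
    static String toLower(String s) {
        return s == null ? null : s.toLowerCase(Locale.ROOT);
    }

    // Aggregation (UDAF-like): many input values in, one value out.
    // Null values are skipped; an empty input yields 0.
    static int maxLength(List<String> values) {
        int max = 0;
        for (String value : values) {
            if (value != null && value.length() > max) {
                max = value.length();
            }
        }
        return max;
    }

    // Table function (UDTF-like): one input in, zero or more rows out.
    static List<String> explodeWords(String line) {
        if (line == null || line.trim().isEmpty()) {
            return new ArrayList<>();
        }
        return Arrays.asList(line.trim().split("\\s+"));
    }

    public static void main(String[] args) {
        System.out.println(toLower("HIVE"));
        System.out.println(maxLength(Arrays.asList("a", "abc", "ab")));
        System.out.println(explodeWords("one two  three"));
    }
}
```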

Please note that Hadoop is a batch processing system, and Hadoop jobs
tend to have high latency and incur substantial overhead in job
submission and scheduling. Consequently, the average latency for Hive
queries is generally very high (minutes), even when the data sets
involved are very small (say, a few hundred megabytes). As a result,
Hive cannot be compared with systems such as Oracle, where analyses are
conducted on a significantly smaller amount of data but proceed much
more iteratively, with response times between iterations of less than
a few minutes. Hive aims to provide acceptable (but not optimal)
latency for interactive data browsing, queries over small data sets, or
test queries.

Hive is not designed for online transaction processing and does not
support real-time queries or row-level inserts and updates. It is best
used for batch jobs over large sets of immutable data (like web logs).
Hive values scalability (scaling out as more machines are added
dynamically to the Hadoop cluster), extensibility (through the MapReduce
framework and UDF/UDAF/UDTF), fault tolerance, and loose coupling with
its input formats above all.

General Info

For the latest information about Hive, please visit our website at:

Getting Started

- Installation Instructions and a quick tutorial:

- A longer tutorial that covers more features of HiveQL:

- The HiveQL Language Manual:


Requirements

- Java 1.6

- Hadoop 0.20.x (x >= 1)

Upgrading from older versions of Hive

- Hive @VERSION@ includes changes to the MetaStore schema. If
  you are upgrading from an earlier version of Hive, it is imperative
  that you upgrade the MetaStore schema by running the appropriate
  schema upgrade scripts located in the scripts/metastore/upgrade
  directory.
- We have provided upgrade scripts for MySQL, PostgreSQL, Oracle and Derby
  databases. If you are using a different database for your MetaStore
  you will need to provide your own upgrade script.

Useful mailing lists

1. - To discuss and ask usage questions. Send an
   empty email to in order to subscribe
   to this mailing list.

2. - For discussions about code, design and features.
   Send an empty email to in order to
   subscribe to this mailing list.

3. - In order to monitor commits to the source
   repository. Send an empty email to
   in order to subscribe to this mailing list.