Real-time Query for Hadoop
C++ Java Python Thrift C Shell Other
Latest commit 7c956c @dhecht dhecht Harrison Sheinblatt committed Skip test_load_hive_table
Another test snuck in that uses Hive directly. Need to skip
during Isilon and S3 testing.

Change-Id: I56e5b241839dfc7a39b52b9d480eec207df254b3
Reviewed-by: Dan Hecht <>
Tested-by: Dan Hecht <>
be IMPALA-2531: null_probe_rows != NULL DCHECK failed
bin Perf Framework: Move exec functions to a separate file and deprecate …
cmake_modules Toolchain Cleanup and ASAN Improvements
common Update the Avro scanner bad version header error message to include t…
ext-data-source Upgrade a few important mvn plugins.
fe IMPALA-2495: make Expr::IsConstant() recurse on children
infra/python Python: Upgrade impyla to bring in bug fix
llvm-ir Move IR cross compile output to a better folder for packaging.
shell IMPALA-2309: Compute stats query return error if set LIVE_PROGRESS=true
ssh_keys Move ssh keys from bin directory to fix packaging build break
testdata IMPALA-2480, IMPALA-2519: Don't force IO-buffer on probe side when sp…
tests Skip test_load_hive_table
thirdparty Add cdh5.7.0-SNAPSHOT Hadoop/HBase/Hive/LLAMA/Sentry dependencies.
www Add HdrHistogram and HistogramMetric
.gitignore Add MetricDefs, static definitions of metric metadata generated from …
CMakeLists.txt Toolchain Cleanup and ASAN Improvements
LICENSE.txt Add text of Apache license
NOTICE.txt Add NOTICE.txt file to Impala repo Fix link syntax for IMPALA-2284: Disallow long (1<<30) strings in group_concat()

Welcome to Impala

Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters.

Impala is a modern, massively distributed, massively parallel C++ query engine that lets you analyze, transform, and combine data from a variety of data sources. Its features include:

  • Best-of-breed performance and scalability.
  • Support for data stored in HDFS, Apache HBase and Amazon S3.
  • Broad analytic SQL support, including window functions and subqueries.
  • Runtime code generation with LLVM, producing CPU-efficient code tailored to each query.
  • Support for the most commonly used Hadoop file formats, including the Apache Parquet (incubating) project.
  • Apache-licensed, 100% open source.

More about Impala

To learn more about Impala as a business user, or to try Impala live or in a VM, please visit the Impala homepage.

If you are interested in contributing to Impala as a developer, or learning more about Impala's internals and architecture, visit the Impala wiki.
