Skip to content


Subversion checkout URL

You can clone with
Download ZIP
Real-time Query for Hadoop
C++ Java Python Thrift C Shell Other
Latest commit a55f805 @henryr henryr committed with Internal Jenkins IMPALA-2682: Change default buffer size for RPC servers and clients
This patch sets the default buffer size for TBufferedTransport - used by
all Thrift clients and servers - to 128KB, from the previous value of
512. The higher value allows Thrift to do more buffering and less
copying of data to aggregate multiple small packets.

Change-Id: I7de78b99d59ceef641f610eb95c31c6ed126c466
Reviewed-by: Marcel Kornacker <>
Tested-by: Internal Jenkins

Welcome to Impala

Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters.

Impala is a modern, massively-distributed, massively-parallel, C++ query engine that lets you analyze, transform and combine data from a variety of data sources:

  • Best of breed performance and scalability.
  • Support for data stored in HDFS, Apache HBase and Amazon S3.
  • Wide analytic SQL support, including window functions and subqueries.
  • On-the-fly code generation using LLVM to generate CPU-efficient code tailored specifically to each individual query.
  • Support for the most commonly-used Hadoop file formats, including the Apache Parquet (incubating) project.
  • Apache-licensed, 100% open source.

More about Impala

To learn more about Impala as a business user, or to try Impala live or in a VM, please visit the Impala homepage.

If you are interested in contributing to Impala as a developer, or learning more about Impala's internals and architecture, visit the Impala wiki.

Something went wrong with that request. Please try again.