Welcome to Impala

Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters.

Impala is a modern, massively distributed, massively parallel C++ query engine that lets you analyze, transform, and combine data from a variety of data sources. Its key features include:

Best-of-breed performance and scalability.
Support for data stored in HDFS, Apache HBase and Amazon S3.
Wide analytic SQL support, including window functions and subqueries.
On-the-fly code generation using LLVM to produce CPU-efficient code tailored to each query.
Support for the most commonly used Hadoop file formats, including the Apache Parquet (incubating) project.
Apache-licensed, 100% open source.
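
As a rough illustration of the analytic SQL mentioned above, the sketch below combines a window function with a subquery to pick the top seller per region. It is not taken from the Impala sources: it runs against SQLite through Python's standard library purely as a stand-in engine so that it is self-contained, and the `sales` table, its columns, and its rows are hypothetical. The query text uses standard `RANK() OVER (PARTITION BY ...)` syntax, which is assumed to carry over to Impala without changes.

```python
import sqlite3

# Stand-in engine: SQLite from the Python standard library, NOT Impala.
# Window functions need SQLite 3.25 or newer.
conn = sqlite3.connect(":memory:")

# Hypothetical table and data, used only for this illustration.
conn.execute("CREATE TABLE sales (region TEXT, rep TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("east", "ann", 300),
        ("east", "bob", 500),
        ("west", "cat", 200),
        ("west", "dan", 700),
        ("west", "eve", 400),
    ],
)

# The inner query (a subquery in FROM) ranks reps within each region by
# amount; the outer query keeps only the top-ranked rep per region.
QUERY = """
SELECT region, rep, amount
FROM (
    SELECT region, rep, amount,
           RANK() OVER (PARTITION BY region ORDER BY amount DESC) AS rnk
    FROM sales
) AS ranked
WHERE rnk = 1
ORDER BY region
"""

top_reps = conn.execute(QUERY).fetchall()
print(top_reps)
```

Against a real cluster the same query text would be submitted through an Impala client rather than `sqlite3`; connection details are deliberately left out here because they depend on the deployment.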

More about Impala

To learn more about Impala as a business user, or to try Impala live or in a VM, please visit the Impala homepage.

If you are interested in contributing to Impala as a developer, or learning more about Impala's internals and architecture, visit the Impala wiki.

