Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Facebook Data Warehouse based on Apache Hadoop 0.20
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
.eclipse.templates
bin
conf Initial commit of hadoop-20-warehouse
ivy
lib
nativelib/lzma Initial commit of hadoop-20-warehouse
src
APACHE-README.txt
CHANGES.txt
FB-CHANGES.txt
LICENSE.txt
NOTICE.txt
README.txt
YAHOO-CHANGES.txt
build.xml
ivy.xml

README.txt

This is a version of code that runs on Facebook's data warehouse clusters and 
is powered by Apache Hadoop.  The size of the biggest single cluster is 30 PB 
and has 3000 nodes.

This code is based on Apache Hadoop 0.20.

FB-CHANGES.txt contains the additional pathches that have been committed to
the original code base.

PLEASE NOTE: 
 
 * This distribution includes cryptographic software that 
   is subject to U.S. export control laws and applicable 
   export and import laws of other countries. BEFORE using 
   any software made available from this site, it is your 
   responsibility to understand and comply with these laws. 
   This software is being exported in accordance with the 
   Export Administration Regulations. As of June 2009, you 
   are prohibited from exporting and re-exporting this 
   software to Cuba, Iran, North Korea, Sudan, Syria and 
   any other countries specified by regulatory update to 
   the U.S. export control laws and regulations. Diversion 
   contrary to U.S. law is prohibited.  

Something went wrong with that request. Please try again.