ngiridharan/hadoop

 
 

HADOOP

About
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than relying on hardware to deliver high availability, the library itself is designed to detect and handle failures at the application layer, delivering a highly available service on top of a cluster of computers, each of which may be prone to failure.
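To give a feel for what a "simple programming model" can look like, here is a minimal sketch of the map-and-reduce style of word counting, written in plain Java. It does not use the Hadoop API and runs in a single process; the class name `WordCountSketch` and its methods are hypothetical names chosen for illustration only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative only: plain Java, no Hadoop classes, single process.
public class WordCountSketch {

    // "Map" step: turn one line of text into (word, 1) pairs.
    static List<Map.Entry<String, Integer>> mapLine(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(Map.entry(word, 1));
            }
        }
        return pairs;
    }

    // "Reduce" step: sum the values emitted for each distinct word.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    // Runs both steps over every line. On a cluster, the lines would be
    // spread across machines; here everything happens locally.
    static Map<String, Integer> countWords(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(mapLine(line));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        List<String> lines = List.of("hadoop stores data", "hadoop processes data");
        // prints {data=2, hadoop=2, processes=1, stores=1}
        System.out.println(countWords(lines));
    }
}
```

Because each line is mapped independently and counts are combined per word, the work can in principle be split across many machines, which is the property the description above relies on.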
Who Uses Hadoop?
A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop PoweredBy wiki page.

Download link - https://hadoop.apache.org/releases.html

Latest Release Information – 3.3.4 - 2022 Aug 8
This is a release in the Apache Hadoop 3.3 line.
It contains a small number of security and critical integration fixes since 3.3.3.
Users of Apache Hadoop 3.3.3 should upgrade to this release.
Users of Hadoop 2.x and Hadoop 3.2 should also upgrade to the 3.3.x line. Besides offering feature enhancements, it is the only branch currently receiving fixes for anything other than critical security or data-integrity issues.
Users are encouraged to read the overview of major changes since release 3.3.3. For details of bug fixes, improvements, and other enhancements since 3.3.3, please check the release notes and changelog.



For the latest information about Hadoop, please visit our website at:

   http://hadoop.apache.org/

and our wiki, at:

   https://cwiki.apache.org/confluence/display/HADOOP/
