Skip to content
Permalink
Branch: master
Find file Copy path
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
73 lines (51 sloc) 2.59 KB
:orphan:
.. _hadoop-connector:
============================
MongoDB Connector for Hadoop
============================
.. include:: /includes/attention-docs-eol.rst
.. default-domain:: mongodb
.. contents:: On this page
:local:
:backlinks: none
:depth: 1
:class: singlecol
.. include:: /includes/hadoop-eol-spark.rst
The MongoDB Connector for Hadoop is a plugin for Hadoop that provides
the ability to use MongoDB as an input source and/or an output destination.
The source code is available on `Github
<https://github.com/mongodb/mongo-hadoop>`_ where you can find a more
comprehensive `wiki
<https://github.com/mongodb/mongo-hadoop/wiki>`_.
If you have questions please email the `mongodb-user Mailing List
<http://groups.google.com/group/mongodb-user>`_. For any issues please
file a ticket in `Jira <https://jira.mongodb.org/browse/HADOOP>`_.
This guide also includes the following documentation:
- :doc:`/tutorial/getting-started-with-hadoop`
- :doc:`/use-cases/hadoop`
Installation
------------
#. Obtain the Hadoop connector. You may either download the JARs from the
`Maven Central Repository
<http://repo1.maven.org/maven2/org/mongodb/mongo-hadoop/>`_ or `build it
yourself
<https://github.com/mongodb/mongo-hadoop/blob/master/README.md#building>`_.
The JARs are universal and will work with any version of Hadoop.
#. Obtain a JAR for the `MongoDB Java Driver <http://mongodb.github.io/mongo-java-driver/?jmp=docs>`_.
#. Move these JARs onto each node of the Hadoop cluster.
You may provision each node so that the jars are somewhere on Hadoop's `CLASSPATH`
(e.g. ``$HADOOP_PREFIX/share/hadoop/common``), or you can use the Hadoop
`DistributedCache <http://hadoop.apache.org/docs/stable/api/org/apache/hadoop/filecache/DistributedCache.html>`_
to move the JARs onto pre-existing nodes.
For more complete install instructions, please see the installation instructions
for your platform on the `Hadoop Connector wiki <https://github.com/mongodb/mongo-hadoop/wiki>`_.
Presentations
-------------
- `MongoDB and Hadoop: Driving Business Insights <http://www.mongodb.com/presentations/mongodb-and-hadoop-driving-business-insights>`_
by Sandeep Parikh, 2014
- `Webinar: Using MongoDB + Hadoop Together <http://www.mongodb.com/presentations/webinar-using-hadoop-mongodb-together>`_
by Buzz Moschetti, 2014
Blog Posts
----------
- `MongoDB Hadoop Connector 1.4 Improvements <https://mongodbcom-rails.corp.mongodb.com/blog/post/mongodb-connector-for-hadoop-1-4>`_
- `MongoDB Hadoop Connector Announcement <https://www.mongodb.com/post/20840407875/mongodb-hadoop-connector-announced>`_
You can’t perform that action at this time.