Skip to content

dgleich/hadoopcxx

Repository files navigation

Hadoop C++ Utilities for streaming

By David F. Gleich

This library implements various helper utilities for working with Hadoop streaming programs in C++. It uses the C++ typedbytes library for the typedbytes based communication.

Synposis

This package provides a small series of utilities to make working with hadoop streaming programs in C++ easier.

  • the typedbytes C++ library
  • hadoop_* commands for common streaming interaction

Example

Here is an example of how to use some of the utility routines

hadoop_message("started hadoop C++ streaming program %s\n", argv[0]);
hadoop_status("initializing");
hadoop_counter("status messages",1);
hadoop_status("computing");
hadoop_counter("status messages",2);
sleep(5):
hadoop_counter("sleeps",1);
hadoop_status("finalizing');
hadoop_status("status messages",3);

See typedbytes for an example of how to use that library.

Usage

The simplest way to use hadoopcxx is just to include

#include <hadoopcxx.hpp>

This will include all the headers and implementations. This is only appropriate if your code has a single file, however. Such programs are rather common, so that's why we mention it first.

Alternatively, you need to

#include <hadoopcxx.h>

And link:

-libhadoopcxx

Note that libhadoopcxx.a includes libtypedbytes.a, so only the former is required.

About

Hadoop C++ Utilities for streaming

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published