A generic non-blocking linked-in driver for interfacing Erlang and C
C Erlang Makefile
Switch branches/tags
Nothing to show
Failed to load latest commit information.
src Refactored + resolved potential race condition during thread initiali… Sep 13, 2014
.gitignore Added a simple test to verify it works Oct 5, 2013
LICENSE Refactored + resolved potential race condition during thread initiali… Sep 13, 2014


A Generic Linked-in Driver for Erlang

This is an attempt to provide a lightweight and clean implementation of a generic linked-in driver for communication between Erlang and C/C++ programs. It was designed from the ground up to support multi-threaded, non-blocking operations, has the ability to hold state across calls and is entirely customizable. The rationale behind this generic driver is to provide a default and clean implementation for communication between those two languages and to minimize the need to write boilerplate code.

Of course there is the Generic Erlang Port Driver (GEPD), which lets you generate a driver tailored to your needs by specifying function bindings via C macros. However, this generic linked-in driver implementation takes a different approach and provides the programmer with an easy to use interface without the need to generate any code and without any external dependencies. Additionally, the name of the driver is set via compiler flag, so that multiple instances of the same driver can be compiled and identified by different names without the need for code duplication. This is very handy if you design your program to be configured during compilation and need different configurations.

There's an example linked-in driver included with the source which demonstrates the features of this implementation. You can dive into the code right away or keep reading for a detailed reference of features and a simple how-to.

Theory of Operation

The driver consists of two parts: the Erlang part and the C part. On the Erlang side the driver is implemented as a generic server. Thus, running a driver in a supervision hierarchy is simple and also recommended. Every (!) request to the server can be synchronous (call) or asynchronous (cast). Naturally, synchronous requests wait for the server to respond, and return the result, while asynchronous requests return immediately with ok. In the latter case, any errors on the C side (except the driver crashing due to a segfault or other critical error) are not reported. Therefore, asynchronous requests should be used with caution.

Receiving the binary sent by the Erlang part, the C driver parses the request, passes it to a worker thread for non-blocking execution and returns immediately. This approach is mandatory, as the main thread on the C side is executed synchronously, thus blocking the VM scheduler. Linked-in drivers run in the same address space as the Erlang virtual machine, so beware:

When a driver is loaded it is executed in the context of the emulator, shares the same memory and the same thread. This means that all operations in the driver must be non-blocking, and that any crash in the driver will bring the whole emulator down. In short: you have to be extremely careful!

Thus, before you continue, you should make sure that you have a thorough understanding of the C language, or more precisely of the art of memory management. You should also know how to use a debugger (gdb is awesome).

Enough talk, let's play.


The generic linked-in driver can be used with plain Erlang or with any OTP application. The project itself is compiled using Rebar, so in order to play with the example driver, just cd into the root directory and make the project. Then go to the ebin directory and invoke the erl shell:

{ ok, Pid } = gen_driver_test:start_link(). % => { ok,<0.33.0> }
gen_driver_test:sum(Pid, [1,2,3,4]).        % => { ok, 10.0 }
gen_driver_test:ping(Pid).                  % => ok
gen_driver_test:stats(Pid).                 % => { ok, [{ driver, 2 }, { thread, 2 }] }

The stats/1 function returns the number of calls to the driver and the invoked worker thread. If you start the virtual machine with erl, these numbers will always be the same. This is because the request is processed synchronously, as no asynchronous worker threads are present. If you start the virtual machine with erl +A 4, and thus four worker threads, you will begin to see a difference.

Try it.

If you want to use the generic linked-in driver to create your own specific driver, you basically have two options: using the linked-in driver as a dependency or as a copy.

As a dependency via Rebar (recommended)

Using Rebar (or another build-tool) is absolutely recommended, as the process of building a driver is somehow non-trivial. Just open your rebar.config and add the following lines to your dependencies:

{ deps, [
  { gen_driver, ".*",
    { git, "git://github.com/squidfunk/generic-linked-in-driver.git", "master" }

After fetching your dependencies with rebar get-deps, you can start driver development. Rebar expects your C source to be located in a directory called c_src. If it isn't there already, create it. Then you just have to create a new file, include the gen_driver.h header file and define the necessary callbacks. The header can be included with:

#include "gen_driver.h"

During compilation, the folders containing the header files have to be included with the following flag:

-I deps/gen_driver/c_src

See the section discussing the build process for more information.

As a copy

If you don't use a build-tool or want the source to be situated in the same directory as your specific driver, you can just move gen_driver.c and gen_driver.h to your c_src directory, move the Erlang part consisting of gen_driver.erl to src, and get started. Using this approach, including the header files and linking them during compilation is easier, as all files are located in the same directory. However, you will mix up your business logic and dependencies, which is considered bad practice in general.



Creating your own specific linked-in driver using this generic implementation is quite simple. Take a look at the example driver gen_driver_test.erl. The driver is started with:

{ ok, Pid } = gen_driver:start_link(Path, Name).

A tuple containing the pid is returned, which is the process identifier of the associated generic server. Actions can then be called by invoking the generic server functions call and cast:

gen_server:call(Pid, { port, Cmd[, Params] }). % => { ok, Response }
gen_server:cast(Pid, { port, Cmd[, Params] }). % => ok

The Cmd parameter must be an integer which is used for identifying the action to perform on the C side. It is recommended to define those numbers as macros in Erlang and C. While parameters are optional, any valid Erlang term can be passed.

In the case of our example linked-in driver defined in gen_driver_test.erl, there are two synchronous (sum/2, stats/1) and one asychronous action (ping/1). Those functions are just simple wrappers which call the respective C functions:

% Take an arbitrary mixed list of integers and floats and add all numbers,
% returning the resulting sum.
-spec sum(Pid :: pid(), Numbers :: [float(), ...])
  -> { ok, float() } | { error, atom() }.
sum(Pid, Numbers) ->
  gen_server:call(Pid, { port, ?CMD_SUM, Numbers }).


Going to the C side [sic!], six callbacks need to be implemented. A callback-based architecture enables the definition of custom initializers on a driver- and thread-level. While the driver initializers are executed during the start up phase, thread initializers are executed upon the first invocation of a thread. Taken from gen_driver.h, these callbacks are:

 * Callbacks which manage memory of driver-specific state data.
void *init();
void destroy(void *drv_state);

 * Callbacks which manage memory of thread-specific state data.
void *thread_init();
void thread_destroy(void *trd_state);

 * Callbacks which perform load-balancing and the dispatch of the request.
unsigned int *balance(int cmd, unsigned char syn, unsigned int *key);
void dispatch(gd_req_t *req, gd_res_t *res, void *drv_state, void *trd_state);

All callbacks must be defined, even if there's no thread-specific data to hold or no custom balancing among threads. Thus, a minimal implementation which can be used as a skeleton when developing a new driver could be:

#include <stdlib.h>
#include <stdio.h>

#include <erl_driver.h>
#include <ei.h>

#include "gen_driver.h"

 * Callback to initialize the application-relevant state data when opening the
 * port driver and to return a pointer to the newly created driver state.
void *
init() {
  return NULL;

 * Upon closing the port, this callback is invoked in order to free all memory
 * allocated to the driver state.
destroy(void *drv_state) { }

 * Initialize any thread-specific data. This is called, when first dispatching
 * a request to a thread.
void *
thread_init() {
  return NULL;

 * Upon closing the port, this callback is invoked in order to free all memory
 * allocated to thread-specific data.
thread_destroy(void *trd_state) { }

 * Load balancing among threads. Balancing is implemented as a modulo
 * operation: % THREADS. Return null for round-robin strategy.
unsigned int *
balance(int cmd, unsigned char syn, unsigned int *key) {
  return NULL;

 * Dispatch an asynchronous request by invoking the respective callback. If no
 * matching command is found, return an error.
dispatch(gd_req_t *req, gd_res_t *res, void *drv_state, void *trd_state) {
  switch (req->cmd) {

    /* Branch here */

      error_set(res, "command");

Now let's have a look at the dispatch callback which is the entry point for requests coming from the Erlang virtual machine. The request as such is contained in the first parameter, encoded as a binary, and must be explicitly parsed using the Erlang Interface functions. This is far from being a complicated task, so for example, parsing a tuple of the form { question, "Why are we here?" } is as simple as:

ei_decode_tuple_header(req->buf, &req->index, &size);
ei_decode_atom(req->buf, &req->index, "question");
ei_decode_string(req->buf, &req->index, &question);

However, we should check for the return values of the Erlang Interface functions, as they return with a status > 0 when an error occurred. Please see handle_sum/4 in gen_driver_test.c for a more complex example. Then, after translating an Erlang term into native C datatypes, and doing our calculations, we can take the opposite direction and encode the result using the same interface:

ei_encode_tuple_header(res->buf, &res->index, 2);
ei_encode_atom(res->buf, &res->index, "answer");
ei_encode_string(res->buf, &res->index, "Plastic!\0");

While parsing the request involves the first parameter req, the result is encoded into the second parameter res. Of course. The result buffer is allocated 64 bytes, so after writing some bytes one has to check if the allocated memory is still sufficient with res->index < 64. If it isn't, just re-allocate some more bytes.

If an error is encountered during parsing or processing, the helper function error_set/2 can be used to exit from a handler. The first parameter has to be the respective res struct, the second the atom to return, so e.g.:

return error_set(res, "badtype"); // => { error, badtype }

This translates to the tuple { error, badtype } on the Erlang side. You don't have to clean the buffer, just set the error and return. However, when you return, you must make sure that all resources that were allocated are freed. Otherwise your driver will leak memory, and finally, crash your Erlang application.

Also remember that asynchronous calls are not able to return anything to the Erlang side. Even if you write something to the result buffer, the contents are discarded upon termination. This may be desired behaviour, as every function on the C side can be called in both ways. However, it just doesn't make any sense when you're fetching entries from a database.

And that is basically it. Now that you learned the basics, you should take a closer look at gen_driver_test.c, in order to understand the way how the generic driver is used. Interfacing with the generic linked-in driver, the only things that need to be done are initializing resources that need to be persistent across calls and bound to the driver or single threads, decoding the request and encoding the result. Oh, and your custom business logic of course.

Everything else is already implemented.


Now that we're done implementing our driver, we need to build it. As already stated, it is absolutely recommended to use Rebar or a similar build-tool for this task. Building our newly created linked-in driver, we need to specify the driver path (e.g. priv) and name (e.g. test) which must match the file name of the resulting binary (test.so), and link against the generic driver:

{ port_specs, [
  { ".*", "priv/test.so", [
    "deps/gen_driver/c_src/gen_driver.c", "c_src/*.c*"
  ], [
    { env, [
      { "CFLAGS", "$CFLAGS -std=c99 -I deps/gen_driver/c_src -D DRIVER_NAME=test" }
    ] }
  ] }
] }.
{ post_hooks, [
  { compile, "`which rm` -f deps/gen_driver/c_src/*.o" }
] }.

The generic driver is implemented using the C99 standard, so in case GCC is used for compilation, the respective flag needs to be set, since GCC defaults to C89. The path where the generic driver source is located must also be included. As mentioned before, multiple instances of the same driver with different names and configurations can be compiled without any code duplication.


Copyright (c) 2012-2014 Martin Donath

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.