GitHub - poryfly/fastcache: Wicked fast, thread safe in-memory key/object store for C++

fastcache

Wicked fast, thread safe in-memory key/object store for C++

Testing with 30 simultaneous threads
Object size is 16 bytes
Loading up cache...30000 objects stored in 0.041198 sec
Reading cache...30000 objects read in 0.0038482 sec

Features

Did I mention fast?
Thread safe
Apache 2.0 license

Introduction

Sometimes using an external caching solution like memcached or Redis is just unnecessary.

What!

No, think about it. Key/value stores are great if...

You need to access cached data from several different forward-facing servers
Your code is written in PHP or something non-persistent (request based)
You clustering

On the other hand, what if you...

Want to store entire objects natively
Know exactly what kind of data you are caching
Only need your cache available to one API (like a Thrift server daemon)
Need multiple worker threads to access the cache
Want supreme performance

Impetus

We wrote a miultithreaded server using Apache Thrift in C++. We then realized that maintaining several connections to Redis was a little silly, because we were working within the framework of a native, persistent, C++ daemon that was already the unified access point for all our API calls. In short, we had access to the same raw resources as Redis did. And unlike Redis, we could allow all our happy worker threads to access the data at the same time (Redis is effectively single threaded, BTW).

Design

fastcache is a template class. So you can store anything in it, and you can use as a key anything that boost::hash accepts (like int or std::string). If you replace the hashing function with your own, you can even do away with that requirement.

The design challenge was to not just store data, but to do so in a way that multiple threads could access it concurrently with a minimum of blocking. Obviously, we could use a single large std::map and then mutex it, but we would have a ton of threads all waiting on each other. Instead, we shard the cache into M maps. We then use a hash function to calculate which map (shard) will hold a particular piece of data. For T concurrent threads reading or writing data, we will have (mostly) non blocking operation as long as our hash generates an even distribution and M >> T. T defaults to 256 but can be changed by setting FASTCACHE_SHARDSIZE.

Objects are passed to and from the cache wrapped in boost::shared_ptr. Create all cache objects using boost::shared_ptr and then let them go out of scope as soon as you set them.

When in mutable mode (see below), we use the .unique() method of boost::shared_ptr in order to see if we have a lock. When operating in this mode, do not add the same object to the cache twice! Doing so will make the cache think the object is in use. You will then get a FastcacheObjectLocked exception every time you attempt to .get() it.

Operational modes

The cache runs in one of two modes, depending on whether you #define FASTCACHE_MUTABLE_DATA:

Default mode - This means that you can get an object from the cache and hold onto it as long as you like without fear of blocking other threads. If the object is changed in the meantime (i.e., replaced - actual mutations within the cache are not supported!) your copy will remain safe. You cannot mutate the object itself. This is the faster of the two modes
Mutable mode, where FASTCACHE_MUTABLE_DATA is defined. In this mode, you have a lock on the data as long as your share_ptr is valid. The lock does not keep the object from being removed from the cache, it just means you have an exclusive read/write lock on the object itself. Another attempt to read the data will result in a FastcacheObjectLocked exception. Since the shard is locked while a get() operation is taking place, Fastcache itself cannot just block in this case (it would block all operations on the entire shard!). It is YOUR responsibility in this case to catch() the exception, sleep, retry, etc. In mutable mode, you must be careful to make sure object references go out of scope as soon as possible so you don't block other requests. Obviously, this mode will be a little slower than non-mutable mode.

Again, DON'T MUTATE anything you put into or take out of the cache, unless you have #defined FASTCACHE_MUTABLE_DATA!! This is very important to avoid trouble with multiple threads. When you get something from the cache, make a copy of the data (not just the pointer) before you do anything except read it

Example

#include <iostream>
#include <string>
#include <boost/shared_ptr.hpp>
#include "Fastcache.h"

using namespace std;
using namespace active911;
using boost::shared_ptr;

class TestClass {
public:
	int id;
	string name;
};

int main(int argc, char **argv) {

	// Make data for cache
	shared_ptr<TestClass>in=shared_ptr<TestClass>(new TestClass());
	in->id=23;
	in->name="My name";

	// Create cache as string->TestClass store.  Add data
	Fastcache<string, TestClass>cache;
	cache.set("MyDataKey",in);

	// Fetch back
	shared_ptr<TestClass>out=cache.get("MyDataKey");

	cout << out->id << ":" << out->name << endl;

	return 0;
}

Installation

On Debian, you will need to run the following

apt-get install build-essential libboost-dev libboost-thread-dev
ldconfig

then you can run make

Configuration

setting FASTCACHE_CURATOR_SLEEP_MS will change how often the curator thread collects garbage. This won't have an effect on key expiration, it just means it will be a little lazier with the memory usage (++ performance though). The default setting is 30000 (30 seconds). If your cache is large, you might make this number larger since it has to traverse the whole cache every time it does its job.
FASTCACHE_SHARDSIZE may need to be increased if you have a large number of threads accessing the cache. It will use only a little more memory, but should really help in keeping shard contention low. Make sure you only set it to a power of two so hash modulus can be calculated easily!

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
test		test
.gitignore		.gitignore
Fastcache.h		Fastcache.h
LICENSE		LICENSE
Makefile		Makefile
NOTICE		NOTICE
README.md		README.md
example.cpp		example.cpp
run_tests.pl		run_tests.pl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test

test

.gitignore

.gitignore

Fastcache.h

Fastcache.h

LICENSE

LICENSE

Makefile

Makefile

NOTICE

NOTICE

README.md

README.md

example.cpp

example.cpp

run_tests.pl

run_tests.pl

Repository files navigation

fastcache

Features

Introduction

Impetus

Design

Operational modes

Example

Installation

Configuration

About

Releases

Packages

Languages

License

poryfly/fastcache

Folders and files

Latest commit

History

Repository files navigation

fastcache

Features

Introduction

Impetus

Design

Operational modes

Example

Installation

Configuration

About

Resources

License

Stars

Watchers

Forks

Languages