@colinmarc colinmarc released this Oct 26, 2016 · 220 commits to master since this release

Assets 4

This release represents a major rewrite of sequins. We've been running this new code internally for more than six months, and we think it's in pretty good shape!

The main difference is that sequins can now run as a cluster, partitioning and replicating data across multiple machines. There are lots of other changes, as well, including:

To say the least, this release is highly backwards incompatible with sequins 0.4.x. Moreover, input data that was organized for 0.4.x needs to be organized slightly different in S3 or HDFS. You can read more about the new source root layout here.

Additionally, since v1.0.0rc1, a few issues have been fixed:

  • #60: A rare deadlock in the zookeeper code was fixed, and automatic GCing of old zookeeper nodes was added
  • #59: A bug was fixed where sequins would cancel in-flight proxied requests
  • #58: Sequins can now start up with no data, to make the quickstart guide simpler
Pre-release

@colinmarc colinmarc released this Oct 4, 2016 · 266 commits to master since this release

Assets 3

This is the first draft release of v1.0, a major rewrite of sequins. We've been running this new code internally for more than six months, and we think it's in pretty good shape!

The main difference is that sequins can now run as a cluster, partitioning and replicating data across multiple machines. There are lots of other changes, as well, including:

To say the least, this release is highly backwards incompatible with sequins 0.4.x. Moreover, input data that was organized for 0.4.x needs to be organized slightly different in S3 or HDFS. You can read more about the new source root layout here.

@colinmarc colinmarc released this Sep 21, 2015 · 476 commits to master since this release

Assets 3

This minor release fixes a bug in the previous release - downloading data to temporary directories before renaming.

It also includes a new utility, sequins-dump, to dump out the data in sequencefiles. You use it like this:

$ sequins-dump test_data/0/
Alice     Practice
Bob       Hope
Charlie   Horse

It also has options to dump any combination of offsets, keys, and values:

$ sequins-dump --help
usage: sequins-dump [<flags>] <PATH>

Flags:
  --help         Show help (also see --help-long and --help-man).
  -o, --offsets  Display offsets.
  -k, --keys     Display keys.
  -v, --values   Display values.
  --version      Show application version.

@colinmarc colinmarc released this Aug 15, 2015 · 478 commits to master since this release

Assets 3

This point release contains one minor change: instead of downloading data directly into the destination local dir, sequins will first download to a temporary dir and then rename the directory into place.

@praboud praboud released this Apr 13, 2015 · 485 commits to master since this release

Assets 3

This fixes the previous broken release where the index would get unset immediately after starting the HTTP server, causing 500's to return when trying to make HTTP requests.

@praboud praboud released this Apr 10, 2015 · 488 commits to master since this release

Assets 3

This version brings two changes:

  1. A bugfix to remove a race condition which could cause requests to fail when hotloading a new version of the dataset.
  2. Add a X-Sequins-Version header to responses, which indicates the current version of the dataset.

@colinmarc colinmarc released this Mar 12, 2015 · 495 commits to master since this release

Assets 3

This minor release brings two small changes:

  • A bug has been fixed where sequins would keep indexed files open long after it had switched to a new version.
  • Sequins now writes out a .manifest file once it has fully indexed a dataset, and it can load from that on startup instead of indexing everything all over again.

@colinmarc colinmarc released this Feb 5, 2015 · 502 commits to master since this release

Assets 3

This is another bugfix release, for a second regression introduced in v0.4.0. Because of a copied lock, some rare race conditions were introduced to the read path.

@colinmarc colinmarc released this Feb 5, 2015 · 505 commits to master since this release

Assets 3

This is a hotfix release for a regression introduced in 0.4.1. In that version, the sync marker in sequencefiles could sometimes be read incorrectly, causing sequins to think that the file was corrupted.

@colinmarc colinmarc released this Feb 4, 2015 · 509 commits to master since this release

Assets 3

This release comes with some performance optimizations, as well as some code cleanup. The two major changes are:

  • Reading SequenceFiles now requires significantly less heap bloat; this should reduce memory usage for large datasets considerably
  • Indexing now happens in serial, instead of in parallel - this means it's slightly slower, but thrashes the disk a lot less.