A working guide to kestrel
==========================

Kestrel is a very simple message queue that runs on the JVM and uses the
memcache protocol (with some extensions) to talk to clients.

A single kestrel server has a set of queues identified by a name, which is
also the filename of that queue's journal file (usually in
`/var/spool/kestrel`). Each queue is a strictly-ordered FIFO of "items" of
binary data. Usually this data is in some serialized format like JSON or
ruby's marshal format.

Generally queue names should be limited to alphanumerics `[A-Za-z0-9]`, dash
(`-`) and underscore (`_`). In practice, kestrel doesn't enforce any
restrictions other than that the name can't contain slash (`/`) because that
can't be used in filenames, tilde (`~`) because it's used for temporary files,
plus (`+`) because it's used for fanout queues, and dot (`.`) because it's
reserved for future use. Queue names are case-sensitive, but if you're running
kestrel on OS X or Windows, you will want to refrain from taking advantage of
this, since the journal filenames on those two platforms are *not*
case-sensitive.

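
The naming rules above can be checked on the client side before a queue is
ever referenced. This is a minimal sketch, not something kestrel provides:
the `QueueNames` object and its method names are hypothetical, and rejecting
empty names is an assumption made here.

```scala
// Hypothetical client-side helper; kestrel does not ship this object.
object QueueNames {
  // Characters the guide lists as rejected or reserved.
  private val special: Set[Char] = Set('/', '~', '+', '.')

  // Conservative rule: only alphanumerics, dash, and underscore.
  def isRecommended(name: String): Boolean =
    name.matches("[A-Za-z0-9_-]+")

  // Looser rule: non-empty (an assumption of this sketch) and free of the
  // special characters. Fanout names such as "orders+audit" fail this check
  // on purpose, because "+" is not part of a plain queue name.
  def isPlainName(name: String): Boolean =
    name.nonEmpty && !name.exists(c => special.contains(c))

  // Journal filenames are case-insensitive on OS X and Windows, so two names
  // that differ only by case would collide there.
  def collideOnCaseInsensitiveFs(a: String, b: String): Boolean =
    a != b && a.equalsIgnoreCase(b)
}
```
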
A cluster of kestrel servers is like a memcache cluster: the servers don't
know about each other, and don't do any cross-communication, so you can add as
many as you like. Clients have a list of all servers in the cluster, and pick
one at random for each operation. In this way, each queue appears to be spread
out across every server, with items in a loose ordering.

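
Since the servers don't coordinate, server selection lives entirely in the
client. A rough sketch of the "pick one at random for each operation" rule
follows. The `KestrelCluster` class is hypothetical, and the host names and
port in the usage comment are placeholders, not values taken from this guide.

```scala
import scala.util.Random

// Hypothetical client-side view of a cluster: just a list of addresses.
final class KestrelCluster(servers: Vector[String], rng: Random = new Random()) {
  require(servers.nonEmpty, "at least one server is required")

  // Pick one server uniformly at random; call this once per operation.
  def pick(): String = servers(rng.nextInt(servers.length))
}

// Usage (placeholder addresses):
//   val cluster = new KestrelCluster(Vector("kestrel1:22133", "kestrel2:22133"))
//   val target  = cluster.pick()
```
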
When kestrel starts up, it scans the journal folder and creates queues based
on any journal files it finds there, to restore state to the way it was when
it last shut down (or was killed or died). New queues are created by referring
to them (for example, adding or trying to remove an item). A queue can be
deleted with the "delete" command.


Configuration
-------------

The config files for kestrel are scala expressions loaded at runtime, usually
from `production.scala`, although you can use `development.scala` by passing
`-Dstage=development` to the java command line.

The config file evaluates to a `KestrelConfig` object that's used to configure
the server as a whole, a default queue, and any overrides for specific named
queues. The fields on `KestrelConfig` are documented here with their default
values:
[KestrelConfig.html](http://robey.github.com/kestrel/doc/main/api/net/lag/kestrel/config/KestrelConfig.html)

To confirm the current configuration of each queue, send "dump_config" to
a server (which can be done over telnet).

To reload the config file on a running server, send "reload" the same way.
You should immediately see the changes in "dump_config", to confirm. Reloading
will only affect queue configuration, not global server configuration. To
change the server configuration, restart the server.

Logging is configured according to `util-logging`. The logging configuration
syntax is described here:
[util-logging](https://github.com/twitter/util/blob/master/util-logging/README.markdown)

Per-queue configuration is documented here:
[QueueBuilder.html](http://robey.github.com/kestrel/doc/main/api/net/lag/kestrel/config/QueueBuilder.html)


Full queues
-----------

A queue can have the following limits set on it:

- `maxItems` - total items in the queue
- `maxSize` - total bytes of data in the items in the queue

If either of these limits is reached, no new items can be added to the queue.
(Clients will receive an error when trying to add.) If you set
`discardOldWhenFull` to true, then all adds will succeed, and the oldest
item(s) will be silently discarded until the queue is back within the item
and size limits.

`maxItemSize` limits the size of any individual item. If an add is attempted
with an item larger than this limit, it always fails.

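
To make the interaction between these limits concrete, here is a toy
in-memory model. It is only a sketch of the behavior described above, not
kestrel's implementation: the `BoundedQueue` class is hypothetical, and
treating a limit as "reached" when the current count or byte total is at or
above it is an assumption.

```scala
import scala.collection.mutable

// Toy model of maxItems / maxSize / maxItemSize / discardOldWhenFull.
final class BoundedQueue(maxItems: Int, maxSize: Long, maxItemSize: Long,
                         discardOldWhenFull: Boolean) {
  private val items = mutable.Queue.empty[Array[Byte]]
  private var bytes = 0L

  def length: Int = items.length
  def peekOldest: Option[Array[Byte]] = items.headOption

  // Returns false when the add is rejected.
  def add(item: Array[Byte]): Boolean = {
    // An oversized item always fails, whatever the other settings are.
    if (item.length > maxItemSize) return false
    // While a limit is reached, either reject or drop the oldest item.
    while (items.length >= maxItems || bytes >= maxSize) {
      if (!discardOldWhenFull || items.isEmpty) return false
      bytes -= items.dequeue().length
    }
    items.enqueue(item)
    bytes += item.length
    true
  }
}
```
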

The journal file
----------------

The journal file is the only on-disk storage of a queue's contents, and it's
just a sequential record of each add or remove operation that's happened on
that queue. When kestrel starts up, it replays each queue's journal to build
up the in-memory queue that it uses for client queries.

The journal file is rotated under one of two conditions:

1. the queue is empty and the journal is larger than `defaultJournalSize`

2. the journal is larger than `maxJournalSize`

For example, if `defaultJournalSize` is 16MB (the default), then if the queue
is empty and the journal is larger than 16MB, it will be truncated into a new
(empty) file. If the journal is larger than `maxJournalSize` (1GB by default),
the journal will be rewritten periodically to contain just the live items.

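
The two rotation conditions can be written as a small decision function. This
is a sketch under stated assumptions: `JournalPolicy`, `JournalAction`, and the
action names are hypothetical, and the real server rewrites an oversized
journal "periodically" rather than on every check.

```scala
sealed trait JournalAction
case object Keep extends JournalAction
// Queue is empty: the journal can be replaced by a new, empty file.
case object TruncateToEmpty extends JournalAction
// Journal is too large: rewrite it so it holds only the live items.
case object RewriteLiveItems extends JournalAction

object JournalPolicy {
  val DefaultJournalSize: Long = 16L * 1024 * 1024   // 16MB, documented default
  val MaxJournalSize: Long = 1024L * 1024 * 1024     // 1GB, documented default

  def decide(queueItems: Int, journalBytes: Long,
             defaultJournalSize: Long = DefaultJournalSize,
             maxJournalSize: Long = MaxJournalSize): JournalAction =
    if (queueItems == 0 && journalBytes > defaultJournalSize) TruncateToEmpty
    else if (journalBytes > maxJournalSize) RewriteLiveItems
    else Keep
}
```
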

You can turn the journal off for a queue (`keepJournal = false`) and the queue
will exist only in memory. If the server restarts, all enqueued items are
lost. You can also force a queue's journal to be synced to disk periodically,
or even after every write operation, at a performance cost, using
`syncJournal`.

If a queue grows past `maxMemorySize` bytes (128MB by default), only the
first 128MB is kept in memory. The journal is used to track later items, and
as items are removed, the journal is played forward to keep 128MB in memory.
This is usually known as "read-behind" mode, but Twitter engineers sometimes
refer to it as the "square snake" because of the diagram used to brainstorm
the implementation. When a queue is in read-behind mode, removing an item will
often cause 2 disk operations instead of one: one to record the remove, and
one to read an item in from disk to keep 128MB in memory. This is the
trade-off to avoid filling memory and crashing the JVM.


Item expiration
---------------

When they come from a client, expiration times are handled in the same way as
memcache: if the number is small (less than one million), it's interpreted as
a relative number of seconds from now. Otherwise it's interpreted as an
absolute unix epoch time, in seconds since the beginning of 1 January 1970
GMT.

Expiration times are immediately translated into an absolute time, in
*milliseconds*, and if it's further in the future than the queue's `maxAge`,
the `maxAge` is used instead. An expiration of 0, which is usually the
default, means an item never expires.

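
The translation from a client-supplied expiration to an absolute time can be
sketched as follows. The `Expiry` object is hypothetical. Applying `maxAge`
even when the client sent 0 is an assumption of this sketch, since the text
above doesn't say which rule wins in that case.

```scala
object Expiry {
  // Values below this are relative seconds; the guide gives "one million".
  val RelativeLimit: Long = 1000000L

  // Returns the absolute expiration in epoch milliseconds, or None when the
  // item never expires. `maxAgeMillis` is the queue's maxAge, if it has one.
  def normalize(clientExpiry: Long, nowMillis: Long,
                maxAgeMillis: Option[Long]): Option[Long] = {
    val requested: Option[Long] =
      if (clientExpiry == 0L) None                          // never expires
      else if (clientExpiry < RelativeLimit)
        Some(nowMillis + clientExpiry * 1000L)              // relative seconds
      else Some(clientExpiry * 1000L)                       // absolute epoch seconds
    val cap: Option[Long] = maxAgeMillis.map(nowMillis + _)
    (requested, cap) match {
      case (Some(r), Some(c)) => Some(math.min(r, c))       // maxAge wins if sooner
      case (None, c)          => c
      case (r, None)          => r
    }
  }
}
```
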

Expired items are flushed from a queue whenever a new item is added or
removed. Additionally, if the global config option `expirationTimerFrequency`
is set, a background thread will periodically remove expired items from the
head of each queue. The provided `production.scala` sets this to one second.
If this is turned off, an idle queue won't have any items expired, but you
can still trigger a check by doing a "peek" on it.

Normally, expired items are discarded. If `expireToQueue` is set, then
expired items are moved to the specified queue just as if a client had put
them there. Each item is added with no expiration time, but that can be
overridden if the new queue has a default expiration policy.

To prevent stalling the server when it encounters a swarm of items that all
expired at the same time, `maxExpireSweep` limits the number of items that
will be removed by the background thread in a single round. This is primarily
useful as a throttling mechanism when using a queue as a way to delay work.


Fanout queues
-------------

If a queue name has a `+` in it (like "`orders+audit`"), it's treated as a
fanout queue, using the format `<parent>+<child>`. These queues belong to a
parent queue -- in this example, the "orders" queue. Every item written into
a parent queue will also be written into each of its children.

Fanout queues each have their own journal file (if the parent queue has a
journal file) and otherwise behave exactly like any other queue. You can get
and peek and even add items directly to a child queue if you want. A fanout
queue uses its parent queue's configuration instead of having an independent
child queue configuration block.

When a fanout queue is first referenced by a client, the journal file (if any)
is created, and it will start receiving new items written to the parent queue.
Existing items are not copied over. A fanout queue can be deleted to stop it
from receiving new items.

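
The `<parent>+<child>` naming convention is easy to handle on the client
side. The helper below is a hypothetical sketch, not kestrel code: it splits
a fanout name on the first `+` and, given a list of queue names, finds the
children that would receive a copy of each item written to a parent.

```scala
object Fanout {
  // Splits "<parent>+<child>" into its parts; plain queue names yield None.
  def parse(name: String): Option[(String, String)] =
    name.split("\\+", 2) match {
      case Array(parent, child) if parent.nonEmpty && child.nonEmpty =>
        Some((parent, child))
      case _ => None
    }

  // Fanout queues among `knownQueues` that belong to `parent`.
  def childrenOf(parent: String, knownQueues: Seq[String]): Seq[String] =
    knownQueues.filter(q => parse(q).exists(_._1 == parent))
}
```
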

Thrift protocol
---------------

The thrift protocol is documented in the thrift file here:
[kestrel.thrift](https://github.com/robey/kestrel/blob/master/src/main/thrift/kestrel.thrift)


Memcache commands
-----------------

- `SET <queue-name> <flags (ignored)> <expiration> <# bytes>`

  Add an item to a queue. It may fail if the queue has a size or item limit
  and it's full.

- `GET <queue-name>[options]`

  Remove an item from a queue. It will return an empty response immediately if
  the queue is empty. The queue name may be followed by options separated
  by `/`:

  - `/t=<milliseconds>`

    Wait up to a given time limit for a new item to arrive. If an item arrives
    on the queue within this timeout, it's returned as normal. Otherwise,
    after that timeout, an empty response is returned.

  - `/open`

    Tentatively remove an item from the queue. The item is returned as usual
    but is also set aside in case the client disappears before sending a
    "close" request. (See "Reliable reads" below.)

  - `/close`

    Close any existing open read. (See "Reliable reads" below.)

  - `/abort`

    Cancel any existing open read, returning that item to the head of the
    queue. It will be the next item fetched. (See "Reliable reads" below.)

  - `/peek`

    Return the first available item from the queue, if there is one, but don't
    remove it. You can't combine this with any of the reliable read options.

  For example, to open a new read, waiting up to 500msec for an item:

      GET work/t=500/open

  Or to close an existing read and open a new one:

      GET work/close/open

- `DELETE <queue-name>`

  Drop a queue, discarding any items in it, and deleting any associated
  journal files.

- `FLUSH <queue-name>`

  Discard all items remaining in this queue. The queue remains live and new
  items can be added. The time it takes to flush will be linear in the current
  queue size, and any other activity on this queue will block while it's being
  flushed.

- `FLUSH_ALL`

  Discard all items remaining in all queues. The queues are flushed one at a
  time, as if kestrel received a `FLUSH` command for each queue.

- `VERSION`

  Display the kestrel version in a way compatible with memcache.

- `SHUTDOWN`

  Cleanly shut down the server and exit.

- `RELOAD`

  Reload the config file and reconfigure all queues. This should have no
  noticeable effect on the server's responsiveness.

- `DUMP_CONFIG`

  Dump a list of each queue currently known to the server, and list the config
  values for each queue. The format is:

      queue 'master' {
        max_items=2147483647
        max_size=9223372036854775807
        max_age=0
        max_journal_size=16277216
        max_memory_size=134217728
        max_journal_overflow=10
        max_journal_size_absolute=9223372036854775807
        discard_old_when_full=false
        journal=true
        sync_journal=false
      }

  The last queue will be followed by `END` on a line by itself.

- `STATS`

  Display server stats in memcache style. They're described below.

- `DUMP_STATS`

  Display server stats in a more readable style, grouped by queue. They're
  described below.

- `MONITOR <queue-name> <seconds> [max-items]`

  Monitor a queue for a time, fetching any new items that arrive. Clients
  are queued in a fair fashion, per-item, so many clients may monitor a
  queue at once. After the given timeout, a separate `END` response will
  signal the end of the monitor period. Any fetched items are open
  transactions (see "Reliable reads" below), and should be closed with
  `CONFIRM`.

- `CONFIRM <queue-name> <count>`

  Confirm receipt of `count` items from a queue. Usually this is the response
  to a `MONITOR` command, to confirm the items that arrived during the monitor
  period.


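
As an illustration of the `SET` and `GET` command lines above, here is a small
sketch that builds the bytes a client would write to the socket. The
`KestrelCommands` object is hypothetical. The `\r\n` line endings and the
payload following the `SET` line come from the memcache text protocol, and the
upper-case command spelling mirrors this guide (memcache clients commonly send
lower-case commands, so treat the casing as an assumption).

```scala
import java.nio.charset.StandardCharsets.UTF_8

object KestrelCommands {
  private val Crlf = "\r\n"

  // "SET <queue> <flags> <expiration> <bytes>", then the payload and CRLF.
  // Flags are ignored by kestrel, so 0 is sent.
  def set(queue: String, data: Array[Byte], expiration: Long = 0L): Array[Byte] = {
    val header = s"SET $queue 0 $expiration ${data.length}$Crlf".getBytes(UTF_8)
    header ++ data ++ Crlf.getBytes(UTF_8)
  }

  // "GET <queue>[/option...]" with the options separated by '/'.
  def get(queue: String, timeoutMillis: Option[Int] = None,
          close: Boolean = false, abort: Boolean = false,
          open: Boolean = false, peek: Boolean = false): String = {
    require(!(peek && (open || close || abort)),
      "peek can't be combined with the reliable read options")
    val options: Seq[String] = Seq(
      timeoutMillis.map(t => s"t=$t"),
      if (close) Some("close") else None,
      if (abort) Some("abort") else None,
      if (open) Some("open") else None,
      if (peek) Some("peek") else None
    ).flatten
    (queue +: options).mkString("GET ", "/", Crlf)
  }
}
```
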

Reliable reads
--------------

Normally when a client removes an item from the queue, kestrel immediately
discards the item and assumes the client has taken ownership. This isn't
always safe, because a client could crash or lose the network connection
before it gets the item. So kestrel also supports a "reliable read" that
happens in two stages, using the `/open` and `/close` options to `GET`.

When `/open` is used, and an item is available, kestrel will remove it from
the queue and send it to the client as usual. But it will also set the item
aside. If a client disconnects while it has an open read, the item is put back
into the queue, at the head, so it will be the next item fetched. Only one
item can be "open" per client connection.

A previous open request is closed with `/close`. The server will reject any
attempt to open another read when one is already open, but it will ignore
`/close` if there's no open request, so that you can add `/close` to every
`GET` request for convenience.

If for some reason you want to abort a read without disconnecting, you can use
`/abort`. But because aborted items are placed back at the head of the queue,
this isn't a good way to deal with client errors. Since the error-causing item
will always be the next one available, you'll end up bouncing the same item
around between clients instead of making progress.

There's always a trade-off: either potentially lose items or potentially
receive the same item multiple times. Reliable reads choose the latter option.
To use this tactic successfully, work items should be idempotent, meaning the
work could be done 2 or 3 times and have the same effect as if it had been
done only once (except wasting some resources).

Example:

    GET dirty_jobs/close/open
    (receives job 1)
    GET dirty_jobs/close/open
    (closes job 1, receives job 2)
    ...etc...


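
The open/close/abort rules can be modelled in a few lines. This is a toy
model for reasoning about the semantics, not kestrel's implementation:
`ReliableQueue` is a hypothetical class, connections are represented by plain
integer ids, and a rejected second open is modelled as an exception.

```scala
import scala.collection.mutable

// Toy single-queue model of reliable reads.
final class ReliableQueue[A] {
  private var items = Vector.empty[A]
  // Connection id -> the item set aside by that connection's open read.
  private val openReads = mutable.Map.empty[Int, A]

  def add(item: A): Unit = { items = items :+ item }

  // GET <queue>/open: hand out the head item, but remember it.
  def open(conn: Int): Option[A] = {
    require(!openReads.contains(conn), "only one open read per connection")
    val head = items.headOption
    head.foreach { item =>
      items = items.tail
      openReads(conn) = item
    }
    head
  }

  // GET <queue>/close: confirm the open read; ignored if nothing is open.
  def close(conn: Int): Unit = { openReads.remove(conn); () }

  // GET <queue>/abort, or a dropped connection: the item returns to the head.
  def abort(conn: Int): Unit =
    openReads.remove(conn).foreach { item => items = item +: items }

  // GET <queue>/close/open: confirm the previous item, then fetch the next.
  def closeThenOpen(conn: Int): Option[A] = { close(conn); open(conn) }
}
```
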

Server stats
------------

Global stats reported by kestrel are:

- `uptime` - seconds the server has been online
- `time` - current time in unix epoch
- `version` - version string, like "1.2"
- `curr_items` - total of items waiting in all queues
- `total_items` - total of items that have ever been added in this server's
  lifetime
- `bytes` - total byte size of items waiting in all queues
- `curr_connections` - current open connections from clients
- `total_connections` - total connections that have been opened in this
  server's lifetime
- `cmd_get` - total `GET` requests
- `cmd_set` - total `SET` requests
- `cmd_peek` - total `GET/peek` requests
- `get_hits` - total `GET` requests that received an item
- `get_misses` - total `GET` requests on an empty queue
- `bytes_read` - total bytes read from clients
- `bytes_written` - total bytes written to clients

For each queue, the following stats are also reported:

- `items` - items waiting in this queue
- `bytes` - total byte size of items waiting in this queue
- `total_items` - total items that have been added to this queue in this
  server's lifetime
- `logsize` - byte size of the queue's journal file
- `expired_items` - total items that have been expired from this queue in this
  server's lifetime
- `mem_items` - items in this queue that are currently in memory
- `mem_bytes` - total byte size of items in this queue that are currently in
  memory (will always be less than or equal to `max_memory_size` config for
  the queue)
- `age` - time, in milliseconds, that the last item to be fetched from this
  queue had been waiting; that is, the time between `SET` and `GET`; if the
  queue is empty, this will always be zero
- `discarded` - number of items discarded because the queue was too full
- `waiters` - number of clients waiting for an item from this queue (using
  `GET/t`)
- `open_transactions` - items read with `/open` but not yet confirmed


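
A `STATS` response in memcache style is a series of `STAT <name> <value>`
lines terminated by `END`, which makes it simple to parse. The parser below
is a hypothetical sketch. Exactly how per-queue stats are named in that flat
list isn't covered here, so the sketch treats every name as an opaque key.

```scala
object StatsParser {
  private val StatLine = """STAT (\S+) (.*)""".r

  // Parses "STAT <name> <value>" lines up to the terminating "END".
  // Lines that don't match the pattern are skipped.
  def parse(response: String): Map[String, String] =
    response.split("\r?\n").iterator
      .map(_.trim)
      .takeWhile(_ != "END")
      .collect { case StatLine(name, value) => name -> value }
      .toMap
}
```
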

Kestrel as a library
--------------------

You can use kestrel as a library by just sticking the jar on your classpath.
It's a cheap way to get a durable work queue for inter-process or inter-thread
communication. Each queue is represented by a `PersistentQueue` object:

    class PersistentQueue(val name: String, persistencePath: String,
                          @volatile var config: QueueConfig, timer: Timer,
                          queueLookup: Option[(String => Option[PersistentQueue])]) {

The constructor takes the name of the queue, the path for the journal files
(if the queue will be journaled), a `QueueConfig` object (derived from
`QueueBuilder`), a timer for handling timeout reads, and optionally a way to
find other named queues (for `expireToQueue` support).

A queue must be initialized before use:

    def setup(): Unit

To add an item to a queue:

    def add(value: Array[Byte], expiry: Option[Time]): Boolean

It will return `false` if the item was rejected because the queue was full.

Queue items are represented by a case class:

    case class QItem(addTime: Time, expiry: Option[Time], data: Array[Byte], var xid: Int)

and several operations exist to remove or peek at the head item:

    def peek(): Option[QItem]
    def remove(): Option[QItem]

To open a reliable read, set `transaction` to true, and later confirm or
unremove the item by its `xid`:

    def remove(transaction: Boolean): Option[QItem]
    def unremove(xid: Int)
    def confirmRemove(xid: Int)

You can also asynchronously remove or peek at items using futures:

    def waitRemove(deadline: Option[Time], transaction: Boolean): Future[Option[QItem]]
    def waitPeek(deadline: Option[Time]): Future[Option[QItem]]

When done, you should close the queue:

    def close(): Unit
    def isClosed: Boolean

Here's a short example:

    // `config` (a QueueConfig) and `timer` are assumed to be built elsewhere.
    // The `500.milliseconds` syntax is assumed to come from twitter-util:
    import com.twitter.conversions.time._

    val queue = new PersistentQueue("work", "/var/spool/kestrel", config, timer, None)
    queue.setup()

    // add an item with no expiration:
    queue.add("hello".getBytes, None)

    // start to remove it, then back out:
    queue.remove(true).foreach { item => queue.unremove(item.xid) }

    // remove an item with a 500msec timeout, and confirm it:
    queue.waitRemove(Some(500.milliseconds.fromNow), true)() match {
      case None =>
        println("nothing. :(")
      case Some(item) =>
        println("got: " + new String(item.data))
        queue.confirmRemove(item.xid)
    }

    queue.close()