Skip to content

Commit 8a27372

Browse files
authored
Update README.md
1 parent f0929d3 commit 8a27372

File tree

1 file changed

+20
-1
lines changed

1 file changed

+20
-1
lines changed

README.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,21 @@
11
# SD
2-
String dictionary with front coding compression
2+
C++ String dictionary with front coding compression and serialization
3+
4+
sd is a C++ class implementing a compressed string dictionary. The class provides fast add, locate and
5+
extract using front coding compression. Front coding compression means that common prefixes are compressed. This makes this string dictionary suitable for compact storage and fast access to a huge amount of strings like URL:s or file paths. Using sd it may be possible to hold a complete dictionary in RAM instead of having to use slower disk-based storage.
6+
7+
Note:
8+
* Strings must be added in strict ascending lexicographic order.
9+
* Strings are stored as raw bytes, but '\0' cant't be used in a string because that char is used as string terminator inside the data structure.
10+
* BUCKET_SIZE may be changed to shift the trade-off between compression level and speed of access. Big BUCKET_SIZE means higher compression but slower access. Sane values are between 16 and 2048.
11+
12+
This software is simplified version of StringDictionaryPFC from libCSD:
13+
```
14+
==========================================================================
15+
"Compressed String Dictionaries"
16+
Nieves R. Brisaboa, Rodrigo Cánovas, Francisco Claude,
17+
Miguel A. Martínez-Prieto, and Gonzalo Navarro
18+
10th Symposium on Experimental Algorithms (SEA'2011), p.136-147, 2011.
19+
==========================================================================
20+
```
21+
See: https://github.com/migumar2/libCSD

0 commit comments

Comments
 (0)