This release contains a number of changes to the : BaseKB output, most importantly:
- $-escapes are now converted to Unicode in keys and almost all raw strings
- there is no longer a sieve3 horizontal subdivision
- output triples are grouped and sorted by subject and divided into 210 shards
Numerous changes have happened behind the scenes, the most important of which is that the Spring XML that defines the weekly job has been moved into the bakemono project and is exported in a small JAR file that haruhi reads.
This release has cleared away obstacles to some big changes in dependencies which will happen soon.
This version of Infovore is linked against Centipede 99.6 and includes a version bump to Spring 4.0.5.
In other news, several half-baked utilities have been checked in, for instance, you can do
haruhi run ssh i-598b673e
to ssh to a machine using an AMZN instance id instead of an ip address.
Haruhi now writes a tag with the Hadoop job id to all line items for the job so we can add up line items with this tag to calculate that cost of a job after the fact. When running a flow (multiple jobs), Haruhi now uses the command line arguments of the flow to determine the name of the flow.