Skip to content
Fetching contributors…
Cannot retrieve contributors at this time
178 lines (132 sloc) 4.63 KB
<!DOCTYPE html>
<!-- saved from url=(0014)about:internet -->
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"/>
<title>What&#39;s new in 2.0.1</title>
<style type="text/css">
body, td {
font-family: sans-serif;
background-color: white;
font-size: 12px;
margin: 8px;
}
tt, code, pre {
font-family: 'DejaVu Sans Mono', 'Droid Sans Mono', 'Lucida Console', Consolas, Monaco, monospace;
}
h1 {
font-size:2.2em;
}
h2 {
font-size:1.8em;
}
h3 {
font-size:1.4em;
}
h4 {
font-size:1.0em;
}
h5 {
font-size:0.9em;
}
h6 {
font-size:0.8em;
}
a:visited {
color: rgb(50%, 0%, 50%);
}
pre {
margin-top: 0;
max-width: 95%;
border: 1px solid #ccc;
white-space: pre-wrap;
}
pre code {
display: block; padding: 0.5em;
}
code.r {
background-color: #F8F8F8;
}
table, td, th {
border: none;
}
blockquote {
color:#666666;
margin:0;
padding-left: 1em;
border-left: 0.5em #EEE solid;
}
hr {
height: 0px;
border-bottom: none;
border-top-width: thin;
border-top-style: dotted;
border-top-color: #999999;
}
@media print {
* {
background: transparent !important;
color: black !important;
filter:none !important;
-ms-filter: none !important;
}
body {
font-size:12pt;
max-width:100%;
}
a, a:visited {
text-decoration: underline;
}
hr {
visibility: hidden;
page-break-before: always;
}
pre, blockquote {
padding-right: 1em;
page-break-inside: avoid;
}
tr, img {
page-break-inside: avoid;
}
img {
max-width: 100% !important;
}
@page :left {
margin: 15mm 20mm 15mm 10mm;
}
@page :right {
margin: 15mm 10mm 15mm 20mm;
}
p, h2, h3 {
orphans: 3; widows: 3;
}
h2, h3 {
page-break-after: avoid;
}
}
</style>
</head>
<body>
<h1>What&#39;s new in 2.0.1</h1>
<h2>Compatibility</h2>
<p>Works with CDH3, CDH4, Apache 1.0.4 and MapR 2.0.1. See <a href="compatibility.md">Compatibility</a></p>
<h2>Fixes</h2>
<ul>
<li>Fixed <code>rmr.sample</code>. The function in 2.0 was broken due to a commit mishap.</li>
<li>Hardened <code>keyval</code> argument recycling feature against some edge cases (not <code>NULL</code> but 0-length keys or values).</li>
<li>Local mode again compatible with mixed type input files. The local backend failed when multiple inputs were provided but they represented different record types, which is typical of joins. No longer.</li>
<li>Fixed equijoin with new reduce default. The 2.0 version had a number of problems due to an incomplete port to the 2.0 API. The new reduce default does a <code>merge</code> of left and right side in most cases, returns left and right groups as they are when lists are involved.</li>
<li>Hardened streaming backend against user code or packages writing to <code>stdout</code>. This output stream is reserved for hadoop streaming use. We now capture all output at package load time and from the map and reduce functions and redirect it to standard error. There is one case we can&#39;t handle, see #40.</li>
<li>Improved environment load on the nodes. In some cases local variables could be overwritten by global environment variables, not anymore.</li>
<li>Hardened hdfs functions against some small platform variations. This seemed to give rise only to error messages without further consequences in most cases, but had been a source of alarm for users.</li>
</ul>
<h2>Features</h2>
<ul>
<li>automatic package install &amp; update feature (experimental). At default, nothing changes, you need to install all the packages you need on all the nodes. If you set the package option <code>install.args</code> to a list, even empty, rmr will try to install missing packages using that list as additional arguments for <code>install.packages</code> (the <code>libs</code> argument is set to the current list of loaded packages) . Same logic with <code>update.args</code>. This feature is experimental and, as we said, off by default. Testing has been very limited, please consider this a preview of a feature targted for 2.1</li>
</ul>
<h2>Behind the scenes</h2>
<ul>
<li>prettified a lot of code to show directly in documents and presentations using literate programming. This is important because the same code is showed everywhere, no copy and paste, no pseudo-code, no nothing. The code in the [Tutorial] and other documents is always real, fresh and tested. Documents are generated with knitr and presentation with knitr and pandoc. Thanks to Yihui Xie and John MacFarlane for these great software and direct support.</li>
</ul>
</body>
</html>
Jump to Line
Something went wrong with that request. Please try again.