Skip to content

Commit

Permalink
Resolving comment 67
Browse files Browse the repository at this point in the history
  • Loading branch information
newtoncalegari committed Apr 27, 2016
1 parent b373bef commit 66cf069
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions bp.html
Expand Up @@ -3895,13 +3895,13 @@ <h3>Data Enrichment</h3>
<p class="practicedesc">Enrich your data by generating new data from the raw data when doing so will enhance its value.</p>
<section class="axioms">
<h4 class="subhead">Why</h4>
<p>Enrichment can greatly enhance processability, particularly for unstructured data. Missing values can be filled in, and new attributes and measures can be added. Publishing more complete datasets enhances trust. Deriving additional values that are of general utility saves users time and encourages more kinds of reuse. There are many intelligent techniques that can be used to enrich data, making the dataset an even more valuable asset.</p>
<p>Enrichment can greatly enhance processability, particularly for unstructured data. Under some circumstances, missing values can be filled in, and new attributes and measures can be added. Publishing more complete datasets can enhance trust, if done properly and ethically. Deriving additional values that are of general utility saves users time and encourages more kinds of reuse. There are many intelligent techniques that can be used to enrich data, making the dataset an even more valuable asset.</p>
</section>
<section class="outcome">
<h4 class="subhead">Intended Outcome</h4>
<p>A dataset that has missing values is enhanced if it is possible to fill in those values. Additional relevant measures or attributes should be added if they enhance utility. Unstructured data can be given structure in this way as well.</p>
<p>Because inference-based enrichment may introduce errors into the data, values generated by such techniques should be labeled as such, and it should be possible to retrieve any original values replaced by enrichment.</p>
<p>Whenever licensing permits, the code used to enrich the data should be made available along with the dataset. Sharing such code is particularly important for scientific data. </p>
<p>Data that is unstructured should be given structure if possible. In structured data, missing values should be added if they enhance utility, but only if the addition does not distort analytical results, significance, or statistical power.</p>
<p>Values generated by inference-based techniques should be labeled as such, and it should be possible to retrieve any original values replaced by enrichment.</p>
<p>Whenever licensing permits, the code used to enrich the data should be made available along with the dataset.</p>
</section>
<section class="how">
<h4 class="subhead">Possible Approaches to Implementation</h4>
Expand Down

0 comments on commit 66cf069

Please sign in to comment.