close #43

strengejacke · Jan 31, 2017 · c903355 · c903355
1 parent 2253cea
commit c903355
Show file tree

Hide file tree

Showing 18 changed files with 2,609 additions and 440 deletions.
diff --git a/.gitignore b/.gitignore
@@ -1,56 +1,41 @@
-# History files
-.Rhistory
-
-# Example code in package build process
-*-Ex.R
-
-# R data files from past sessions
-.Rdata
-
-# =========================
-# Operating System Files
-# =========================
-
-# OSX
-# =========================
-
+# History files
+.Rhistory
+
+# Example code in package build process
+*-Ex.R
+# R data files from past sessions
+.Rdata
+# =========================
+# Operating System Files
+# OSX
 .DS_Store
 .AppleDouble
 .LSOverride
-
 # Icon must end with two \r
-Icon
-
+Icon
 # Thumbnails
 ._*
-
 # Files that might appear on external disk
 .Spotlight-V100
 .Trashes
-
 # Directories potentially created on remote AFP share
 .AppleDB
 .AppleDesktop
 Network Trash Folder
 Temporary Items
 .apdisk
-
-# Windows
-# =========================
-
-# Windows image file caches
-Thumbs.db
-ehthumbs.db
-
-# Folder config file
-Desktop.ini
-
-# Recycle Bin used on file shares
-$RECYCLE.BIN/
-
-# Windows Installer files
-*.cab
-*.msi
-*.msm
-*.msp
-.Rproj.user
+# Windows
+# Windows image file caches
+Thumbs.db
+ehthumbs.db
+# Folder config file
+Desktop.ini
+# Recycle Bin used on file shares
+$RECYCLE.BIN/
+# Windows Installer files
+*.cab
+*.msi
+*.msm
+*.msp
+.Rproj.user
+inst/doc
diff --git a/DESCRIPTION b/DESCRIPTION
@@ -33,7 +33,10 @@ Imports:
     tidyr (>= 0.6.0)
 Suggests:
     mice,
-    sjstats (>= 0.7.0)
+    sjstats (>= 0.7.0),
+    knitr,
+    rmarkdown
 URL: https://github.com/sjPlot/sjmisc
 BugReports: https://github.com/sjPlot/sjmisc/issues
 RoxygenNote: 5.0.1
+VignetteBuilder: knitr
diff --git a/NEWS b/NEWS
diff --git a/NEWS.md b/NEWS.md
@@ -1,120 +1,124 @@
-# sjmisc 2.3.0
-
-## New functions
-
-* `center()` and `std()` are moving from package `sjstats` to `sjmisc`.
-* `add_columns()` to bind columns of first data frame at the end of all data frames.
-
-## Changes to functions
-
-* `frq()` now has the same argument-structure as `flat_table()`.
-* Following functions now follow a consistent tidyverse-approach, with the data being the first argument, followed by variable names: `add_labels()`, `replace_labels()`, `remove_labels()`, `count_na()`, `rec()`, `dicho()`, `split_var()`, `drop_labels()`, `fill_labels()`, `group_var()`, `group_labels()`, `ref_lvl()`, `recode_to()`, `replace_na()`, `set_na()` and `set_labels()`..
-
-# sjmisc 2.2.1
-
-## Bug fixes
-
-* The `sort.frq` did not work `frq()`.
-
-# sjmisc 2.2.0
-
-## New functions
-
-* `zap_inf()` to "clean" vectors from `NaN` and infinite values.
-* `descr()` to provide basic descriptive statistics (similar to `describe()` in the psych-package), but including variable labels and usable in pipe-workflows. Also works with grouped data frames.
-
-## Changes to functions
-
-* `rec()`, `split_var()` and `dicho()` get an argument `suffix`, to append a suffix to variable (column) names, if applied on a data frame.
-* Value labels in `rec()` can now directly be assigned inside the `recodes`-syntax (see 'Details' in `?rec`).
-* `find_var()` gets a `as.df`-argument, to return a data frame with matching variables, instead of their column indices only.
-* `find_var()` gets a `as.varlab`-argument, to return a "summary" data frame with column number, variable name and variable label.
-* `flat_table()` now also accepts grouped data frames.
-* `flat_table()` gets a `show.values`-argument, to add values to associated labels in output.
-* `frq()` now also accepts grouped data frames.
-* `frq()` gets a `weight.by`-argument to weight frequencies.
-* `set_na()` can now also find values by their value labels and replace them with NA.
-* `set_na()` now removes unused value labels from values that have been replaced with NA.
-* The `as.tag`-argument in `set_na()` now defaults to `FALSE`.
-* `get_labels()` now always returns labels in sorted order of the associated values.
-* `get_labels()` gets a `drop.unused`-argument, to automatically drop labels from values that don't occur in the vector.
-* For a named vector as `labels`-argument, `set_labels()` now always sorts labels in sorted order of the associated values.
-* `is_empty()` gets a `first.only`-argument, to evaluate either first or all elements of a character vector.
-
-## Bug fixes
-
-* `set_na()` did not work on vectors of class `Date` when argument `as.tag = TRUE`.
-* `flat_table()` did not show values that had no value labels. Now all categories are shown in the frequency table.
-* `rec()` did not properly copy labels of tagged NA values when not all recoded values appeared in the vector.
-* `frq()` did not show correct values, when value labels of a vector were not sorted according their values.
-* `set_labels()` did not set labels properly for ordered factors.
-* `remove_labels()` returned NA-values for value labels (instead of no value labels) when the last value label of a vector was removed.
-
-
-# sjmisc 2.1.0
-
-## New functions
-
-* `find_var()` to find variables in data frames by name or label.
-* `var_labels()` as "tidyversed" alternative to `set_label()` to set variable labels.
-* `var_rename()` to rename variables.
-
-## Changes to functions
-
-* Following functions now get an ellipses-argument `...`, to apply function only to selected variables, but return the complete data frame (thus, overwriting existing variables in a data frame, if requested): `to_factor()`, `to_value()`, `to_label()`, `to_character()`, `to_dummy()`, `zap_labels()`, `zap_unlabelled()`, `zap_na_tags()`.
-
-## Bug fixes
-
-* Fixed bug with copying attributes from tibbles for `merge_df()`.
-* Fixed wrong argument-description in docs of `frq()`.
-
-# sjmisc 2.0.1
-
-## General
-
-* Removed package `coin` from Imports.
-
-## New functions
-
-* `count_na()` to print a frequency table of tagged NA values.
-
-## Changes to functions
-
-* `set_na()` gets a `drop.levels` argument to keep or drop factor levels of values that have been replaced with NA.
-* `set_na()` gets a `as.tag` argument to set NA values as regular or tagged NA.
-
-
-# sjmisc 2.0.0
-
-## General
-
-* **sjmisc** now supports _tagged_ `NA` values, a new structure for labelled missing values introduced by the [haven-package](https://cran.r-project.org/package=haven). This means that functions or arguments that are no longer useful, have been removed while other functions dealing with NA values have been largely revised.
-* All statistical functions have been removed and are now in a separate package, [sjstats](https://cran.r-project.org/package=sjstats).
-* Removed some S3-methods for `labelled`-class, as these are now provided by the haven-package.
-* Functions no longer check input for type `matrix`, to avoid conflicts with scaled vectors (that were recognized as matrix and hence treated as data frame).
-* `table(*, exclude = NULL)` was changed to `table(*, useNA = "always")`, because of planned changes in upcoming R version 3.4.
-* More functions (like `trim()` or `frq()`) now also have data frame- or list-methods.
-
-## New functions
-
-* `zap_na_tags()` to turn tagged NA values into regular NA values.
-* `spread_coef()` to spread coefficients of multiple fitted models in nested data frames into columns.
-* `merge_imputations()` to find the most likely imputed value for a missing value.
-* `flat_table()` to print flat (proportional) tables of labelled variables.
-* Added `to_character()` method.
-* `big_mark()` to format large numbers with big marks.
-* `empty_cols()` and `empty_rows()` to find variables or observations with exclusively NA values in a data frame.
-* `remove_empty_cols()` and `remove_empty_rows()` to remove variables or observations with exclusively NA values from a data frame.
-
-## Changes to functions
-* `str_contains()` gets a `switch` argument to switch the role of `x` and `pattern`.
-* `word_wrap()` coerces vectors to character if necessary.
-* `to_label()` gets a `var.label` and `drop.levels` argument, and now preserves variable labels by default.
-* Argument `def.value` in `get_label()` now also applies to data frame arguments.
-* If factor levels are numeric and factor has value labels, these are used in `to_value()` by default.
-* `to_factor()` no longer generates `NA` or `NaN`-levels when converting input into factors.
-
-## Bug fixes
-* `rec()` did not recode values, when these were the first element of a multi-line string of the `recodes` argument.
-* `is_empty()` returned `NA` instead of `TRUE` for empty character vectors.
-* Fixed bug with erroneous assignment of value labels to subset data when using `copy_labels()` ([#20](https://github.com/sjPlot/sjmisc/issues/20))
+# sjmisc 2.3.0
+
+## General
+
+* Added package-vignettes.
+
+## New functions
+
+* `center()` and `std()` are moving from package `sjstats` to `sjmisc`.
+* `add_columns()` to bind columns of first data frame at the end of all data frames.
+
+## Changes to functions
+
+* `frq()` now has the same argument-structure as `flat_table()`.
+* Following functions now follow a consistent tidyverse-approach, with the data being the first argument, followed by variable names: `add_labels()`, `replace_labels()`, `remove_labels()`, `count_na()`, `rec()`, `dicho()`, `split_var()`, `drop_labels()`, `fill_labels()`, `group_var()`, `group_labels()`, `ref_lvl()`, `recode_to()`, `replace_na()`, `set_na()` and `set_labels()`..
+
+# sjmisc 2.2.1
+
+## Bug fixes
+
+* The `sort.frq` did not work `frq()`.
+
+# sjmisc 2.2.0
+
+## New functions
+
+* `zap_inf()` to "clean" vectors from `NaN` and infinite values.
+* `descr()` to provide basic descriptive statistics (similar to `describe()` in the psych-package), but including variable labels and usable in pipe-workflows. Also works with grouped data frames.
+
+## Changes to functions
+
+* `rec()`, `split_var()` and `dicho()` get an argument `suffix`, to append a suffix to variable (column) names, if applied on a data frame.
+* Value labels in `rec()` can now directly be assigned inside the `recodes`-syntax (see 'Details' in `?rec`).
+* `find_var()` gets a `as.df`-argument, to return a data frame with matching variables, instead of their column indices only.
+* `find_var()` gets a `as.varlab`-argument, to return a "summary" data frame with column number, variable name and variable label.
+* `flat_table()` now also accepts grouped data frames.
+* `flat_table()` gets a `show.values`-argument, to add values to associated labels in output.
+* `frq()` now also accepts grouped data frames.
+* `frq()` gets a `weight.by`-argument to weight frequencies.
+* `set_na()` can now also find values by their value labels and replace them with NA.
+* `set_na()` now removes unused value labels from values that have been replaced with NA.
+* The `as.tag`-argument in `set_na()` now defaults to `FALSE`.
+* `get_labels()` now always returns labels in sorted order of the associated values.
+* `get_labels()` gets a `drop.unused`-argument, to automatically drop labels from values that don't occur in the vector.
+* For a named vector as `labels`-argument, `set_labels()` now always sorts labels in sorted order of the associated values.
+* `is_empty()` gets a `first.only`-argument, to evaluate either first or all elements of a character vector.
+
+## Bug fixes
+
+* `set_na()` did not work on vectors of class `Date` when argument `as.tag = TRUE`.
+* `flat_table()` did not show values that had no value labels. Now all categories are shown in the frequency table.
+* `rec()` did not properly copy labels of tagged NA values when not all recoded values appeared in the vector.
+* `frq()` did not show correct values, when value labels of a vector were not sorted according their values.
+* `set_labels()` did not set labels properly for ordered factors.
+* `remove_labels()` returned NA-values for value labels (instead of no value labels) when the last value label of a vector was removed.
+
+
+# sjmisc 2.1.0
+
+## New functions
+
+* `find_var()` to find variables in data frames by name or label.
+* `var_labels()` as "tidyversed" alternative to `set_label()` to set variable labels.
+* `var_rename()` to rename variables.
+
+## Changes to functions
+
+* Following functions now get an ellipses-argument `...`, to apply function only to selected variables, but return the complete data frame (thus, overwriting existing variables in a data frame, if requested): `to_factor()`, `to_value()`, `to_label()`, `to_character()`, `to_dummy()`, `zap_labels()`, `zap_unlabelled()`, `zap_na_tags()`.
+
+## Bug fixes
+
+* Fixed bug with copying attributes from tibbles for `merge_df()`.
+* Fixed wrong argument-description in docs of `frq()`.
+
+# sjmisc 2.0.1
+
+## General
+
+* Removed package `coin` from Imports.
+
+## New functions
+
+* `count_na()` to print a frequency table of tagged NA values.
+
+## Changes to functions
+
+* `set_na()` gets a `drop.levels` argument to keep or drop factor levels of values that have been replaced with NA.
+* `set_na()` gets a `as.tag` argument to set NA values as regular or tagged NA.
+
+
+# sjmisc 2.0.0
+
+## General
+
+* **sjmisc** now supports _tagged_ `NA` values, a new structure for labelled missing values introduced by the [haven-package](https://cran.r-project.org/package=haven). This means that functions or arguments that are no longer useful, have been removed while other functions dealing with NA values have been largely revised.
+* All statistical functions have been removed and are now in a separate package, [sjstats](https://cran.r-project.org/package=sjstats).
+* Removed some S3-methods for `labelled`-class, as these are now provided by the haven-package.
+* Functions no longer check input for type `matrix`, to avoid conflicts with scaled vectors (that were recognized as matrix and hence treated as data frame).
+* `table(*, exclude = NULL)` was changed to `table(*, useNA = "always")`, because of planned changes in upcoming R version 3.4.
+* More functions (like `trim()` or `frq()`) now also have data frame- or list-methods.
+
+## New functions
+
+* `zap_na_tags()` to turn tagged NA values into regular NA values.
+* `spread_coef()` to spread coefficients of multiple fitted models in nested data frames into columns.
+* `merge_imputations()` to find the most likely imputed value for a missing value.
+* `flat_table()` to print flat (proportional) tables of labelled variables.
+* Added `to_character()` method.
+* `big_mark()` to format large numbers with big marks.
+* `empty_cols()` and `empty_rows()` to find variables or observations with exclusively NA values in a data frame.
+* `remove_empty_cols()` and `remove_empty_rows()` to remove variables or observations with exclusively NA values from a data frame.
+
+## Changes to functions
+* `str_contains()` gets a `switch` argument to switch the role of `x` and `pattern`.
+* `word_wrap()` coerces vectors to character if necessary.
+* `to_label()` gets a `var.label` and `drop.levels` argument, and now preserves variable labels by default.
+* Argument `def.value` in `get_label()` now also applies to data frame arguments.
+* If factor levels are numeric and factor has value labels, these are used in `to_value()` by default.
+* `to_factor()` no longer generates `NA` or `NaN`-levels when converting input into factors.
+
+## Bug fixes
+* `rec()` did not recode values, when these were the first element of a multi-line string of the `recodes` argument.
+* `is_empty()` returned `NA` instead of `TRUE` for empty character vectors.
+* Fixed bug with erroneous assignment of value labels to subset data when using `copy_labels()` ([#20](https://github.com/sjPlot/sjmisc/issues/20))