-
Notifications
You must be signed in to change notification settings - Fork 15
Native input specification
DISCLAIMER
This document is for developers' reference only and/or for a pair of curious eyes who'd like an early-bird preview of new input specifications and rapport's course of development. Feel free to rant if you think that something we're working on is pure rubbish.
Rationale: old inputs suck because they differ from native R classes. Instead of rolling our own inventions, why not embrace the (implicit) conventions from R ecosystem and build/define inputs according to them? Writing templates should be straightforward and rapport should make you warm and fuzzy inside.
Native input specification takes leverage of R objects' classes, thus not relying on custom conventions when defining template inputs. Following specifications are to depict the current state of implementation along with conventions used in it. Unlike old input specification that relied on rather cumbersome custom syntax, new input specification is 100% pure YAML, and as such should be more intuitive and native to R.
Inputs can be divided into two categories:
-
dataset inputs (non-standalone) - inputs that match the named element in the object provided in
dataformal argument inrapportcall. This is usually adata.frameobject, but can/should be any object that allows subsetting by name. Dataset inputs cannot have the default value, so they always require user input. Note that this is not the same asrequiredinput attribute. -
standalone inputs don't depend on object passed in
data. They accept (default)valueattribute in the template definition, and user can override that value by passing an appropriate object (of matchingclass) in therapportcall.
Following options are available for all inputs (except option, which we'll discuss separately):
-
name(character string) - input name. Cannot be blank, as mapped inputs will be assigned to that name in evaluation environment. -
label(character string) - input label. Can be blank, but it's useful to have something in there in order to get pretty output in a report (e.g.Number of hoursis by far more descriptive thannwhours). Defaults to empty string. -
description(character string) - input description. It can be blank, but sometimes it's convenient to have a lengthy description of provided input. Defaults to empty string. -
class(character string) - defines aclassof an input (d'uh). It can be omitted (defaults toany), but most of the times you'll find it useful to fine-tune inputs. Currently supported classes are:any(default),character,complex,factor,integer,logical,numeric,option,raw. Apart fromoption(which is custom, and we may soon replace it with something more native to R), all the others are basic R classes. -
required(logical value) - is input required (defaults toFALSE). IfTRUE, input must match the subset of object provided indataargument (for dataset inputs) or user has to provide a value for that input/default value has to be defined in the template (ifstandalone). -
standalone(logical value) - does input depend on provideddata. Defaults toFALSE. -
length- provides set of rules for input'slengthrestriction. Defaults to (exactly: 1). It can accept various attributes, e.g.- if omitted or
NULL, it will default toexactly: 1 - if
integervalue is provided, it refers to exactlyNinputs:
- if omitted or
length: 10
which is equivalent to (and will be parsed as):
length:
exactly: 10
-
fromandtoattributes can be passed to define a range that input'slengthmust fall into:
length:
from: 1
to: 10
from or to can be omitted, and the sane defaults will be set implicitly. For instance, by omitting to, it will default to Inf, and if omitting from it will be set to 1.
length:
from: 1
is identical to:
length:
from: 1
to: Inf
or:
length:
to: 10
is equivalent to:
length:
from: 1
to: 10
-
value(a vector of an equivalentclass). Only available for standalone inputs, and must match theclassandlengthof a given input. Additional checks are performed based on the inputclass. See Class-specific options for details.
An "ordinary" input that has no-class specific options (see below) should look pretty much like this:
- name: l
label: Logical input
description: Nothing special about this, really. Just an ordinary logical input...
standalone: TRUE
value: TRUE
length: 1
required: FALSE
This will define a standalone logical input with TRUE as default value, that will accept only one logical value and is not required.
-
regexp(character value) - a regular expression that all values in the input should match.regexpis omitted by default, and check will be performed only if attribute is non-NULL. -
nchar- sets restrictions on the number of characters. Accepts the same options aslengthattribute, only this time number of characters are checked.ncharis omitted from input definition by default. Checks are performed only if attribute is properly defined.
-
limit(a named list withminandmaxattributes) - checks if values ofnumeric/integerinputs fall into range defined byminandmax. Bothminandmaxshould be length-one vectors of appropriate class.limits are omitted by default. Checks are performed only iflimitattribute is provided.
-
nlevels(integer value) - defines number of levels given factor is allowed to have. Attribute is omitted by default and checks will be performed only if non-NULL.
NOTE TO SELF: this is lame, nlevels should accept the same format as nchar and/or length.
Currently the only non-native input. Its main purpose is to provide means to define set of values to match against, just like in match.arg R function. Options are provided in value attribute, length restrictions should be set in order to allow multiple matches. It can only be standalone and does not have any class-specific options. Example of option input:
- name: o
label: Option
description: Option input
value:
- fee
- fi
- foo
- fam
This will define fee, fi, foo and fam as values to match against. See ?match.arg for details.