SLIDER: Software for LongItudinal Data Exploration with R
Interactive tool for exploratory analysis of longitudinal data.
GNU AFFERO GENERAL PUBLIC LICENSE v3
Please cite the following references in publications:
- For the slide plot and the SLIDER application: Commenges H., Pistre P., Cura R. (2014) SLIDER: Software for LongItudinal Data Exploration with R, Cybergeo: European Journal of Geography.
- For the parallel coordinates plot: Bürgin R., Ritschard G. (2012) Categorical parallel coordinate plot, LaCOSA Lausanne Conference On Sequence Analysis, University of Lausanne, June 6th-8th (Poster).
- For transition rate, index plot, frequency plot and distribution plot: Gabadinho A., Ritschard G., Mueller N.S., Studer M. (2011) Analyzing and Visualizing State Sequences in R with TraMineR, Journal of Statistical Software, 40(4), pp.1-37.
Format and variables. The data format is the so-called STate-Sequence (STS) format (see references below, Gabadinho et al.). Each individual may be described with two kinds of variables:
Factor variablesare categorical variables (nominal or ordinal) such as sex, profession, age groups, etc.
Time variablesare categorical variables characterizing the following states for each individual at several time steps.
Time variables. Can be integer or character. The only requirement is that the format must be consistent with a unique
alphabetfor the whole time steps, i.e. each field must be filled with the same type (integer or character) describing the same states.
Factor variables. Can be integer or character. If you want to draw readable labels on the plots you should fill these fields with short character labels.
Alphabet. The alphabet is the list of distinct modalities found in the time variables. The maximum length of the alphabet is set to 12 distinct modalities. Beyond this value, the plots are no longer readable.
Load example data. Load example data from the study by McVicar and Anyadike-Danes (2002) on transition from school to work. This dataset is used in the TraMineR package (mvad) and in the reference paper (see
Import CSV file. The default parameters are comma as separator and no quoting character. You can change this setting, before uploading the CSV file, by checking the
Choose time steps. Mandatory : choose two or more fields containing the time steps you want to explore.
Select weighting variable. Optional : choose a weighting numerical variable, it will be taken into account for the computation and the graphical outputs.
Choose factors. Optional : choose a field containing a factor if you want to explore the patterns for different groups (for example, the sex of the individuals).
Choose a group. Optional : choose the value assigned to the group for which you want to draw the plots.
Download. Plots can be downloaded with the
Downloadbutton. It creates a SVG file with a default size of 20cm (w) x 15 cm (h). You can set the size from 1 to 30 cm. Be aware that index plots can be very heavy if there is a lot of time steps and/or sequences.
- Frequencies. Choose whether to show
absolute frequenciesof transitions,
Index, frequency and distribution plots
- Draw border. Choose whether or not to draw borders between successive states.
- Index of sequences. Choose which sequences to draw: you can select set any set of sequences from 1 to max.
Parallel coordinates plot for sequence data
Embedding. The embedding options of the TraMineR function are fixed to "non-embeddable" for the type of sequence and "most-frequent" for the embedding method.
Squared symbol size, Line width and Translation zone. Choose the size of the squared symbols drawn at the origin and destination of each segment, choose the line width factor and the size of the grey squares containing the squared symbols.
Slide plot options
Download slide plot function. The implementation of the slide plot may be downloaded with the dowload button (side panel). By clicking this button, you download a text file with .R extension which can be executed or sourced.
Threshold. The slide plot often draws a huge number of micro variations which may harm the readability. Values under the set
thresholdwill be treated as residuals. The threshold value is considered as the minimal value.
Mask. After setting the
threshold, you can choose to draw or to mask the micro variations. If you mask them, they won't appear, if you draw them, they will appear in light grey.
Show frequencies. Use this checkbox to show or mask the frequency of transitions. By checking the box, you show all the frequencies above the
Minimal thickness. The thickness of each segment is proportional to the frequency of the aggregated trajectories it represents. The thickness depends on three parameters: the
thresholdparameter, the maximal value of the dataset and the
minimal thicknessparameter. The
minimal thicknessis assigned to the threshold value and the maximal thickness is assigned to the maximal value. Fot that reason, you can modify the overall appearance using both parameters: