# <span style="color:brown">Using R in Statistics II: Initialization</span>

---


## <span style="color:brown;">Contents</span>

In this chapter we load the R libraries and set up the data sets that we may wish to use throughout the Jupyter Book covering the contents of the Statistics II subject.


## <span style="color:blue;">Preparing R and the data</span>

---

To illustrate the concepts we have introduced, and to motivate possible choices of good estimators, we will consider specific examples, mostly based on real data, which we will process using <span style="color:blue;font-family:monospace;font-size:90%;">R</span>.

We start by preparing <span style="color:blue;font-family:monospace;font-size:90%;">R</span> to read and manipulate the data mentioned above. In the following <span style="color:blue;font-family:monospace;font-size:90%;">R</span> <span style="color:brown">code cell</span> we:

1. Load the <span style="color:blue;font-family:monospace;font-size:90%;">R</span> libraries we are going to need for our examples.
2. Define a function, <span style="color:blue;font-family:monospace;font-size:90%;">table_prnt</span>, specifying the format for the tables that will present the numerical results in this lesson.
3. Introduce information to work with the available data sets.
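
Step 1 assumes that every library is already installed. As a minimal sketch that is not part of the original cell, the following base R code checks which of the packages loaded in this chapter are missing. The variable names `pkgs`, `installed` and `missing.pkgs` are our own choice for this illustration.

```r
# Packages loaded in the code cell of this chapter
pkgs <- c("tidyverse", "huxtable", "knitr", "kableExtra", "IRdisplay",
          "sjPlot", "gridExtra", "qqplotr", "GGally", "car", "grid")

# requireNamespace() returns FALSE (instead of stopping) when a package is absent
installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
missing.pkgs <- pkgs[!installed]

if (length(missing.pkgs) > 0) {
    message("Install before continuing: ", paste(missing.pkgs, collapse = ", "))
} else {
    message("All required packages are available.")
}
```

Any package reported as missing can then be installed with `install.packages(missing.pkgs)` before running the initialization cell.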

The <span style="color:brown;">available data sets</span> and their identifying codes are:

1. Hourly prices for the Iberian electricity market
2. Grades for a Statistics subject in UC3M
3. Share prices for a company (Iberdrola) from the IBEX index
4. Simulated data from an N(80,30) distribution (var 1), an Exp(lambda=1/30) distribution (var 2) and a Binom(20,0.4) distribution (var 3)
5. Data from the Sustainable Development Report 2021, with the scores by country for goals 1 and 2
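
To give an idea of what data set 4 contains, the sketch below simulates three variables with the distributions listed above. It is only an illustration, not the code used to create the actual file: the sample size `n`, the seed and the column names are hypothetical, and we assume that the second parameter of N(80,30) is the standard deviation.

```r
set.seed(1)   # hypothetical seed, only for reproducibility of this sketch
n <- 100      # hypothetical sample size

sim <- data.frame(
    var1 = rnorm(n, mean = 80, sd = 30),      # assumption: 30 is the standard deviation
    var2 = rexp(n, rate = 1/30),              # exponential with lambda = 1/30
    var3 = rbinom(n, size = 20, prob = 0.4)   # binomial with 20 trials, p = 0.4
)

head(sim)
```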

To add another data set to this collection, you should provide two pieces of information: the name of the <span style="color:blue;font-family:monospace;font-size:90%;">.csv</span> file containing the data and a short text describing the data.
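
One possible way to organize this information is sketched below. The object `data.sets`, its column names and all the file names are hypothetical and chosen only for illustration; the actual variables used in this book may be named differently.

```r
# Hypothetical registry: one row per data set (code, file name, description)
data.sets <- data.frame(
    code  = 1:2,
    file  = c("electricity_prices.csv", "grades.csv"),   # illustrative file names
    descr = c("Hourly prices for the Iberian electricity market",
              "Grades for a Statistics subject in UC3M"),
    stringsAsFactors = FALSE
)

# Adding a new data set: supply its .csv file and a short description
new.set <- data.frame(
    code  = nrow(data.sets) + 1,
    file  = "my_data.csv",                               # illustrative file name
    descr = "Short description of the new data",
    stringsAsFactors = FALSE
)
data.sets <- rbind(data.sets, new.set)

print(data.sets)
```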

It is also important to ensure that the <span style="color:brown;">working directory</span> has been <span style="color:brown;">selected correctly:</span> it must be the directory that contains all the data sets that could be used in this lesson.
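
A quick way to verify this is sketched below using base R only. The variable `data.dir` is our own name for this illustration, and here it simply takes the current working directory; replace it with the path of your data directory if the two differ.

```r
# Assumption: the data files live in the current working directory
data.dir <- getwd()
# setwd("path/to/your/data")   # uncomment and edit to change the directory

# List the .csv files that R can see in that directory
csv.files <- list.files(data.dir, pattern = "\\.csv$", ignore.case = TRUE)

if (length(csv.files) == 0) {
    message("No .csv files found in ", data.dir)
} else {
    print(csv.files)
}
```

If the list is empty, or the expected files are not in it, the working directory does not point to the data sets.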

To execute the commands in the cell, select the cell by clicking on it, and then <span style="color:blue;">press the **RUN** button</span> in the menu bar, or press <span style="color:blue;">Shift-Enter.</span>


```r
#options(jupyter.plot_mimetypes = c("text/plain","image/png"))

# Load libraries with R functions

suppressMessages(library(tidyverse))
suppressMessages(library(huxtable))
library(knitr)
suppressMessages(library(kableExtra))
library(IRdisplay)
suppressMessages(library(sjPlot))
suppressMessages(library(gridExtra))
suppressMessages(library(qqplotr))
suppressMessages(library(GGally))
suppressMessages(library(car))
library(grid)

# Define a function to format and print the results of interest

outp.type <- 0   # = 1 for html output, = 0 for Jupyter Books

if (outp.type == 1) {
    table_prnt <- function(p.df, p.capt) {
        # A function to control the presentation of tables with numerical results
        p.df %>%
            kable("html", caption = paste0('<em>', p.capt, '</em>'), align = 'r') %>%
            kable_styling(full_width = FALSE, position = "left") %>%
            as.character() %>%
            display_html()
    }
} else {
    table_prnt <- function(p.df, p.capt) {
        # A function to control the presentation of tables with numerical results
        p.df %>% kable("simple", caption = p.capt, align = 'r')
    }
}
```
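
As a usage illustration, the sketch below calls <span style="color:blue;font-family:monospace;font-size:90%;">table_prnt</span> on a small table of summary statistics. So that it can run on its own, it restates the Jupyter Book branch of the function without the pipe operator; the vector `x` and the names `res` and `out` are hypothetical and serve only as an example.

```r
library(knitr)

# Stand-alone restatement of the Jupyter Book branch of table_prnt
table_prnt <- function(p.df, p.capt) {
    kable(p.df, format = "simple", caption = p.capt, align = "r")
}

# Hypothetical values, used only to build a small table of results
x <- c(12, 15, 9, 20, 14)
res <- data.frame(
    Statistic = c("Mean", "Std. dev."),
    Value     = c(mean(x), sd(x))
)

out <- table_prnt(res, "Summary statistics for an illustrative sample")
out
```

The function takes a data frame and a caption, so any table of numerical results in later chapters can be presented with the same format.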
