# Objective:
Using your collected data, you will assess the quality of items in your scale through descriptive statistics, visualizations, and correlations. You will use these results to determine which items may need to be revised or removed.

## Load packages

There are two focal packages that we will use. The `psych` package must first be installed. Then, we will load the `psych` and `tidyverse` packages.

## 1) Install and load both packages

Use separate chunks for installing and loading the packages. Installing the `psych` package takes a while, so keeping the installation in its own chunk means you won't have to repeat it each time you re-run the loading chunk.
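One possible way to avoid repeating the slow installation is to install `psych` only when it is missing. This is a minimal sketch, not the required solution: the CRAN mirror URL is an assumed choice, and it assumes the `tidyverse` is already installed.

```r
## Install psych only if it is not already available
## (the mirror below is an assumed choice; any CRAN mirror should work)
if (!requireNamespace("psych", quietly = TRUE)) {
  install.packages("psych", repos = "https://cloud.r-project.org")
}

## In your notebook, these two lines would go in their own chunk
library(psych)
library(tidyverse)
```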

In [None]:
## Install package(s)


In [None]:
## Load packages


## Load the class data

Each group was given a unique identifier for their items. This identifier will be how we specifically extract *your* group's data (i.e., items). This first chunk will load all of the data from our class survey.

**Data Structure**
- These data are in "wide" format wherein each respondent is placed on a row and their columns represent responses to each question/variable.

- All items have been recoded into numeric values for you. The lowest response option (e.g., "Strongly disagree") was converted to a 1 and the highest (e.g., "Strongly agree") to the highest value (e.g., 5 or 7).

- Columns represent individual items/variables.

In [None]:
## Load data
## Set the URL to Casey's GitHub page where the dataset is located
FileURL <- "https://raw.githubusercontent.com/CaseyGio/Psyc6263/refs/heads/main/Datasets/TestData.csv"

## Read the csv file from GitHub and create a new object
QualtricsData <- read_csv(url(FileURL))

## View the first few rows of data
QualtricsData %>% head(n = 10)

## Data wrangling

Now that data are loaded, we will conduct two steps for our data wrangling: (a) select your group's items and ignore the other items and (b) create a composite score.

## 2) Select *your* group's data

In this chunk, we will use the `starts_with()` helper inside the `select()` function, which picks columns, to extract only your group's items. Inside the quotation marks, insert your group identifier. If done correctly, you will have an object named `GrpData` (you can change this name) containing all of your items.

Note: I have added your group's items in the same order that you shared them with me on Canvas. Your first item is represented by [identifier]Q01 and your last as [identifier]Q10.

In [None]:
## Select YOUR data
GrpData <-
  QualtricsData %>%
  select(starts_with(""))

## View the first few rows of data
GrpData %>%
  head(n = 10)

## 3) Create a composite score (average)

In case we have missing data (e.g., a respondent did not answer an item), we shall default to using the average/mean to calculate our composite scale score. In this step, use the `rowMeans()` and `mutate()` functions, like in our previous week's demonstration, to create the composite score.

- Create a new object (e.g., GrpClean) to store these changes

- Use the `mutate()` and `rowMeans()` functions to create a new variable/column representing the total score. Name this column (e.g., [identifier]Total).
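The two steps above could look roughly like the following. This is a sketch on made-up data: the identifier `Demo`, the object names, and the values are all hypothetical, and it may differ from the approach shown in the previous week's demonstration.

```r
library(dplyr)

## Toy data with a made-up identifier ("Demo"); NOT the class dataset
ToyData <- tibble(
  DemoQ01 = c(4, 5, 2),
  DemoQ02 = c(3, NA, 2),
  DemoQ03 = c(5, 4, 1)
)

## Average the item columns row by row; na.rm = TRUE skips missing responses
ToyClean <-
  ToyData %>%
  mutate(DemoTotal = rowMeans(select(., starts_with("DemoQ")), na.rm = TRUE))

## View the result
ToyClean
```

Note that the second respondent skipped one item, so their composite is the mean of the two items they did answer. This is why a mean is preferred over a sum when data may be missing.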

In [None]:
## Create a new object with the composite/total score


## Descriptive statistics

In this section, we shall begin our item analyses by calculating descriptive statistics.

## 4) Calculate the following descriptive statistics:

- The mean/average score for every item, including for the total score

- The standard deviation (SD) for every item, including for the total score

- The skewness (skew) for every item, including for the total score
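One way to get all three statistics at once is the `describe()` function from the `psych` package. The sketch below uses made-up data (the `Demo` names and values are hypothetical) and keeps only the columns listed above.

```r
library(psych)

## Toy item responses on a 1-5 scale; NOT the class dataset
ToyItems <- data.frame(
  DemoQ01 = c(1, 2, 3, 4, 5, 5, 4, 3),
  DemoQ02 = c(5, 5, 4, 5, 5, 3, 5, 5),
  DemoQ03 = c(2, 3, 3, 4, 2, 3, 3, 4)
)

## Add a composite (mean) score so it is described alongside the items
ToyItems$DemoTotal <- rowMeans(ToyItems, na.rm = TRUE)

## describe() returns one row per variable; keep the columns of interest
ToyStats <- describe(ToyItems)
ToyStats[, c("n", "mean", "sd", "skew")]
```

In this toy example, `DemoQ02` has most responses piled at the top of the scale, which shows up as a high mean and a negative skew.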

In [None]:
## Calculate item descriptive statistics


## 5) Identify any potentially problematic items

Based on these descriptive statistics, begin to identify any problematic items. Which items have unusual means (e.g., too high/low)? Standard deviations (e.g., too low)? Skewnesses (e.g., deviates too much from 0)?

Please elaborate on *why* you think these items are problematic. If you think all of your items are "good," please describe why you think that is the case (e.g., what value or range of values are you considering "good").

[You can edit this textbox to write your answers here]

## 6) Visualize item distributions

Let's visualize our item-level response distributions. Recall that we can accomplish this in two steps/code chunks: (a) reshape our data from "wide" to "long" format using the `pivot_longer()` function, then (b) plot the long data using the `ggplot2` package with the `facet_wrap()` function.
- Given our number of response options, a histogram (`geom_histogram()`) may be best.
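The reshape-then-plot pattern described above might look like this on made-up data. The column names `Item` and `Response`, the `Demo` identifier, and the `binwidth` of 1 are assumptions chosen for illustration.

```r
library(dplyr)
library(tidyr)
library(ggplot2)

## Toy item responses on a 1-5 scale; NOT the class dataset
ToyItems <- tibble(
  DemoQ01 = c(1, 2, 3, 4, 5, 5, 4, 3),
  DemoQ02 = c(5, 5, 4, 5, 5, 3, 5, 5),
  DemoQ03 = c(2, 3, 3, 4, 2, 3, 3, 4)
)

## (a) Wide to long: one row per respondent-item combination
ToyLong <-
  ToyItems %>%
  pivot_longer(cols = everything(),
               names_to = "Item",
               values_to = "Response")

## (b) One histogram panel per item; binwidth = 1 gives one bar per option
ToyPlot <-
  ggplot(ToyLong, aes(x = Response)) +
  geom_histogram(binwidth = 1) +
  facet_wrap(~ Item)

print(ToyPlot)
```

The long data have 24 rows here (8 respondents times 3 items), which is a quick check that the reshape worked as intended.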

In [None]:
## Reshape the data into long format using pivot_longer


In [None]:
## Visualize the item distributions with facet_wrap


## 7) Interpret the visualizations

Based on the quantitative analyses in Q4/Q5 above and the visualizations you just created, describe any unusual or potentially problematic items. There may or may not be new/different interpretations given the visualization data. It is okay to repeat yourself if the quantitative and qualitative (visualization) portions produce the same conclusions.

[You may edit this textbox to put your answers here]

## 8) Calculate the correlation matrix for your items

The dimensions of this matrix (i.e., number of rows & columns) are based on the number of items you have, including the total score.
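As an illustration, three items plus a total score produce a 4 x 4 matrix. The sketch below uses base R's `cor()` on made-up data (the `Demo` names and values are hypothetical). The `use = "pairwise.complete.obs"` setting is one assumed way to handle missing responses.

```r
## Toy item responses on a 1-5 scale; NOT the class dataset
ToyItems <- data.frame(
  DemoQ01 = c(1, 2, 3, 4, 5, 5, 4, 3),
  DemoQ02 = c(5, 5, 4, 5, 5, 3, 5, 5),
  DemoQ03 = c(2, 3, 3, 4, 2, 3, 3, 4)
)
ToyItems$DemoTotal <- rowMeans(ToyItems, na.rm = TRUE)

## Correlate every column with every other column
ToyCor <- cor(ToyItems, use = "pairwise.complete.obs")

## Round for readability; the DemoTotal row/column holds the
## item-total correlations
round(ToyCor, 2)
```

Because the total score here includes each item itself, these item-total correlations run somewhat higher than they would if the item were left out of the total.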

In [None]:
## Create the correlation matrix amongst your items


## 9) Interpret the item correlations

Based on your correlation matrix, particularly the item-total correlations, have you identified any potentially problematic or unusual items? Please elaborate on *why* you think the item(s) is/are problematic/unusual. If you think none are problematic, please elaborate on why you think this is the case.

[You may edit this textbox to include your answers]

## 10) Summarize your item analyses and conclusions

Based on all of your item analyses, what are the items that you think are showing signs of problems or are unusual and merit further investigation? You may refer to the actual wording of the items, too. Be sure to discuss the **item difficulty** and **item discrimination** concepts we have learned about in class.

[You may edit this textbox to include your answers]