# SEQ Hypothesis Testing

In [11]:
# Load necessary libraries
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("seq.csv")

# Add a unique Participant ID assuming each row is a unique participant
df["Participant"] = range(len(df))

# Reshape the dataset into long format
df_long = df.melt(id_vars=["Participant", "Controller"], var_name="Trial", value_name="SEQ")

# Keep only real-world trials and remove simulation trials
df_rw_long = df_long[df_long["Trial"].str.startswith("RW")].copy()

# Extract Modality (WITH-VR or NO-VR)
df_rw_long["Modality"] = df_rw_long["Trial"].apply(lambda x: "WITH-VR" if "WITH-VR" in x else "NO-VR")

# Extract the trial number as a float (only real-world trial names contain digits);
# expand=False returns a Series rather than a one-column DataFrame
df_rw_long["Trial_Num"] = df_rw_long["Trial"].str.extract(r"(\d+)", expand=False).astype(float)

# Fit the Linear Mixed-Effects Model
model = smf.mixedlm("SEQ ~ Controller * Modality * Trial_Num", df_rw_long, groups=df_rw_long["Participant"])
result = model.fit()

# Display the model summary
print(result.summary())


                          Mixed Linear Model Regression Results
Model:                         MixedLM            Dependent Variable:            SEQ      
No. Observations:              120                Method:                        REML     
No. Groups:                    20                 Scale:                         0.9640   
Min. group size:               6                  Log-Likelihood:                -181.1326
Max. group size:               6                  Converged:                     Yes      
Mean group size:               6.0                                                        
------------------------------------------------------------------------------------------
                                                Coef.  Std.Err.   z    P>|z| [0.025 0.975]
------------------------------------------------------------------------------------------
Intercept                                        4.567    0.516  8.849 0.000  3.555  5.578
Controller[T.WBC]         
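
When the printed summary is hard to scan or gets cut off, the fixed-effect estimates can also be collected into a DataFrame. The following is a minimal sketch on synthetic data, not the study's `seq.csv`: the simulated effect sizes and the names `demo`, `demo_result`, and `coef_table` are assumptions made purely for illustration.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic stand-in for the study data (NOT the real SEQ scores):
# 20 participants, controller varies between participants,
# 2 modalities x 3 trials within each participant.
rng = np.random.default_rng(0)
rows = []
for pid in range(20):
    controller = "SBC" if pid < 10 else "WBC"
    offset = rng.normal(0.0, 1.0)  # participant-specific random intercept
    for modality in ("NO-VR", "WITH-VR"):
        for trial in (1, 2, 3):
            seq = (5.0 + offset
                   - 1.0 * (modality == "WITH-VR")  # assumed modality effect
                   + 0.2 * trial                    # assumed trial effect
                   + rng.normal(0.0, 1.0))          # residual noise
            rows.append((pid, controller, modality, float(trial), seq))
demo = pd.DataFrame(
    rows, columns=["Participant", "Controller", "Modality", "Trial_Num", "SEQ"]
)

# Same model formula and grouping as in the cell above
demo_result = smf.mixedlm(
    "SEQ ~ Controller * Modality * Trial_Num", demo, groups=demo["Participant"]
).fit()

# The first k_fe parameters are the fixed effects; the remaining entry is
# the random-intercept variance, which is left out of the table.
k_fe = demo_result.model.k_fe
ci = demo_result.conf_int().iloc[:k_fe]
coef_table = pd.DataFrame({
    "coef": demo_result.params.iloc[:k_fe],
    "std_err": demo_result.bse.iloc[:k_fe],
    "z": demo_result.tvalues.iloc[:k_fe],
    "p_value": demo_result.pvalues.iloc[:k_fe],
    "ci_low": ci.iloc[:, 0],
    "ci_high": ci.iloc[:, 1],
})
print(coef_table.round(3))
```

Because the formula uses treatment coding together with interaction terms, each lower-order row describes an effect at the reference level of the other factors. For example, `Modality[T.WITH-VR]` is the WITH-VR minus NO-VR difference for the reference controller at `Trial_Num = 0`.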

## Statistical Approach

To analyze the SEQ (Single Ease Question) scores collected in the user study, a Linear Mixed-Effects Model (LMM) was fitted. This model was chosen because of the study's mixed design, which combines one between-subjects factor with two within-subjects factors. Controller type (SBC vs. WBC) is the between-subjects factor, since each participant used only one controller. Modality (WITH-VR vs. NO-VR) and trial number are the within-subjects factors, since each participant tested both modalities and performed three trials per modality. The LMM includes a random effect at the participant level to account for individual differences. Unlike repeated-measures ANOVA, it does not rely on the sphericity assumption, and it copes better with hierarchical, missing, and unbalanced data.

The main effect of controller type (SBC vs. WBC) was not significant (p = 0.386). The mean SEQ scores for SBC and WBC could not be statistically distinguished, which suggests that participants perceived the complexity of the two interfaces similarly after each trial. The main effect of modality (WITH-VR vs. NO-VR) was significant (p = 0.003): WITH-VR was rated lower than NO-VR, so participants found the task harder in the WITH-VR condition. The main effect of trial number did not reach significance (p = 0.068), although SEQ scores increased slightly over trials. This may point to a learning effect, with participants finding the task easier on repeated trials. No interaction effect was significant.

In conclusion, modality is the strongest factor influencing SEQ scores, with WITH-VR leading to significantly lower ratings than NO-VR. Controller type does not significantly affect SEQ scores, and the slight increase in SEQ scores over trials is not statistically significant. Because no interaction was significant, there is no evidence that the effect of trial or controller differs between modalities.
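
For reference, the formula `SEQ ~ Controller * Modality * Trial_Num` with a random intercept per participant can be written out as follows. The notation is introduced here for illustration only: $i$ indexes participants and $j$ their observations, $C_i = 1$ for WBC, $M_{ij} = 1$ for WITH-VR, $T_{ij}$ is the trial number, and $u_i$ is the participant-level random intercept.

```latex
\begin{aligned}
\mathrm{SEQ}_{ij} ={}& \beta_0 + \beta_1 C_i + \beta_2 M_{ij} + \beta_3 T_{ij} \\
&+ \beta_4 C_i M_{ij} + \beta_5 C_i T_{ij} + \beta_6 M_{ij} T_{ij}
 + \beta_7 C_i M_{ij} T_{ij} \\
&+ u_i + \varepsilon_{ij},
\qquad u_i \sim \mathcal{N}(0, \sigma_u^2),
\quad \varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2)
\end{aligned}
```

Under this reading, $\beta_1$, $\beta_2$, and $\beta_3$ correspond to the controller, modality, and trial terms discussed above, $\beta_4$ to $\beta_7$ are the interaction terms, and the `Scale` entry in the statsmodels summary is the estimate of the residual variance $\sigma^2$.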

