CHS Analysis GRM

The present analysis implements a Bayesian graded response model (GRM) to estimate the responses to the 44 items of a cybersecurity habits scale (CHS), where the latent trait (ψ) is understood to be participants' cybersecurity habits (i.e. how secure their behaviour is when using electronic devices such as phones and computers). Although the scale comprises three dimensions (blocking, hiding, inspecting) representing possible types of cybersecurity behaviours/habits, the present model makes no assumption about differences across these dimensions.

Model

The model follows a conventional GRM approach, with a discrimination parameter varying over items (δ_i), a latent trait parameter varying over subjects/participants (ψ_s), and a conventionality parameter varying over items and ordered score categories (κ_i,c), which indicates how common or uncommon it is to engage in the behaviour described by the item (this is referred to as difficulty when a test involves correct/incorrect responses). In other words, an item that most participants answer with the highest score, irrespective of their level of cybersecurity habits, would indicate behaviour that is too conventional, and vice-versa. In the present case the discrimination parameter (δ) multiplies only the participants' trait parameter (ψ), that is, η = δψ rather than δ(ψ - κ), which provided better convergence and fit. This is reasonable if we think of δ and κ as the scale and location of ψ, respectively. The model is completed by an ordered logistic distribution over the estimated response probabilities (ŷ).

δ_i ~ Log-normal(0, 0.5), i = 1, ..., 44 (items)

ψ_s ~ Normal(0, 0.5), s = 1, ..., 134 (subjects)

μ ~ Normal(0, 0.05)

σ ~ Half-normal(0.5)

κ_i,c ~ Normal(μ, σ), i = 1, ..., 44 (items), c = 1, ..., C-1 (cutpoints), C = 5

η = δ_iψ_s

logit^-1(x) = 1/(1 + e^-x)

ŷ = 1 - logit^-1(η - κ_1), if c = 0

ŷ = logit^-1(η - κ_c) - logit^-1(η - κ_(c+1)), if 0 < c < C-1

ŷ = logit^-1(η - κ_(C-1)), if c = C-1, with scores c = 0, ..., C-1 and C = 5
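As a rough illustration of the ordered logistic step, the sketch below computes the probability of each score for a single subject and item. It assumes scores run from 0 to 4 and that the four cutpoints are indexed κ_1 to κ_4 in increasing order; the function name `grm_category_probs` and all numeric values are hypothetical and are not taken from `chs_GRM.py`.

```python
import math


def inv_logit(x):
    """Logistic function, logit^-1(x) = 1 / (1 + e^-x)."""
    return 1.0 / (1.0 + math.exp(-x))


def grm_category_probs(delta, psi, cutpoints):
    """Probability of each score 0..len(cutpoints) under an ordered logistic.

    cutpoints[0] is kappa_1, cutpoints[1] is kappa_2, and so on; they are
    assumed to be sorted in increasing order.
    """
    eta = delta * psi        # linear predictor: discrimination times trait
    last = len(cutpoints)    # highest possible score, C - 1
    probs = []
    for c in range(last + 1):
        if c == 0:
            p = 1.0 - inv_logit(eta - cutpoints[0])
        elif c < last:
            p = inv_logit(eta - cutpoints[c - 1]) - inv_logit(eta - cutpoints[c])
        else:
            p = inv_logit(eta - cutpoints[last - 1])
        probs.append(p)
    return probs


# Hypothetical values chosen only for illustration (not model estimates)
probs = grm_category_probs(delta=1.2, psi=0.0, cutpoints=[-1.5, -0.5, 0.5, 1.5])
print([round(p, 3) for p in probs])
```

With ψ = 0 and cutpoints placed symmetrically around zero, the five probabilities are symmetric around the middle score and sum to 1, which is a quick sanity check on the indexing.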

Before running the model, we used a prior predictive check to ensure that the priors were weakly informative but sensible. Having no specific expectations per item, we placed the greatest probability on the midpoint of the Likert scale (2 points). The image below gives an example of the prior predictive probabilities for item 33.

Results

We sampled the model using Markov chain Monte Carlo (MCMC) with the No-U-Turn Sampler (NUTS), using 2000 tuning steps, 2000 samples, 4 chains and a target acceptance rate of 0.99. The model sampled well, with 0.99 < R̂ < 1.01, BFMIs > 0.8, and bulk ESS > 1800 for all parameters. Posterior predictive checks show excellent predictive capacity, as indicated in the image below.

Expected values are estimated with very good precision (maximum ~15%), although this figure is slightly higher than in previous structural equation models (https://github.com/SimonErnesto/bsem_precision_analysis). The image below shows an example.

Informative items should present item characteristic curves (ICCs) in which the probability of giving low scores to an item is high when a participant has low cybersecurity habits (ψ), and vice-versa. The ICC is simply the model's expected probability of each score, evaluated across the range of the estimated cybersecurity habits parameter (ψ). The image below shows an example of an informative item, where low cybersecurity habits are associated with a high probability (68%) of giving a score of 0, while high cybersecurity habits are associated with a high probability (63%) of giving a score of 4.
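The shape of an ICC can be sketched by evaluating the score probabilities along a grid of ψ values. The example below is only a sketch under stated assumptions: it builds each score probability as a difference of cumulative probabilities P(score ≥ c) = logit^-1(δψ - κ_c), and the function name `icc`, the grid, and the parameter values are hypothetical rather than estimates from the fitted model.

```python
import math


def icc(delta, cutpoints, psi_grid):
    """Return one probability curve per score, evaluated along psi_grid."""
    curves = [[] for _ in range(len(cutpoints) + 1)]
    for psi in psi_grid:
        eta = delta * psi
        # P(score >= c): 1 for c = 0, logistic for each cutpoint, 0 past the top score
        at_least = (
            [1.0]
            + [1.0 / (1.0 + math.exp(-(eta - k))) for k in cutpoints]
            + [0.0]
        )
        # P(score = c) = P(score >= c) - P(score >= c + 1)
        for c in range(len(curves)):
            curves[c].append(at_least[c] - at_least[c + 1])
    return curves


psi_grid = [-3.0 + 0.1 * j for j in range(61)]  # psi from -3 to 3
# Hypothetical item: discrimination 2.0 and evenly spaced cutpoints
curves = icc(delta=2.0, cutpoints=[-1.5, -0.5, 0.5, 1.5], psi_grid=psi_grid)

# Most probable score at the lowest and highest trait values on the grid
low_end = max(range(5), key=lambda c: curves[c][0])
high_end = max(range(5), key=lambda c: curves[c][-1])
print(low_end, high_end)  # prints: 0 4
```

For an item like this hypothetical one, the curve for score 0 falls steadily as ψ increases while the curve for score 4 rises, which is the pattern described above for an informative item.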

Similarly, informative items should present item information curves (ITCs) in which the information carried by low scores peaks at low values of cybersecurity habits (ψ), and vice-versa. The ITC follows the function I(ψ) = δ_i² p(1 - p), where p = logit^-1(δ_iψ - κ_i,c), evaluated across the range of the estimated cybersecurity habits parameter (ψ). The image below shows an example of an informative item, where the information peaks of successive scores are located at increasingly higher values of ψ. This indicates that the item's scores are informative with respect to participants' cybersecurity levels.
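To make the information function concrete, the sketch below evaluates I(ψ) = δ² p(1 - p) with p = logit^-1(δψ - κ) for each cutpoint of a single item and locates the peak on a grid. The helper `cutpoint_information` and all numeric values are assumptions made for illustration only; they are not the estimates reported in this analysis.

```python
import math


def cutpoint_information(delta, kappa, psi):
    """I(psi) = delta^2 * p * (1 - p), with p = logit^-1(delta * psi - kappa)."""
    p = 1.0 / (1.0 + math.exp(-(delta * psi - kappa)))
    return delta ** 2 * p * (1.0 - p)


delta = 2.0                                        # hypothetical discrimination
cutpoints = [-1.5, -0.5, 0.5, 1.5]                 # hypothetical ordered cutpoints
psi_grid = [-3.0 + 0.05 * j for j in range(121)]   # psi from -3 to 3

peaks = []  # (location, height) of the information peak for each cutpoint
for kappa in cutpoints:
    info = [cutpoint_information(delta, kappa, psi) for psi in psi_grid]
    j_max = max(range(len(psi_grid)), key=lambda j: info[j])
    peaks.append((psi_grid[j_max], info[j_max]))
    print(f"kappa={kappa:+.1f}: peak at psi={psi_grid[j_max]:+.2f}, info={info[j_max]:.2f}")
```

Because p(1 - p) is largest at p = 0.5, each curve peaks where δψ = κ, that is at ψ = κ/δ, with height δ²/4. Larger cutpoints therefore push the peak towards higher ψ, and a larger δ makes the peak both taller and narrower.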

Finally, the test characteristic curve (TCC: the sum of expected probabilities across items) and the test information curve (TIC: Σ_i I_i(ψ), the item information summed over the 44 items) indicate that, overall, the CHS provides good information about cybersecurity habits. The images below show the TCC and TIC. The TCCs indicate a stronger preference for higher scores (3 and 4) when cybersecurity habits are high, but a high overlap of scores when cybersecurity habits are low. The TICs suggest that scores are generally informative at sensible cybersecurity habits values (chv): score 0 peaks (~20 info) between -1.5 and -1.0 chv, score 1 peaks (~25 info) between -1.0 and -0.5 chv, score 2 peaks (~20 info) between -0.5 and 0.0 chv, score 3 peaks (~28 info) around 0.5 chv, and score 4 peaks (~25 info) between 0.5 and 1.0 chv. That is, low scores appropriately represent low habits, and vice-versa.

Conclusion

The present results indicate that the cybersecurity habits scale (CHS) is generally effective for inferring and predicting cybersecurity habits. Even so, most items seem to be only weakly informative, which suggests that shortening the scale is a relevant next step. The CHS can be tuned to become a useful instrument for measuring cybersecurity behaviours related to cyber-hygiene habits.
