112 update ses by arcaldwell49 · Pull Request #114 · Lakens/TOSTER

arcaldwell49 · 2026-03-13T20:07:23Z

Updated SES functions, documentation, and added other features for robust TOST. This should be the final big changes before writing the updated preprint and submitting to AMPSS.

Allow for permutation tests directly on the SES scale.

Needed to improve workflow when working back and forth together on code improvements.

Added more stable method for small samples with the two-sample case.

added default behavior for SE calculations for non-parametric SES calculations.

Update SE method

Finally got BCa some people have been asking for! Not the default and computationally heavy but it is there if people want to use it.

Needed to studentized test. Changed now and fixed. Checked locally with other tests.

Need some fixes to compare wilcox.test to hodges_lehmann

Need to add score method to the SES calculation

want to make it run faster on CRAN and not overload the system

smd_calc functions now allow direct specification of the denominator to use for SMD calculations.

Mainly, two-sample tests now include mean difference in output

Error where paired didn't implement plusone properly for randomization test. Also added reference document for future calibration for TOST

Essentially avoids edge case errors. Also made slight edits to SMD_calcs vignette and added some key references.

Adding new labels for rank probability estimates. Also, updating labels so output is easier to understand. Need to update labeling for simple_htest, perm_t_test, boot_t_test, etc to follow similar conventions.

Note, similar implementation still needed for other htest functions

Streamlined and easier (?) to understand now.

still working out kinks. need to update NEWS to reflect incoming changes.

reflect correlation test changes.

…o 112-update-ses

Need to fix error for boot_ses_calc

Introduced error in description. This should fix it to avoid issue in example.

noticed var.equal got ignored.

Bootstrap t-test has error due to trimming

Need to fix tests for output checks on string. Also, should probably update labels for cor_test/corsum_test/boot_cor_test.

…o 112-update-ses

Very broken at the moment. Need to fix later.

Copilot

Pull request overview

This PR finalizes a broad update to TOSTER’s robust testing and effect-size infrastructure, including new transformations/utilities, expanded bootstrap CI support (incl. BCa), updated documentation/vignettes, and strengthened test coverage.

Changes:

Added new effect-size helpers (trans_rank_prob(), rank_diff()) and expanded labeling/sample-size reporting across htest outputs.
Added/extended bootstrap CI methods (including BCa) and aligned bootstrap p-values with the selected CI method.
Updated documentation (Rd + vignettes) and added extensive regression tests for robust procedures (Brunner–Munzel, bootstrap tests, etc.).

Reviewed changes

Copilot reviewed 101 out of 111 changed files in this pull request and generated 10 comments.

Show a summary per file

File	Description
vignettes/hypothesis_testing.R	Adds new hypothesis-testing vignette content (as an `.R` spin-style file).
vignettes/correlations.Rmd	Enables citation linking and refines narrative guidance.
vignettes/correlations.R	Removes previously included “advanced usage” examples from the spun `.R`.
vignettes/SMD_calcs.Rmd	Expands SMD vignette (robust trimming section, hypothesis-testing examples).
vignettes/SMD_calcs.R	Updates spun vignette code to match new examples.
vignettes/IntroductionToTOSTER.Rmd	Enables citation linking.
vignettes/IntroductionToTOSTER.R	Normalizes chunk header formatting in spun `.R`.
vignettes/IntroTOSTt.Rmd	Enables citation linking.
tests/testthat/test-wilcox.R	Adds bootstrap SES tests and reorganizes one-/two-/paired coverage.
tests/testthat/test-twoprop.R	Splits a long randomized test into a separate `skip_on_cran()` block.
tests/testthat/test-trans_rank_prob.R	Adds unit tests for `trans_rank_prob()` transformations and round-trips.
tests/testthat/test-tTOST.R	Adds explicit boot-vs-analytic SMD parity tests; adjusts older expectations.
tests/testthat/test-smd-se.R	Updates tests to request `output = "data.frame"` explicitly.
tests/testthat/test-rank_diff.R	Adds full coverage for new `rank_diff()` transformation.
tests/testthat/test-log.R	Marks heavy tests `skip_on_cran()` and adds CI/p-value agreement checks.
tests/testthat/test-htest.R	Updates expectations given new estimate structure/labels; adds regression test for `$estimate`.
tests/testthat/test-brunner_munzel_scale.R	Adds tests for new `scale` reporting in `brunner_munzel()`.
tests/testthat/test-brunner_munzel_perm_pval.R	Adds regression tests for paired permutation `p_method` routing.
tests/testthat/test-brunner_munzel_degenerate.R	Adds regression tests for multiple Brunner–Munzel edge cases.
tests/testthat/test-boot_t_test_variance_fix.R	Adds regression tests for a bootstrap variance/centering fix.
tests/testthat/test-boot_t_test_trimmed.R	Adds comprehensive tests for trimmed-mean support in `boot_t_test()`.
tests/testthat/test-bootTOST.R	Adds CI/p-value agreement tests for `boot_t_TOST()`.
man/z_cor_test.Rd	Documents `se_method` and updated stderr/estimate semantics.
man/wilcox_TOST.Rd	Documents new `se_method` options for Wilcoxon-based TOST.
man/trans_rank_prob.Rd	Adds new exported documentation for `trans_rank_prob()`.
man/stud_ci.Rd	Documents internal studentized CI helper for correlations.
man/smd_calc.Rd	Documents new denom/output/hypothesis-testing/trimming capabilities.
man/ses_calc.Rd	Documents new `score` SE method + correction options and boundary handling.
man/rank_diff.Rd	Adds new exported documentation for `rank_diff()`.
man/perm_t_test.Rd	Documents NPC-based equivalence/minimal-effect permutation approach.
man/log_TOST.Rd	Updates seealso links (adds `boot_ses_test`).
man/hodges_lehmann.Rd	Clarifies permutation limitations for equivalence/MET and improves narrative.
man/dot-fisher_z_se.Rd	Documents internal Fisher-z SE helper.
man/corsum_test.Rd	Updates stderr/estimate semantics and method details.
man/brunner_munzel.Rd	Documents new `scale` argument and permutation p-value options.
man/boot_t_test.Rd	Documents `tr` and `boot_ci = "bca"` and CI-consistent p-values.
man/boot_t_TOST.Rd	Documents `boot_ci = "bca"` and CI-consistent p-values.
man/boot_smd_calc.Rd	Documents denom/output/hypothesis-testing/trimming + BCa CI option.
man/boot_ses_calc.Rd	Documents BCa CI support + “none” alternative and CI-consistent p-values.
man/boot_pvalue.Rd	Documents internal `boot_pvalue()` dispatcher.
man/boot_log_TOST.Rd	Documents BCa CI option and CI-consistent p-values.
man/boot_cor_test.Rd	Documents new CI methods and how p-values match the CI method.
man/bca_params.Rd	Documents internal BCa parameter helper.
man/bca_ci.Rd	Documents internal BCa CI helper.
junk/debug_direction2.R	Adds local debugging script (non-production).
junk/debug_direction.R	Adds local debugging script (non-production).
junk/check_installed.R	Adds local debugging script (non-production).
_pkgdown.yml	Reorganizes reference sections; adds new helpers and `boot_ses_test`.
R/wilcox_TOST.R	Adds `se_method` argument and routes through `ses_calc()` accordingly.
R/trans_rank_prob.R	Adds new exported transformer + internal label helpers.
R/simple_htest.R	Adds sample-size reporting; fixes missing `$estimate` in some paths; improves labeling.
R/rank_diff.R	Adds new exported Kornbrot rank-difference transformer.
R/perm_t_test.R	Expands labeling, adds sample-size to output, and relabels formula outputs.
R/others.R	Adds internal BCa CI helpers (`bca_ci`, `bca_params`).
R/methods.TOSTt.R	Prints note for BCa CI usage.
R/htest_labels.R	Adds internal label builders for `htest`-style estimates and SMD notation.
R/htest_helpers.R	Adds internal relabel helper for formula interfaces.
R/globals.R	Adds globals for new symbols used in the package.
R/corsum_test.R	Updates SE formulas (incl. Bonett–Wright), stderr structure, and method string.
R/corr_calcs.R	Extends CI computation to accept SE overrides; adds studentized CI and bootstrap p-value dispatch.
R/cor_test.R	Adds `se_method` argument and jackknife SE support; updates stderr structure.
R/boot_t_TOST.R	Adds BCa CI option + CI-consistent p-values; adds jackknife plumbing.
R/boot_log_TOST.R	Adds BCa CI option + CI-consistent p-values; adds jackknife plumbing.
NEWS.md	Documents new helpers, BCa CI support, labeling changes, and correlation SE improvements.
NAMESPACE	Exports new functions and registers S3 methods; imports `stats::dlogis`.
DESCRIPTION	Updates `RoxygenNote`.
CLAUDE.md	Adds contributor workflow and code-style guidance.
.github/workflows/test-coverage.yaml	Sets `NOT_CRAN` env for CI runs.
.github/workflows/R-CMD-check.yaml	Sets `NOT_CRAN` env for CI runs.
.Rbuildignore	Ignores `.claude` artifacts.

You can also share your feedback on Copilot code review. Take the survey.

R/perm_t_test.R

R/simple_htest.R

tests/testthat/test-boot_t_test_variance_fix.R

.Rbuildignore

R/corr_calcs.R

R/others.R

man/rank_diff.Rd

vignettes/hypothesis_testing.R

vignettes/SMD_calcs.Rmd

man/hodges_lehmann.Rd

Copilot

Pull request overview

Updates TOSTER’s standardized effect size (SES/SMD) and robust testing ecosystem by expanding bootstrap/CI methods (incl. BCa), adding rank/probability-scale helpers, and aligning documentation/vignettes/tests with the new APIs.

Changes:

Added new effect-size utilities (rank_diff(), trans_rank_prob()) and expanded robust reporting/labeling across multiple htest producers.
Added BCa and studentized CI support to multiple bootstrap functions and introduced shared bootstrap p-value dispatch (boot_pvalue()).
Updated man pages, vignettes, pkgdown reference structure, and significantly expanded/regressed test coverage for new/changed behaviors.

Reviewed changes

Copilot reviewed 104 out of 119 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
vignettes/SMD_calcs.Rmd	Adds citation linking and new robust SMD trimming section + updated SMD guidance/examples
vignettes/SMD_calcs.R	Syncs vignette extracted R code with new examples (bootstrap + trimming)
vignettes/IntroductionToTOSTER.Rmd	Enables link-citations for better reference rendering
vignettes/IntroductionToTOSTER.R	Updates chunk header formatting in extracted vignette code
vignettes/IntroTOSTt.Rmd	Enables link-citations
tests/testthat/test-wilcox.R	Updates hush sink target and reorganizes/extends SES/Wilcoxon bootstrap tests
tests/testthat/test-twoprop.R	Updates hush sink target and splits loop test into its own block
tests/testthat/test-trans_rank_prob.R	Adds unit tests for new probability-scale transformation helper
tests/testthat/test-tTOST.R	Adjusts SMD expectations to new output defaults and adds bootstrap consistency tests
tests/testthat/test-smd-se.R	Updates `smd_calc()` calls to request legacy data.frame output explicitly
tests/testthat/test-rank_diff.R	Adds tests for new Kornbrot rank-difference transformation
tests/testthat/test-powerraw.R	Updates hush sink target
tests/testthat/test-power_correlation.R	Updates hush sink target
tests/testthat/test-power_consistent.R	Updates hush sink target
tests/testthat/test-old_errors.R	Updates hush sink target
tests/testthat/test-log.R	Adds CRAN skips and CI/p-value agreement tests for `boot_log_TOST()`
tests/testthat/test-known_results.R	Updates hush sink target
tests/testthat/test-htest.R	Adjusts expectations for changed estimate labeling/structure and adds regression test for missing estimates
tests/testthat/test-ftests.R	Updates hush sink target
tests/testthat/test-data_summary_equivalent.R	Updates hush sink target
tests/testthat/test-compare_smds.R	Updates hush sink target
tests/testthat/test-brunner_munzel_scale.R	Adds tests for new `brunner_munzel(scale=...)` behavior
tests/testthat/test-brunner_munzel_perm_pval.R	Adds regression tests for paired permutation p-value computation and `p_method`
tests/testthat/test-brunner_munzel_degenerate.R	Adds regression tests for degenerate/edge case handling in Brunner–Munzel paths
tests/testthat/test-boot_t_test_variance_fix.R	Adds regression tests around bootstrap variance/p-value coherence fixes
tests/testthat/test-boot_t_test_trimmed.R	Adds comprehensive tests for trimmed-means support in `boot_t_test()`
tests/testthat/test-bootTOST.R	Adds CRAN skips and CI/p-value agreement tests for `boot_t_TOST()`
man/z_cor_test.Rd	Documents new `se_method` and revised stderr/estimate naming for correlation tests
man/wilcox_TOST.Rd	Documents new `se_method` argument for Wilcoxon TOST wrapper
man/trans_rank_prob.Rd	Adds documentation for new `trans_rank_prob()` helper
man/stud_ci.Rd	Documents new internal correlation studentized CI helper
man/smd_calc.Rd	Documents expanded SMD API (denom/output/testing/trimming)
man/ses_calc.Rd	Documents expanded SES API (score SE method, correction flag, boundary handling)
man/rank_diff.Rd	Adds documentation for new `rank_diff()` helper
man/perm_t_test.Rd	Documents permutation approach for equivalence/minimal effect testing
man/log_TOST.Rd	Adds cross-links to new robust SES bootstrap test helper
man/hodges_lehmann.Rd	Clarifies permutation limitations for equivalence/MET and updates method details
man/dot-fisher_z_se.Rd	Documents internal Fisher-z SE helper
man/corsum_test.Rd	Updates SE method description and stderr/estimate naming
man/brunner_munzel.Rd	Documents new `scale` argument and permutation p-value notes
man/boot_t_test.Rd	Documents trimming + BCa + CI-consistent p-values
man/boot_t_TOST.Rd	Documents BCa + CI-consistent p-values
man/boot_smd_calc.Rd	Documents expanded bootstrap SMD API (denom/output/testing/trimming/BCa)
man/boot_ses_calc.Rd	Documents expanded bootstrap SES API (BCa + optional testing)
man/boot_pvalue.Rd	Documents new internal p-value dispatch helper
man/boot_log_TOST.Rd	Documents BCa + CI-consistent p-values
man/boot_cor_test.Rd	Documents BCa/stud CI options and CI-inverted p-values
man/bca_params.Rd	Documents internal BCa parameter extraction helper
man/bca_ci.Rd	Documents internal BCa CI helper
junk/debug_direction2.R	Adds local debugging script (non-package logic)
junk/debug_direction.R	Adds local debugging script (non-package logic)
junk/check_installed.R	Adds local debugging script (non-package logic)
_pkgdown.yml	Reorganizes reference sections and adds new helpers to docs
R/wilcox_TOST.R	Adds `se_method` plumbed through to SES effect size computation
R/trans_rank_prob.R	Adds new exported transformer + shared label helpers
R/simple_htest.R	Adds sample size info, restores missing estimates for certain alts, and improves estimate labeling
R/rank_diff.R	Adds new exported Kornbrot rank-difference transformation
R/perm_t_test.R	Improves estimate labeling, adds sample size, and expands docs/comments (incl. formula relabeling)
R/others.R	Adds BCa CI + BCa parameter helpers
R/methods.TOSTt.R	Prints note for BCa CI method
R/htest_labels.R	Adds shared internal label helpers for estimates and SMD notation
R/htest_helpers.R	Adds internal relabel helper for formula interfaces
R/globals.R	Adds globals to satisfy checks for new symbols used in NSE contexts
R/corsum_test.R	Updates SE logic (Bonett–Wright for Spearman) and stderr structure
R/corr_calcs.R	Adds `stud_ci()`, `boot_pvalue()`, and Fisher-z SE helper; enhances CI SE override capability
R/cor_test.R	Adds `se_method` (analytic/jackknife) and updates SE + CI construction
R/boot_t_TOST.R	Adds BCa support; replaces p-values with CI-consistent p-values via `boot_pvalue()`
R/boot_log_TOST.R	Adds BCa support; replaces p-values with CI-consistent p-values via `boot_pvalue()`
NEWS.md	Documents new functions and major API/behavior changes (BCa, labels, correlation SE changes)
NAMESPACE	Exports new helpers and registers new S3 methods
DESCRIPTION	Updates `RoxygenNote`
CLAUDE.md	Adds contributor workflow/style guidance (repo-local edits, junk/, headings)
.github/workflows/test-coverage.yaml	Sets `NOT_CRAN=true` in CI env for conditional test behavior
.github/workflows/R-CMD-check.yaml	Sets `NOT_CRAN=true` in CI env for conditional test behavior
.Rbuildignore	Adds ignore for `^.claude$`

You can also share your feedback on Copilot code review. Take the survey.

R/perm_t_test.R

tests/testthat/test-wilcox.R

junk/debug_direction.R

R/htest_helpers.R

vignettes/SMD_calcs.Rmd

R/boot_t_TOST.R

R/corr_calcs.R

arcaldwell49 added 30 commits January 30, 2026 10:36

Add perm_ses_test

8e9fbe8

Allow for permutation tests directly on the SES scale.

Update Claude settings

c639906

Needed to improve workflow when working back and forth together on code improvements.

Added smd_calc htest support

7f39c18

Update vignette

0c29f37

Fix issue with Hodges-Lehmann

9c11861

Fix hodges_lehmann docs

188bf6a

Added score method

4479799

Added more stable method for small samples with the two-sample case.

update perm_ses_test

e50ebae

Delete test_debug.R

0a7eb91

Add "auto" for se_method

c209d65

added default behavior for SE calculations for non-parametric SES calculations.

Update boot_ses_calc.R

a7fd605

Update SE method

Add BCa bootstrap

9a6d592

Finally got BCa some people have been asking for! Not the default and computationally heavy but it is there if people want to use it.

Bug in bootsrap SMD/SES calc

00da234

Needed to studentized test. Changed now and fixed. Checked locally with other tests.

Add tests for simple_htest

7aed44e

Need some fixes to compare wilcox.test to hodges_lehmann

Update robustTOST.Rmd

f0c56d8

Need to add score method to the SES calculation

Updated tests

d4b7a92

want to make it run faster on CRAN and not overload the system

Add trimming for bootstrapping

488d892

Added denom argument

b629f89

smd_calc functions now allow direct specification of the denominator to use for SMD calculations.

Update NEWS.md

fb253c1

Update estimate outputs

749f8e3

Mainly, two-sample tests now include mean difference in output

Add trimming for SMD

df05bbb

knitted vignette

04c4e4f

Add htest vignette

60622a2

Small edits

4dd53f9

Fixed brunner munzel paired issues

029009a

Error where paired didn't implement plusone properly for randomization test. Also added reference document for future calibration for TOST

update documentation

d5a2287

Fix degenerate BM test cases

929dbda

Essentially avoids edge case errors. Also made slight edits to SMD_calcs vignette and added some key references.

Small change to SMD vignette

a25ce1a

Add new labels for BM/SES

e5ed162

Adding new labels for rank probability estimates. Also, updating labels so output is easier to understand. Need to update labeling for simple_htest, perm_t_test, boot_t_test, etc to follow similar conventions.

Update labeling

1cabe59

Note, similar implementation still needed for other htest functions

arcaldwell49 added 22 commits February 19, 2026 09:40

fix errors

223d670

Add Kornbot rank difference

2c95797

minor change to labels

ac660f8

small change to documentation

d6a18d9

Update SMD calc notation

f3976af

Streamlined and easier (?) to understand now.

Update correlation method

6bd977a

still working out kinks. need to update NEWS to reflect incoming changes.

Update NEWS.md

fc31a87

reflect correlation test changes.

Merge branch '112-update-ses' of https://github.com/Lakens/TOSTER int…

8acacb1

…o 112-update-ses

Added p-value method for bootstrap correlations

4fe7f53

Need to fix error for boot_ses_calc

Fix error in example

d11c42a

Introduced error in description. This should fix it to avoid issue in example.

Update cohend_calcs.R

c60b15f

noticed var.equal got ignored.

Update label

1f74d8d

Fixing bootstrap error

1826f93

Bootstrap t-test has error due to trimming

Update for boot p-value match CI

1958b66

Update hypothesis_testing.Rmd

fd53706

Update hypothesis_testing.Rmd

148b076

Need to fix tests for output checks on string. Also, should probably update labels for cor_test/corsum_test/boot_cor_test.

Fixed label errors

ea2e587

Merge branch '112-update-ses' of https://github.com/Lakens/TOSTER int…

59531ba

…o 112-update-ses

fixed hodges_lehmann permutation.

0e8d8d8

Started boot_ses_test

aadb856

Very broken at the moment. Need to fix later.

Added parametric bootstrap for ses

dd92ca4

Updated vignettes and documentation

85e0116

Copilot AI review requested due to automatic review settings March 13, 2026 20:07

arcaldwell49 linked an issue Mar 13, 2026 that may be closed by this pull request

Update SES calculations, add Hodges-Lehmann test, and documentation on Robust tests #112

Closed

Copilot AI reviewed Mar 13, 2026

View reviewed changes

fixes to errors

b70e312

arcaldwell49 requested a review from Copilot March 16, 2026 19:13

Copilot AI reviewed Mar 16, 2026

View reviewed changes

small fixes

23c6892

arcaldwell49 merged commit da58845 into master Mar 16, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

112 update ses#114

112 update ses#114
arcaldwell49 merged 60 commits intomasterfrom
112-update-ses

arcaldwell49 commented Mar 13, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

arcaldwell49 commented Mar 13, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants