Big int cohort id fix #163

azimov · 2024-04-03T19:49:26Z

Resolves #162

This reverts commit de27d57.

…ntegers

# Conflicts: # DESCRIPTION

schuemie · 2024-04-08T05:06:09Z

R/Analyses.R

  checkmate::assertLogical(outcomeOfInterest, add = errorMessages)
  checkmate::assertNumeric(trueEffectSize, len = 1, null.ok = TRUE, add = errorMessages)
  checkmate::assertInt(riskWindowStart, null.ok = TRUE, add = errorMessages)
  checkmate::assertInt(riskWindowEnd, null.ok = TRUE, add = errorMessages)
  checkmate::reportAssertions(collection = errorMessages)
  if (!is.null(startAnchor) && !grepl("start$|end$", startAnchor, ignore.case = TRUE)) {
-    stop("startAnchor should have value 'cohort start' or 'cohort end'")
+    stop("startAnchor should have value \'cohort start\' or \'cohort end\'")


Could you help me understand why you're escaping single quotes? It is not needed when the string uses double quotes.
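
For illustration, a minimal base R check of the point above. The variable names are only for this sketch; the two literals are copied from the diff.

```r
# In a double-quoted R string, \' is simply an escaped single quote,
# so both literals below denote the same sequence of characters.
plain <- "startAnchor should have value 'cohort start' or 'cohort end'"
escaped <- "startAnchor should have value \'cohort start\' or \'cohort end\'"

identical(plain, escaped)
nchar(plain) == nchar(escaped)
```

Both expressions should evaluate to `TRUE`, which is why the backslashes only add noise.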

schuemie · 2024-04-08T05:07:05Z

R/RunAnalyses.R

    createCmDataTask <- function(i) {
-      refRow <- subset[i, ]
+      refRow <- subset[i,]


Per our code guidelines, a comma should be followed by a space. Why remove it here?

schuemie · 2024-04-08T05:09:40Z

R/RunAnalyses.R

-      )
-      return(task)
+
+      if (file.exists(file.path(outputFolder, refRow$psFile))) {


Why the condition? Is there something wrong with the code for generating the reference table that will specify strataFiles to be created without there being psFiles? Maybe best to fix it there?

schuemie · 2024-04-08T05:14:06Z

R/RunAnalyses.R

    studyPop <- readRDS(file.path(outputFolder, refRow$studyPopFile))
    studyPop <- addPsToStudyPopulation(studyPop, ps)
-    saveRDS(studyPop, file.path(outputFolder, refRow$psFile))
+    tryCatch({


Is the error message thrown by saveRDS not informative enough? If so, maybe we should have a generic wrapper for saveRDS in CohortMethod that we always use?
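
A rough sketch of what such a generic wrapper might look like, assuming the goal is only to add the file name to the error message. `saveRdsWithContext` is a hypothetical name and is not part of CohortMethod.

```r
# Hypothetical helper (not an existing CohortMethod function):
# wraps saveRDS() so that a failure reports which file could not be written.
saveRdsWithContext <- function(object, fileName) {
  tryCatch(
    saveRDS(object, fileName),
    error = function(err) {
      stop(
        sprintf("Failed to save '%s': %s", fileName, conditionMessage(err)),
        call. = FALSE
      )
    }
  )
  invisible(fileName)
}

# Usage: write to a temporary file and read it back.
exampleFile <- tempfile(fileext = ".rds")
saveRdsWithContext(data.frame(rowId = 1:3), exampleFile)
restored <- readRDS(exampleFile)
```

Because the `tryCatch` covers only the `saveRDS()` call, any error it reports really is about saving that file.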

schuemie · 2024-04-08T05:16:21Z

R/RunAnalyses.R

-                                   params$psFile))
-  ps <- applyTrimMatchStratify(ps, params$args)
-  saveRDS(ps, params$strataFile)
+  tryCatch({


Yikes, this tryCatch block covers multiple function calls, but the error message will be about the saving of the strataFile.

schuemie · 2024-04-08T05:17:24Z

R/RunAnalyses.R

-    if (length(covariatesToExclude) != 0) {
-      covariates <- covariates %>%
-        filter(!.data$covariateId %in% covariatesToExclude)
+  tryCatch({


Double yikes! What is the point in wrapping such a large block of code in a try catch?

schuemie · 2024-04-08T05:18:16Z

R/RunAnalyses.R

+    attr(class(filteredCohortMethodData), "package") <- "CohortMethod"
+    saveCohortMethodData(filteredCohortMethodData, params$prefilteredCovariatesFile)
+  }, error = function(err) {
+    ParallelLogger::logError(err)


This code has no effect. Errors are automatically caught by ParallelLogger. Just rethrowing the error will achieve the same thing.
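
A small base R sketch of the rethrow behaviour, under the assumption that `message()` stands in for a logging call. It does not model ParallelLogger itself, and `logAndRethrow` and `failingStep` are hypothetical names.

```r
# Hypothetical wrapper: logs the error, then rethrows the same condition.
# message() is only a stand-in for a real logging call.
logAndRethrow <- function(expr) {
  tryCatch(expr, error = function(err) {
    message("Logged: ", conditionMessage(err))
    stop(err)
  })
}

# Hypothetical step that always fails.
failingStep <- function() stop("could not save file")

# The caller receives the same error message with or without the wrapper.
direct <- tryCatch(failingStep(), error = function(e) conditionMessage(e))
wrapped <- tryCatch(logAndRethrow(failingStep()), error = function(e) conditionMessage(e))
identical(direct, wrapped)
```

If an outer handler already logs uncaught errors, the inner handler adds nothing beyond a duplicate log entry.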

schuemie · 2024-04-08T05:20:15Z

I'm sorry, I can't accept this PR. It contains a lot of changes that have nothing to do with the issue at hand.

I'm open to rethinking how we deal with errors, but that should be a separate issue and a separate PR (but only after we agree on the approach).

Also, please adhere to the style guide (as also implemented by the styler package).

azimov · 2024-04-08T14:24:22Z

My apologies, I did a diff with the wrong branch and included the code I used to find issues. This now has been removed.

schuemie · 2024-04-09T14:20:54Z

Thanks! Not sure why we're failing check now. Seems to be some SQL issue. I'll merge and debug

schuemie · 2024-04-09T15:02:45Z

Ah, the problem is that SQLite does not allow changing the column type of an existing table. The good news is that SQLite INT is dynamic, and will contain BIGINT values.

Why do we have cohort IDs > 2^31?

azimov · 2024-04-09T16:48:21Z

> Why do we have cohort IDs > 2^31?

These are generally cohorts made from templates or subsets, where we do something like concept_id * 1000 as the cohort_definition_id. This happened when running this on the reward cohorts.
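
To make the arithmetic concrete, a minimal base R sketch. The concept ID is an illustrative value chosen only so that the product exceeds the 32-bit range, and `isWholeNumber` is a hypothetical helper, not an existing CohortMethod or checkmate function.

```r
# Hypothetical concept ID, chosen only so that the product exceeds 2^31 - 1.
conceptId <- 4000000L

# Integer arithmetic overflows to NA (with a warning) above .Machine$integer.max.
asInteger <- suppressWarnings(conceptId * 1000L)

# Doubles represent whole numbers exactly up to 2^53, so the ID survives as numeric.
asDouble <- as.numeric(conceptId) * 1000

# Hypothetical helper: accepts whole numbers even when they exceed the integer range.
isWholeNumber <- function(x) {
  is.numeric(x) && all(is.finite(x)) && all(x == floor(x))
}

is.na(asInteger)
asDouble > .Machine$integer.max
isWholeNumber(asDouble)
```

All three expressions should evaluate to `TRUE`: the ID is lost as a 32-bit integer but remains a valid whole number when stored as a double.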

schuemie · 2024-04-10T06:34:27Z

Ok, we'll probably need fixes throughout HADES then. I always assumed, since we're assigning cohort IDs ourselves, we would not exceed 2^31.

I think I fixed the unit tests now, by fixing the SqlRender translation of ALTER TABLE ALTER COLUMN.

I also removed the PostgreSQL-specific version of the migration script. We really shouldn't have platform-specific code in HADES packages other than SqlRender and DatabaseConnector.

azimov added 23 commits January 4, 2024 14:41

Revert "Revert "Fixes to support big int cohorts ids""

6181952

This reverts commit de27d57.

Description to denote fork

e8ba028

Change to support big int in outcomeIds

dfbbd0f

Error handling weird rds save issues

1ddfc9c

Description

b7257e0

Missing files

30e9e89

Error handling for tasks to continue

c8c10a6

Error handling for tasks to continue

09c47d8

Only summarize results for outcome files that exist to stop crash

9810835

Fix path in outcome file find

7555d98

Added migrations script

dfd29f4

Postgres specific migration

7ef04ee

Moved migration to correct path

648d11c

Moved migration to correct path

93378da

Missing columns

9d43b7c

Missing kw

7139e1e

Typo

b5cdb42

Typo

dacd00c

Typo

ce5b9a9

Updated spec to match ddl

8b8dc4d

checks to allow big integers that overflow Integerish but are still i…

8181fdf

…ntegers

Merge remote-tracking branch 'origin/develop' into big_int_cohort_id_fix

184a097

# Conflicts: # DESCRIPTION

Fixed type

1fbd8dd

schuemie reviewed Apr 8, 2024

Removed changes from testing branch

8feeca5

azimov added 2 commits April 8, 2024 07:27

Removed changes from testing branch

545c755

Whitespace

9675062

schuemie merged commit 8ef8bcb into develop Apr 9, 2024
2 of 10 checks passed
