Null check/legacy code in DataFile.java #7849

landreev · 2021-05-04T21:24:09Z

TL;DR:
there's code in DataFile.java that checks if the file is harvested, based on whether the storageidentifier starts with http(s*)://, without a null check. To be safe, this needs to be addressed before #7741 makes it into a release.

The following code is in DataFile.java:

public boolean isHarvested() {
        
        // (storageIdentifier is not nullable - so no need to check for null
        // pointers below):
        if (this.getStorageIdentifier().startsWith("http://") || this.getStorageIdentifier().startsWith("https://")) {
            return true;
        }
        
        Dataset ownerDataset = this.getOwner();
        if (ownerDataset != null) {
            return ownerDataset.isHarvested(); 
        }
        return false; 
    }

The lines 3-7 are wrong on many levels. "storageidentifier is not nullable" is NOT true. (this is very old legacy code, left over from the time where the storageidentifier field lived in DataFile (and not in DvObject). The absence of the null check may become a problem in a legacy database fixed by the script in #7741. (Although there is a decent chance that our, Harvard prod. db is the only one affected).
Ideally, we should not rely on looking at the storageidentifier at all, in order to determine whether a file is harvested or not. (The code in the second half of the method above does the proper test - a harvested file is a file in a dataset that is harvested...).

The text was updated successfully, but these errors were encountered:

fix for the null pointer in isHarvested() method. (#7849)

djbrooke added this to IQSS Team - In Progress 💻 in IQSS/dataverse (TO BE RETIRED / DELETED in favor of project 34) May 12, 2021

landreev self-assigned this May 12, 2021

landreev added a commit that referenced this issue May 12, 2021

removed the check-by-url from the isHarvested() method. (#7849)

a6caf8e

landreev mentioned this issue May 12, 2021

fix for the null pointer in isHarvested() method. (#7849) #7868

Merged

landreev added a commit that referenced this issue May 12, 2021

a slightly better/safer check in another method. (#7849)

aa010f7

djbrooke removed this from IQSS Team - In Progress 💻 in IQSS/dataverse (TO BE RETIRED / DELETED in favor of project 34) May 13, 2021

kcondon closed this as completed in #7868 May 13, 2021

kcondon added a commit that referenced this issue May 13, 2021

Merge pull request #7868 from IQSS/7849-null-check-fix

80251f9

fix for the null pointer in isHarvested() method. (#7849)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Null check/legacy code in DataFile.java #7849

Null check/legacy code in DataFile.java #7849

landreev commented May 4, 2021

Null check/legacy code in DataFile.java #7849

Null check/legacy code in DataFile.java #7849

Comments

landreev commented May 4, 2021