After harvesting datasets we are finding differences between client and server. This mainly consider dates.
When I check api by calling /api/datasets/export?exporter=dataverse_json&persistentId=doi%3A10.57903/UJ/RH0J5L
I can see proper data in fields like lastUpdateTime and releaseTime.
However, the Dataverse UI shows the dates related to the harvesting moment.
Those wrong dates I can find in db by SELECT * FROM dvobject WHERE dtype = 'Dataset' AND authority = '10.xxx' AND identifier = 'xxx'
What is more sometimes even the title is wrong but with the same pattern: dataverse_json show correct values, but wrong one are stored in db and displayed to user.
This is confusing and should not happen. Problem is not related to dataset versions.
- When does this issue occur?
This is a bug related to harvesting and was noticed in the context of dates displayed to Harvester users.
When trying to fix by deleting harvesting client/server and reharvesting data from Dataverse collection - all datasets received one date on harvester: current date (as mentioned above: misleading dates are in dvobject as lastUpdateTime and releaseTime - when I correct them manually in db, then they are correctly displayed for end user)
Harvested data are incorrect.
- To whom does it occur (all users, curators, superusers)?
all
- What did you expect to happen?
harvesting client and server should display same data for users.
Which version of Dataverse are you using?
6.7.1 at this moment.
Screenshots:
dataverse_json:
displayed on harvester page:

After harvesting datasets we are finding differences between client and server. This mainly consider dates.
When I check api by calling /api/datasets/export?exporter=dataverse_json&persistentId=doi%3A10.57903/UJ/RH0J5L
I can see proper data in fields like lastUpdateTime and releaseTime.
However, the Dataverse UI shows the dates related to the harvesting moment.
Those wrong dates I can find in db by
SELECT * FROM dvobject WHERE dtype = 'Dataset' AND authority = '10.xxx' AND identifier = 'xxx'What is more sometimes even the title is wrong but with the same pattern: dataverse_json show correct values, but wrong one are stored in db and displayed to user.
This is confusing and should not happen. Problem is not related to dataset versions.
This is a bug related to harvesting and was noticed in the context of dates displayed to Harvester users.
When trying to fix by deleting harvesting client/server and reharvesting data from Dataverse collection - all datasets received one date on harvester: current date (as mentioned above: misleading dates are in dvobject as lastUpdateTime and releaseTime - when I correct them manually in db, then they are correctly displayed for end user)
Which page(s) does it occurs on?
Harvester pages visible for users.
What happens?
Harvested data are incorrect.
all
harvesting client and server should display same data for users.
Which version of Dataverse are you using?
6.7.1 at this moment.
Screenshots:
dataverse_json:
displayed on harvester page:
