[Metadata Improvement]: Evaluate ChatGPT performance on measurementTechnique Extraction #132

gtsueng · 2024-04-11T15:36:59Z

Issue Name

Evaluate ChatGPT performance on measurementTechnique Extraction

Issue Description

It would be good to be able to evaluate how well ChatGPT extracts measurementTechniques based on Dataset names and descriptions. While the performance may vary based on data type and repository, it would be good to at least evaluate some datasets that already have curated values for measurementTechniques to see how well the results overlap.

Approach:

Identify records which have measurementTechnique values from the following repositories (@DylanWelzel since you've already done this, can you send @ZubairQazi the list of record ids?
- NCBI GEO
- LINCS
- REFRAMEDB
Randomly select 25 records from the measurementTechnique-containing subset of each of the above repositories
Run the ChatGPT measurementTechnique prompt (providing only the name and description) for each of the 75 records (25 per repo)
Confirm presence/absence of the measurementTechnique values for each record in the predictions by ChatGPT

Issue Discussion

No response

Please select the type of metadata improvement

Standardization (normalizing free text to an ontology)
Augmentation (adding values for metadata fields missing values)
Clean up (addressing redundancy or messy metadata)
Structure (changing the structuring of the metadata to support front end UI features)

Meta URL

No response

Related WBS task

https://github.com/NIAID-Data-Ecosystem/nde-roadmap/issues/13

For internal use only. Assignee, please select the status of this issue

Not yet started
In progress
Blocked
Will not address

Status Description

No response

Request status check list

This metadata improvement has yet to be discussed between NIAID, Scripps, Leidos
This metadata improvement does not need to be discussed between NIAID, Scripps, Leidos
This metadata improvement has been discussed/reported between NIAID, Scripps, Leidos
This metadata improvement has been implemented locally to generate data for review
This metadata improvement has been implemented on Dev
This metadata improvement has been implemented on Dev and the results have been reviewed and approved for staging
This metadata improvement has been implemented on Staging
This page/documentation/change has been approved for Production
This page/documentation/change has been implemented on Production

ZubairQazi · 2024-06-05T17:22:10Z

ChatGPT predictions using the measurement technique prompt:
https://docs.google.com/spreadsheets/d/1jkhidFmsp0f_yL8S5wpZ-oBA-eLhQQmESQEq4Lrhx3M/edit#gid=1310822148

gtsueng added the enhancement New feature or request label Apr 11, 2024

gtsueng assigned ZubairQazi and DylanWelzel Apr 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Metadata Improvement]: Evaluate ChatGPT performance on measurementTechnique Extraction #132

[Metadata Improvement]: Evaluate ChatGPT performance on measurementTechnique Extraction #132

gtsueng commented Apr 11, 2024

ZubairQazi commented Jun 5, 2024

[Metadata Improvement]: Evaluate ChatGPT performance on measurementTechnique Extraction #132

[Metadata Improvement]: Evaluate ChatGPT performance on measurementTechnique Extraction #132

Comments

gtsueng commented Apr 11, 2024

Issue Name

Issue Description

Issue Discussion

Please select the type of metadata improvement

Meta URL

Related WBS task

For internal use only. Assignee, please select the status of this issue

Status Description

Request status check list

ZubairQazi commented Jun 5, 2024