Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a Word Model to generate Taxonomy for catalog.data.gov categories #4088

Closed
3 tasks
nickumia-reisys opened this issue Nov 29, 2022 · 0 comments
Closed
3 tasks
Labels
component/catalog Related to catalog component playbooks/roles

Comments

@nickumia-reisys
Copy link
Contributor

User Story

In order to make data more discoverable on catalog.data.gov, the Data.gov NLP team wants to create model from which a word taxonomy can be created.

Acceptance Criteria

[ACs should be clearly demoable/verifiable whenever possible. Try specifying them using BDD.]

  • GIVEN [a contextual precondition]
    [AND optionally another precondition]
    WHEN [a triggering event] happens
    THEN [a verifiable outcome]
    [AND optionally another verifiable outcome]

Background

The theory behind this issue stems from extracting meaningful connections from our data to describe our data. We have partnerships with various federal/state/local governments/agencies. These organizations were created for a purpose and their mission drives the data that they create. If a tag such as water appears on a dataset within department of energy and another dataset from department of health, a different sense of the word can be understood and used to inform contextual information for further processing.

Security Considerations (required)

[Any security concerns that might be implicated in the change. "None" is OK, just be explicit here!]

Sketch

[Notes or a checklist reflecting our understanding of the selected approach]

  • Create a word model highlighting frequency of tags per organization
  • Determine if tags can be correlated with organizations to give rise to categories based on organizational missions.
@nickumia-reisys nickumia-reisys changed the title Create a Taxonomy for catalog.data.gov categories Create a Word Model to generate Taxonomy for catalog.data.gov categories Dec 1, 2022
@hkdctol hkdctol added the component/catalog Related to catalog component playbooks/roles label Dec 1, 2022
@hkdctol hkdctol closed this as completed Mar 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/catalog Related to catalog component playbooks/roles
Projects
Status: 🗄 Closed
Development

No branches or pull requests

2 participants