Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Affiliation data format and parsers #367

Open
sbenthall opened this issue Feb 3, 2020 · 3 comments
Open

Affiliation data format and parsers #367

sbenthall opened this issue Feb 3, 2020 · 3 comments
Assignees
Milestone

Comments

@sbenthall
Copy link
Collaborator

  • Develop a portable data format for storing and transferring information about the affiliations between individuals and organizations
  • This will be a special case of a more general serializable data format that is both machine- and human- readable (e.g. YAML)
  • Deliver this format, and tooling for parsing and writing it in Python, as code in BigBang
@npdoty
Copy link
Collaborator

npdoty commented Mar 18, 2020

Is this the same as #352?

@npdoty
Copy link
Collaborator

npdoty commented Jul 19, 2021

What can we pull from the IETF Datatracker? Or from RFCs which list affiliations of authors/editors in the credits? Or from some other data source?

And then drop that into a dataframe or some basic format, so that it can be consumed by other code working on analysis of organizational influence/distribution within working groups.

@npdoty npdoty self-assigned this Jul 19, 2021
@sbenthall
Copy link
Collaborator Author

See #25 for discussion of this.

DataTracker attendance data is a nice source for affilation and nationality data for IETF members.

An actionable way to approach this might be:

  • Develop the affiliation data format and corresponding object, which tracks individuals, their email addresses (with times/duration), their affiliations (with times/duration), their nationalities (with times/duration) (where duration is inferred from time point data).
  • Have a script for population such an object from IETF Attendance data as an example of how to do it.
  • demonstrate how this can be used to plot affiliate interactions in a notebook as per @Christovis 's visualizatiosn from IAB-AID-1

Ultimately, such an object could wrap an ORM with an actual database behind it, and/or work with the entity resolution code to better resolve (its own) organization/affiliation references.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants