Skip to content

TheBoatyMcBoatFace/allusgov

 
 

Repository files navigation

All US Federal Government

This project attempts to map the organization of the US Federal Government by gathering and consolidating information from various directories.

Current sources:

Each source is scraped (see out directory) into a JSON format, including fields for the organizational unit name and parent name (if any).

To merge the lists, the organizational hierarchy "path" is generated, by following the parent fields. These "paths" are then fuzzy matched and (if a threshold is met) merged into a single entry.

Note that the fuzzy matching is imperfect and may have some inaccurate mappings (although most appear OK) and will certainly have some entries which actually should be merged, but aren't.

The final merged dataset is written in JSON and flattened CSV format.

About

This project attempts to map the organization of the US Federal Government by gathering and consolidating information from various directories.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%