Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get more bills #3

Open
dmytro-ustynov opened this issue Apr 7, 2022 · 2 comments
Open

Get more bills #3

dmytro-ustynov opened this issue Apr 7, 2022 · 2 comments
Assignees
Labels
help wanted Extra attention is needed question Further information is requested

Comments

@dmytro-ustynov
Copy link
Contributor

In repository provided on start there were several files in xml format such as samples/congress/116/uslm/BILLS-116hconres9enr.xml etc.
And there were a pasing script to get sections from each bill.
But looks like that this script doesn't work with the other bills from the set we download via congress tool.
So the main question is:
How (where) can I get more, preferrably the whole set of bills that i can split to sections for further work?
May be (that's just my suggestion) we should transform the parsing script so it would parse that set?
Or there is some step of transformation that i still haven't found yet, isn't it?

Anyway the main point is to get more bills to get get more sections from them.

@dmytro-ustynov dmytro-ustynov added help wanted Extra attention is needed question Further information is requested labels Apr 7, 2022
@dmytro-ustynov dmytro-ustynov self-assigned this Apr 7, 2022
@dmytro-ustynov
Copy link
Contributor Author

@aih , what do yo think?

@aih
Copy link
Collaborator

aih commented Apr 7, 2022

The parser works with uslm, which requires namespaces in the XPath for lxml. To use the downloaded files directly, comment that out and use XPath without namespaces. Let me know if you try that with a file and you have trouble.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants