Is EDGAR Bulk API of any use? #257
Comments
Yes, this would be useful to add. I wrote some comments on this in #227. Would you be willing to help make this part of the package? I am happy to walk you through the process.
Well, some structured approach to getting this data would be nice. I haven't found a list of fields that Even if
Downloading requires setting a User-Agent header:
curl -O https://www.sec.gov/Archives/edgar/daily-index/xbrl/companyfacts.zip --user-agent "No company <private@email.com>"
curl -O https://www.sec.gov/Archives/edgar/daily-index/bulkdata/submissions.zip --user-agent "No company <private@email.com>"
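For anyone who would rather script this than call curl, here is a rough Python equivalent using only the standard library. The URLs and the User-Agent string are the ones from the curl commands above. The helper names `build_request` and `download` are made up for this sketch and are not part of any package.

```python
import shutil
import urllib.request

COMPANYFACTS_URL = "https://www.sec.gov/Archives/edgar/daily-index/xbrl/companyfacts.zip"
SUBMISSIONS_URL = "https://www.sec.gov/Archives/edgar/daily-index/bulkdata/submissions.zip"


def build_request(url, user_agent):
    """Create a request that carries an explicit User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def download(url, destination, user_agent):
    """Stream the response body to a local file without loading it into memory."""
    request = build_request(url, user_agent)
    with urllib.request.urlopen(request) as response, open(destination, "wb") as out:
        shutil.copyfileobj(response, out)


# Example usage (performs real network requests, so it is left commented out):
# agent = "No company <private@email.com>"
# download(COMPANYFACTS_URL, "companyfacts.zip", agent)
# download(SUBMISSIONS_URL, "submissions.zip", agent)
```

The archives are large, which is why the sketch streams to disk instead of reading the whole response at once.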
84% 484097 - CIK0001561746.json: 18 hours and still processing.
Using
If you are on Linux or UNIX, you can use tree.
The file layout is pretty simple: it is 781211 JSON files in a single directory. The diagrams I need are about the JSON structure. I have no idea how to find which fields, if any, contain executive compensation from DEF 14A. Maybe there are no such fields at all.
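One way to get a picture of the JSON structure without unpacking the whole archive is to read a single member straight from the zip and list its key paths. The sketch below does that with the standard library. The function names `key_outline` and `outline_member` are hypothetical, and it makes no claim about which keys the SEC files actually contain; it only prints whatever is there.

```python
import json
import zipfile


def key_outline(value, max_depth=3, prefix=""):
    """Collect dotted key paths from a decoded JSON value, down to max_depth levels."""
    paths = set()
    if max_depth == 0:
        return paths
    if isinstance(value, dict):
        for key, child in value.items():
            path = f"{prefix}.{key}" if prefix else key
            paths.add(path)
            paths |= key_outline(child, max_depth - 1, path)
    elif isinstance(value, list) and value:
        # Sample only the first element; lists are assumed to be homogeneous.
        paths |= key_outline(value[0], max_depth, prefix + "[]")
    return paths


def outline_member(zip_source, member_name, max_depth=3):
    """Read one JSON member directly from the archive and return its sorted key paths."""
    with zipfile.ZipFile(zip_source) as archive:
        with archive.open(member_name) as handle:
            return sorted(key_outline(json.load(handle), max_depth))


# Example usage, assuming companyfacts.zip is in the current directory:
# for path in outline_member("companyfacts.zip", "CIK0001561746.json"):
#     print(path)
```

Running this on a handful of files and comparing the outlines should show fairly quickly whether anything resembling compensation data is present.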
I am currently trying to use the bulk downloads to retrieve SEC Form 4s. Each JSON file appears to list the accession numbers and form types of every filing for a given CIK, but I haven't figured out how to turn those into the URL of the actual filing. Does anyone know the URL structure?
No idea. I've got a strong feeling that the SEC is doing its job so poorly on purpose. If democracy works, they should just hire the maintainers and contributors of this repo to make things right for people.
Here is an example of the URL for a Nike Form 4: https://www.sec.gov/Archives/edgar/data/320187/000112760223015552/0001127602-23-015552.txt You have the CIK (stripped of leading zeros), then the accession number (stripped of hyphens), then the accession number with .txt at the end.
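The rule above is easy to put into a small helper. This is only a sketch: `filing_txt_url` and `accessions_for_form` are names invented here, and the second function assumes the submissions JSON keeps parallel `accessionNumber` and `form` arrays under `filings.recent`, so check that against your own files before relying on it.

```python
def filing_txt_url(cik, accession_number):
    """Build the full-text filing URL from a CIK and a hyphenated accession number."""
    cik_part = str(int(cik))  # int() drops any leading zeros
    accession_plain = accession_number.replace("-", "")
    return (
        "https://www.sec.gov/Archives/edgar/data/"
        f"{cik_part}/{accession_plain}/{accession_number}.txt"
    )


def accessions_for_form(submission, form_type):
    """Return accession numbers whose form type matches form_type.

    Assumes submission["filings"]["recent"] holds parallel lists named
    "accessionNumber" and "form" (an assumption about the JSON layout).
    """
    recent = submission["filings"]["recent"]
    return [
        accession
        for accession, form in zip(recent["accessionNumber"], recent["form"])
        if form == form_type
    ]


# Example usage with the Nike filing mentioned above:
# filing_txt_url("0000320187", "0001127602-23-015552")
```

Passing the zero-padded CIK from the file name (without the `CIK` prefix) works too, since the leading zeros are stripped inside the function.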
Is the bulk data download from https://www.sec.gov/edgar/sec-api-documentation useful? It looks more accessible than dealing with the imposed API restrictions.
I am specifically interested in executive compensation from DEF 14A, but I have no idea what it takes to extract it for all companies.