Update the pacer_url property on PACER documents to use show_case_doc URLs #774

mlissner · 2017-11-20T23:18:13Z

Two things here. Per @johnhawkinson's discovery in freelawproject/recap#214, we have a way of looking up a doc1 ID from a document number/attachment number/case number triplet.

We can use this to:

1. Update every document we currently have to add the doc1 ID. This will be a marvelous improvement for the historical data before the doc1's were prevalent.
2. Tweak the pacer_url property so that if we don't have the pacer_doc_id value, we can take the user directly to the document URL instead of taking them to the docket, as we do presently.

2 will be easy. 1 will take a bit of work, but should be fairly easy too.
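A rough sketch of what item 2 could look like, not the actual CourtListener code. The class name `DocumentSketch`, its field names, the doc1 URL shape, and especially the comma-separated query layout for show_case_doc are assumptions for illustration and should be checked against PACER before relying on them.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DocumentSketch:
    """Hypothetical stand-in for a PACER document record."""

    court_id: str                       # e.g. "cand" (assumed field name)
    pacer_case_id: str
    document_number: str
    attachment_number: Optional[int] = None
    pacer_doc_id: Optional[str] = None

    @property
    def pacer_url(self) -> str:
        base = f"https://ecf.{self.court_id}.uscourts.gov"
        if self.pacer_doc_id:
            # We know the doc1 ID, so link straight to the document.
            return f"{base}/doc1/{self.pacer_doc_id}"
        # No doc1 ID: fall back to show_case_doc instead of the docket.
        # ASSUMPTION: this query layout is illustrative, not verified.
        attachment = self.attachment_number or ""
        return (
            f"{base}/cgi-bin/show_case_doc?"
            f"{self.document_number},{self.pacer_case_id},,,{attachment}"
        )
```

With this shape, a record that has a pacer_doc_id keeps its direct doc1 link, and only records missing the ID take the show_case_doc route.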


PACER has a way of taking a document number and case number and getting a doc1 URL in response. This code lands an API to use that system as carefully as possible. freelawproject/courtlistener#774

pacer_doc_ids are available via a link we recently discovered called show_case_doc. The input for the link is the document number, attachment number, and case number, and in return it gives you the pacer_doc_id. The code in this commit sets us up to get about 3.5M of those IDs that we're currently missing. Partially addresses #774

mlissner · 2017-12-08T23:47:18Z

OK, there's a mega scrape now happening to get the pacer_doc_id values that we're currently lacking. That'll only work on non-bankruptcy PACER courts, but it'll still be a huge improvement.

Any item that lacks a pacer_doc_id is now updated to have a better URL (this works even on bankruptcy courts, it's just that the bankruptcy courts don't do the nice, scrapable redirection we get in normal district courts).
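To illustrate the "scrapable redirection" mentioned above, here is a minimal sketch of pulling a pacer_doc_id out of a redirect target. It assumes the redirect points at a URL containing `/doc1/<digits>`; the function name `pacer_doc_id_from_redirect` is hypothetical, and the sketch does no network I/O, so how the redirect target is obtained is left open.

```python
import re
from typing import Optional

# ASSUMPTION: doc1 URLs look like https://ecf.<court>.uscourts.gov/doc1/<digits>
_DOC1_RE = re.compile(r"/doc1/(\d+)")


def pacer_doc_id_from_redirect(location: Optional[str]) -> Optional[str]:
    """Return the doc1 ID found in a redirect target, or None if absent.

    A None result covers the case where no usable redirect came back,
    which is what the comment above describes for bankruptcy courts.
    """
    if not location:
        return None
    match = _DOC1_RE.search(location)
    return match.group(1) if match else None
```

Returning None rather than raising lets a bulk scrape skip courts that don't redirect and keep going.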

johnhawkinson mentioned this issue Nov 21, 2017

"Buy on PACER" button doesn't work with RECAP #768

Closed

mlissner closed this as completed Dec 8, 2017

mlissner mentioned this issue Dec 28, 2017

Upload RECAP content to Internet Archive #783

Closed
