
Daemon.jsonrpc_file_read: read claims from a file and download them #3423

Closed
belikor wants to merge 2 commits into lbryio:master from belikor:print-summary-read

Conversation

@belikor (Contributor) commented Sep 14, 2021

This follows after #3422.

The idea with #3422 is to produce a file with a list of claims. With this pull request we take that file, parse it to get the claim IDs, and then download each of the streams. The file follows the comma-separated values (CSV) format, although by default we use the semicolon `;` as the separator.

lbrynet file summary --file=summary.txt
lbrynet file read --file=summary.txt

Basically, the idea is that we can share lists of claims with other users of the LBRY network, and they can import these lists on their own computers (through lbrynet or the LBRY Desktop application) to download the same claims that we have, and thus help seed the same content that we are seeding.

This is a prototype implementation; it works when the number of claims is relatively small; however, once the number of claims is large, more than 500 or so, the Daemon.jsonrpc_file_read method will time out, so it won't finish processing the list. I'm not sure what can be done to make sure it processes a big list without timeouts.

The obvious solution is to not implement this in the SDK itself, but parse the file, and call lbrynet get on each of the claims.

```
# Parse the summary file and call `lbrynet get` on each claim ID
# (relies on `get` accepting claim IDs, as proposed in #3411).
import csv
import subprocess

with open("summary.txt", newline="") as f:
    for row in csv.reader(f, delimiter=";"):
        claim_id = row[2].strip()  # the third field holds the claim_id
        subprocess.run(["lbrynet", "get", claim_id])
```

Then each call to get is separate from the others, and each has its own timeout.

Also, since the file is meant to contain the 'claim_id', get should be able to handle claim IDs, as proposed in #3411.

This allows printing a list of all claim streams that were downloaded
to the system.
The list is printed to the terminal or to a specific file.

It accepts some parameters to control the information that is printed.
```
lbrynet file summary --blobs --show_channel --title --stream_type --path
lbrynet file summary --show=incomplete --start=10 --end=40
lbrynet file summary --sort=claim_name --reverse --sep=' ;.;'
```

The `--file` option writes the list of claims to a file
which then can be shared with other users of LBRY in order
to download the same claims and contribute to seeding that content.
```
lbrynet file summary --channel=@somechannel --file=summary.txt --fdate
```

By default it will print the date of the claim (derived from the `'claim_height'`, that is, when the claim was registered in the blockchain), the `'claim_id'`, the `'claim_name'`, and whether the media file is present in the download directory.
```
 1/42; 20200610_10:23:37-0500; b231714456ee832daeba4b8356803e7591126dff; "07-S"; no-media
 2/42; 20200610_10:27:06-0500; 31700ff11f900429d742f2f137ba25393bdb3b0a; "09-S"; media
 3/42; 20200609_23:14:47-0500; 70dfefa510ca6eee7023a2a927e34d385b5a18bd; "04-S"; no-media
```
@belikor force-pushed the print-summary-read branch 2 times, most recently from bb4a666 to 147bf42 on September 14, 2021 at 22:50
With `lbrynet file summary` we are able to produce a file with a list
of claims.

With `lbrynet file read` we are able to parse that file,
get the claim IDs, and then download each of the streams.
```
lbrynet file read --file=summary.txt
```
@coveralls

Coverage Status

Coverage decreased (-0.5%) to 67.453% when pulling d9acdb8 on belikor:print-summary-read into 561566e on lbryio:master.

@eukreign assigned lyoshenka and unassigned eukreign Sep 15, 2021
@eukreign (Member)

@lyoshenka this PR involves API changes, please review

@belikor (Contributor, Author) commented Sep 20, 2021

> it works when the number of claims is relatively small; however, once the number of claims is large, more than 500 or so, the Daemon.jsonrpc_file_read method will time out,

Is there a way to increase the timeout? I wonder if I can just pass the --timeout option all the way down to the jsonrpc_get method. The idea is that if we pass a file with an arbitrary number of claims, say 5000, the method will process every single item.
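
For illustration, a rough sketch of what that could look like (the parameter and helper names below are placeholders, not the actual implementation in this branch):

```
# Hypothetical sketch: thread a per-claim timeout down to each get call,
# so a long list never hits a single overall deadline.
# `parse_summary_file` is a placeholder helper, and passing a claim ID
# to `get` relies on the behavior proposed in #3411.
async def jsonrpc_file_read(self, file_path=None, timeout=None):
    results = []
    for claim_id in parse_summary_file(file_path):
        results.append(await self.jsonrpc_get(claim_id, timeout=timeout))
    return results
```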

@lyoshenka (Member)

I'd prefer not to add this feature. It can be accomplished with a few lines of scripting, and as you pointed out it doesn't work when there are many claims (at which point you fall back to scripting anyway).

As I said in #3422 (comment), we should aim to keep the API simple.
