net, rpc: expose connection type in getpeerinfo #19883

jonatack · 2020-09-05T15:33:17Z

Expose conn_type via a practical API for JSON-RPC consumers.

returns the conn_type as an integer id for API clients. It is a simple and small change to implement and maintain, and the API can remain stable even if the ConnectionType element naming or order changes.
adds a uint8_t type to the ConnectionType enum class; if preferred, this can be dropped

$ ./src/bitcoin-cli help getpeerinfo
...
    "inbound" : true|false,        (boolean) Inbound (true) or Outbound (false)
    "addnode" : true|false,        (boolean) Whether connection was due to addnode/-connect or if it was an automatic/inbound connection
    "conn_type" : n,               (numeric) Connection type between 0 and 5:
                                   0 - inbound (initiated by the peer)
                                   1 - outbound-full-relay (default automatic connections)
                                   2 - manual (added using the -addnode/-connect configuration options or the addnode RPC)
                                   3 - feeler (short-lived automatic connection to test addresses)
                                   4 - block-relay-only (does not relay transactions or addresses)
                                   5 - addr-fetch (short-lived automatic connection to request addresses)

src/rpc/net.cpp

promag

I'm not sure if it's good idea to return enum's int value. If the enum is changed/refactored clients get broken. IMO string is fine and enough.

jonatack · 2020-09-06T10:40:00Z

I'm not sure if it's good idea to return enum's int value. If the enum is changed/refactored clients get broken. IMO string is fine and enough.

If a string was returned, changing the string name would be a breaking change, and string names are likely to be bike-shed or changed. OTOH there is no reason why the enum integer values would ever have to change. The enum class itself can separate the int values from the rest with no need for extra methods; it can remain stable even if the enum element naming or order is otherwise changed. And the integer is the easiest data for API clients to parse, bounds check and use as an index into a data structure.

promag

I think nobody cares about the int value, if that was the case we wouldn't use enum class. Sending int and have this documented is fine too (but the redundant string is then unnecessary) but the way it is implemented doesn't look safe, mainly because not all conn_type are tested.

This code would be much simpler, and IMO safer, if you just add std::map<ConnectionType, std::string> CONNECTION_TYPE_NAME, CONNECTION_TYPE_DESCRIPTION.

src/rpc/net.cpp

jonatack · 2020-09-06T11:46:27Z

I think nobody cares about the int value

Exactly -- no one caring about the int value is an advantage. The enum class int values can remain stable for the API even if people want to change the enum element naming or order.

Sending int and have this documented is fine too (but the redundant string is then unnecessary)

Thanks for reviewing and making this better, @promag. Dropped the string field and put the doc directly in RPCHelpMan.

promag

Ideally this needs a test for all possible conn_type before merge. You can also add a release note of the new field.

jnewbery · 2020-09-07T09:57:04Z

I prefer #19725. We shouldn't leak our internal enum indexes out to a public API (since that locks us into a specific implementation)

jonatack · 2020-09-07T10:10:26Z

since that locks us into a specific implementation

If the ConnectionType enum were to be abandoned for a hypothetical different implementation, it would just require adding a method to serialise the ids. For now that's not needed.

promag · 2020-09-07T10:12:24Z

@jnewbery I've made that point too, but you can also assume that ATM the mapping function used is the identity function.

amitiuttarwar · 2020-09-08T04:22:26Z

this PR isn't quite a replacement for #19725 because it doesn't include the deprecation of getpeerinfo.addnode, or the logging improvement in net_processing.

I think the proposal of adding conn_type as an integer id is a reasonable proposal, but wanted to clarify the differences for reviewers. I'm personally -0 because I think a string suffices.

laanwj · 2020-09-08T07:36:32Z

I think the proposal of adding conn_type as an integer id is a reasonable proposal, but wanted to clarify the differences for reviewers. I'm personally -0 because I think a string suffices.

I agree here that a string suffices. I don't think it's wise to expose the enumeration IDs on the JSON-RPC interface, as they are an internal implementation detail, and for better or worse (no real enum type) the common way is to use strings as enumerators in JSON.

jonatack · 2020-09-08T09:29:59Z

By any objective technical criteria, ersatz long-format string ids in the place of integer ones seem a substantially worse choice

code
complexity
robustness
API stability
API flexibility for clients
memory
speed
network bandwidth
maintainability

in every way maybe an order of magnitude worse.

An API client can bounds check an integer id, then call a vector element with it.

With a long format string, the client has to match against every possible expected value first in order to error check the value.

This is a lean, clean implementation that adds 2 lines to net.{h,cpp} versus 30.

Overall, I think it's objectively multiple times better for both Bitcoin Core and for software clients of the RPC API.

Also, this unbundles #19725 which adds extraneous logging refactoring and a controversial deprecation into the same PR. Just unbundling it adds value before considering the order of magnitude technical improvement.

jonatack · 2020-09-08T09:33:23Z

The ids can be serialised via a separate method, but that doesn't seem needed here and would just be added complexity for no gain. The enum itself can separate the order and naming from the id values.

jonatack · 2020-09-08T09:48:26Z

this PR isn't quite a replacement for #19725 because it doesn't include the deprecation of getpeerinfo.addnode, or the logging improvement in net_processing.

Yes, it's not intended to replace the logging refactoring or the deprecation. I'm -0.9 on both for the reasons I've stated in that PR.

ajtowns · 2020-09-19T07:13:52Z

If a string was returned, changing the string name would be a breaking change, and string names are likely to be bike-shed or changed. OTOH there is no reason why the enum integer values would ever have to change

The addition to getpeerinfo and the logging change is for helping humans understand what's going on, not for interoperability (there's no standard and different behaviours within the existing specs are perfectly reasonably), nor for command and control (there's no way for other programs to act on the connection type info), so no, changing the string names isn't a breaking change. There are obvious reasons why enum values change: if an entry is removed, or if the entries are rearranged. Yes, you can hardcode the values to prevent that, but there's no reason to do so: this isn't a standard, it's an aid for debugging problems with your node.

This is a small, focused, simple, performant alternative

getpeerinfo isn't a performance critical call, outputting strings via it isn't performance critical (both since all the numbers are encoded as strings anyway -- it's json; and since we're already decoding services to an array of strings via servicenames), and even if none of that were true, you should be providing benchmarks if claiming a performance improvement.

JSON-RPC expects integer ids.

I don't know where this is coming from, but it's not even literally true: "id - The request id. This can be of any type." and "id .. MUST contain a String, Number, or NULL value if included"

sipa · 2020-09-19T07:33:50Z

There is precedent for string values for enumerated types too already: the transaction types in listtransactions ("receive", "send", "generate", "immature", ...), and the branch types in getchaintips come to mind.

DrahtBot · 2020-09-19T13:54:04Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Conflicts

Reviewers, this pull request conflicts with the following ones:

net, rpc, cli: expose peer network in getpeerinfo; simplify/improve -netinfo #20002 (net, rpc, cli: expose GetNetClass()/ConnectedViaTor() in getpeerinfo, use in -netinfo by jonatack)
[test] clarify rpc_net & p2p_disconnect_ban functional tests #19877 ([test] clarify rpc_net & p2p_disconnect_ban functional tests by amitiuttarwar)
net processing: Move block inventory state to net_processing #19829 (net processing: Move block inventory state to net_processing by jnewbery)
net, rpc: expose high bandwidth mode state via getpeerinfo #19776 (net, rpc: expose high bandwidth mode state via getpeerinfo by theStack)
[RPC] Add connection type to getpeerinfo, improve logs #19725 ([RPC] Add connection type to getpeerinfo, improve logs by amitiuttarwar)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

jonatack · 2020-09-20T10:06:04Z

If a string was returned, changing the string name would be a breaking change, and string names are likely to be bike-shed or changed. OTOH there is no reason why the enum integer values would ever have to change

The addition to getpeerinfo and the logging change

Unless I'm mistaken, the logging change would have the same output.

is for helping humans understand what's going on

From what I've been able to understand, the CLI client-side options are the human-first ones and not constrained by API stability constraints, which is why I added features to -getinfo and also created -netinfo that has been described as getpeerinfo for humans. The RPC API, on the other hand, is more-or-less machine-first and constrained by stability and the desire to avoid causing suffering for software clients downstream, even if it does not for human users of the RPC like us. That is what this PR focuses on. An additional human-friendly convenience field could be an option if people feel the tradeoffs are worth it, but that's orthogonal to this proposal.

changing the string names isn't a breaking change

Renaming a connection type not only entails cascading codebase changes for long-format name ids, it is also a breaking API change if the client no longer recognizes the connection type. This can be avoided by using a standard integer id decoupled from the naming. I will provide a demonstration with code a bit later.

getpeerinfo isn't a performance critical call

This field is called in a loop. API clients may be interested in performance, either because they call this frequently or at high frequency (I do) or from clients/to servers that are constrained in CPU, memory, or internet bandwidth. If it's available with a couple of lines, why dunk on it?

all the numbers are encoded as strings anyway -- it's json

I could be wrong but don't think this is true. https://tools.ietf.org/html/rfc7159

and even if none of that were true, you should be providing benchmarks

Requesting benchmarks for the simplest, standard practice is a bit pedantic. No one asked for benchmarks, for instance, when in #19731 I proposed to send Unix epoch times instead of a human-friendly datetime format.

I don't know where this is coming from, but it's not even literally true: "id - The request id. This can be of any type." and "id .. MUST contain a String, Number, or NULL value if included"

Thanks for confirming that we can use numbers. Now...do you see any long-format, non-numerical string ids in https://www.jsonrpc.org/specification#examples and https://www.jsonrpc.org/specification_v1#a4.CommunicationExamples? Right, neither do I (maybe I need glasses; I have been putting it off for a long time). Sure, you could use ersatz long-format strings, but ideally not as your primary id. "I want to depend on long-format names that people are already proposing to change, please send me that" said no API client to me ever. However, the clients might be ok with it as an optional convenience field that they can request on a per-call basis.

At any rate, thank you for having a look. Mind giving a concept ACK?

jonatack · 2020-09-20T10:15:43Z

There is precedent for string values for enumerated types too already: the transaction types in listtransactions ("receive", "send", "generate", "immature", ...), and the branch types in getchaintips come to mind.

Sure, but like the kebab-case and snake_case config args, precedents may have varying degrees of desirability and relevance.

I wonder if we shouldn't have more separation between the API for software and the CLI for humans (seems to be a trend already?)... e.g. the human-friendly CLI versions could have human-readable datetime formats instead of Unix epoch time, etc.

michaelfolkson · 2020-09-20T11:29:53Z

Approach ACK

I can see why some people think this argument is a touch pedantic. But at the very least this isn't a worse approach than #19725 and it is definitely simpler. Plus there appears there could be benefits to this approach downstream.

(PR #19725 could still do the logging refactoring and deprecation. I'm not convinced these are controversial.)

sipa · 2020-09-26T01:49:15Z

Concept NACK.

I don't think we should be exposing internal enums, as its mapping between numbers and connection type semantics is arbitary. Exposing it via RPC is cementing it in stone, for no good reason. The set of available connection types will change over time, and that will very likely mean that some types that currently exist won't remain.

As I've pointed out #19725 (comment), if you want an actual stable mapping with the advantages of machine-readability of numbers over the strings assigned by #19725, I think you'd need to maintain a separate set of numbers for the RPC interface, in which old numbers may retire if connection types are removed/split or even just substantially change meaning. I don't think that's worth the effort.

DrahtBot · 2020-09-26T17:21:48Z

🐙 This pull request conflicts with the target branch and needs rebase.

_{Want to unsubscribe from rebase notifications on this pull request? Just convert this pull request to a "draft".}

promag reviewed Sep 5, 2020

View reviewed changes

src/rpc/net.cpp Outdated Show resolved Hide resolved

DrahtBot added P2P RPC/REST/ZMQ labels Sep 5, 2020

jonatack force-pushed the getpeerinfo-conn-type branch from 0d24183 to 515e30d Compare September 5, 2020 17:46

promag reviewed Sep 6, 2020

View reviewed changes

src/rpc/net.cpp Outdated Show resolved Hide resolved

src/rpc/net.cpp Outdated Show resolved Hide resolved

jonatack force-pushed the getpeerinfo-conn-type branch from 515e30d to 4ac4ee5 Compare September 6, 2020 11:39

net, rpc: expose connection type in getpeerinfo

bd2aa75

jonatack force-pushed the getpeerinfo-conn-type branch from 4ac4ee5 to bd2aa75 Compare September 6, 2020 11:39

promag reviewed Sep 6, 2020

View reviewed changes

jonatack mentioned this pull request Sep 8, 2020

[RPC] Add connection type to getpeerinfo, improve logs #19725

Merged

fjahr mentioned this pull request Sep 13, 2020

"Good First Review" label #19941

Closed

This was referenced Sep 19, 2020

[test] clarify rpc_net & p2p_disconnect_ban functional tests #19877

Merged

net processing: Move block inventory state to net_processing #19829

Merged

net, rpc: expose high bandwidth mode state via getpeerinfo #19776

Merged

DrahtBot mentioned this pull request Sep 23, 2020

net: Add CNode::ConnectedThroughNetwork member function #19998

Merged

naumenkogs mentioned this pull request Sep 23, 2020

doc: Better document features of feelers #19958

Merged

DrahtBot mentioned this pull request Sep 23, 2020

net, rpc, cli: expose peer network in getpeerinfo; simplify/improve -netinfo #20002

Merged

DrahtBot added the Needs rebase label Sep 26, 2020

jonatack closed this Sep 28, 2020

bitcoin locked as resolved and limited conversation to collaborators Feb 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

net, rpc: expose connection type in getpeerinfo #19883

net, rpc: expose connection type in getpeerinfo #19883

jonatack commented Sep 5, 2020 •

edited

Loading

promag left a comment

jonatack commented Sep 6, 2020 •

edited

Loading

promag left a comment

jonatack commented Sep 6, 2020 •

edited

Loading

promag left a comment

jnewbery commented Sep 7, 2020

jonatack commented Sep 7, 2020

promag commented Sep 7, 2020

amitiuttarwar commented Sep 8, 2020

laanwj commented Sep 8, 2020

jonatack commented Sep 8, 2020 •

edited

Loading

jonatack commented Sep 8, 2020

jonatack commented Sep 8, 2020

ajtowns commented Sep 19, 2020

sipa commented Sep 19, 2020

DrahtBot commented Sep 19, 2020 •

edited

Loading

jonatack commented Sep 20, 2020 •

edited

Loading

jonatack commented Sep 20, 2020

michaelfolkson commented Sep 20, 2020

sipa commented Sep 26, 2020

DrahtBot commented Sep 26, 2020

net, rpc: expose connection type in getpeerinfo #19883

net, rpc: expose connection type in getpeerinfo #19883

Conversation

jonatack commented Sep 5, 2020 • edited Loading

promag left a comment

Choose a reason for hiding this comment

jonatack commented Sep 6, 2020 • edited Loading

promag left a comment

Choose a reason for hiding this comment

jonatack commented Sep 6, 2020 • edited Loading

promag left a comment

Choose a reason for hiding this comment

jnewbery commented Sep 7, 2020

jonatack commented Sep 7, 2020

promag commented Sep 7, 2020

amitiuttarwar commented Sep 8, 2020

laanwj commented Sep 8, 2020

jonatack commented Sep 8, 2020 • edited Loading

jonatack commented Sep 8, 2020

jonatack commented Sep 8, 2020

ajtowns commented Sep 19, 2020

sipa commented Sep 19, 2020

DrahtBot commented Sep 19, 2020 • edited Loading

Conflicts

jonatack commented Sep 20, 2020 • edited Loading

jonatack commented Sep 20, 2020

michaelfolkson commented Sep 20, 2020

sipa commented Sep 26, 2020

DrahtBot commented Sep 26, 2020

jonatack commented Sep 5, 2020 •

edited

Loading

jonatack commented Sep 6, 2020 •

edited

Loading

jonatack commented Sep 6, 2020 •

edited

Loading

jonatack commented Sep 8, 2020 •

edited

Loading

DrahtBot commented Sep 19, 2020 •

edited

Loading

jonatack commented Sep 20, 2020 •

edited

Loading