Support libp2p relays for NAT traversal #186

Vahe1994 · 2023-01-08T11:23:13Z

Added relay options to servers
Enabled relay options by default
Changed hivemind version to 1.1.5
Moved reachability check to be performed after blocks are loaded

2. Enabled relay options by default 3. Changed hivemind version to 1.1.5

mryab · 2023-01-08T12:44:34Z

src/petals/cli/run_server.py

@@ -127,7 +127,10 @@ def main():
    parser.add_argument("--mean_balance_check_period", type=float, default=60,
                        help="Check the swarm's balance every N seconds (and rebalance it if necessary)")

+    parser.add_argument("--auto_relay", action='store_true', help="Enabling relay for NAT traversal")


Suggested change

parser.add_argument("--auto_relay", action='store_true', help="Enabling relay for NAT traversal")

parser.add_argument("--auto_relay", action='store_true', help="Enable relay for NAT traversal")

mryab · 2023-01-08T12:49:00Z

src/petals/server/server.py

@@ -78,6 +78,8 @@ def __init__(
        load_in_8bit: Optional[bool] = None,
        tensor_parallel_devices: Optional[Sequence[torch.device]] = None,
        skip_reachability_check: bool = False,
+        use_relay: bool = True,


It's best to either remove default argument values of to remove these arguments completely: we might forget to change defaults here in the future, and required values will be passed to kwargs anyway

I see your point but I think we should keep it, since the convention in Petals is that all Server defaults match to the defaults of run_server.py (in turn, the hivemind default for use_auto_relay is different).

But all you said would have applied if the defaults here matched with hivemind.

I will agree with @borzunov here. Here are my arguments:

use_relay will not be passed from run_server and we want it by default to be True

it is nice to see in the arguments all parameters that matters

usually , it is not a good idea to be dependent on default argument from another library . They could be changed without notice and can lead to strange behavior

I see your point about explicitly indicating arguments for creation of Server, though it somewhat contradicts the existence of **kwargs in init. My primary concern is that we should strive to have consistent defaults across different locations: one way to do this in an error-proof way would be to declare a common constant with the default value and use it in both locations. Besides, petals-cli is a part of Petals, so these files belong to the same library

We would need to create constants for all defaults in this case (tens of them). I think this is a more general problem that should be addressed outside of this PR (maybe we should use smth like reflection).

borzunov

Thanks for the PR!

I have questions about relays, I need to talk to @justheuristic before we ship this code.

borzunov · 2023-01-09T15:31:20Z

src/petals/server/reachability.py

+logger = get_logger(__file__)
+
+
+def check_reachability(peer_id, wait_time: float = 7 * 60, retry_delay: float = 15) -> None:


Moved from server.py

borzunov

All urgent issues have been resolved.

1. Added relay options to servers

5ee93df

2. Enabled relay options by default 3. Changed hivemind version to 1.1.5

Vahe1994 requested review from justheuristic and borzunov January 8, 2023 11:23

- style reformatting

d641b9b

Vahe1994 marked this pull request as ready for review January 8, 2023 11:45

Vahe1994 requested a review from mryab January 8, 2023 11:47

mryab approved these changes Jan 8, 2023

View reviewed changes

borzunov changed the title ~~Support circuit relay v2~~ Support libp2p relays for NAT traversal Jan 8, 2023

borzunov force-pushed the relay_auto branch from 8a9e36f to 9d2cd2a Compare January 8, 2023 13:52

Refactor CLI arg to look as --use_auto_relay False/True

d99d710

borzunov force-pushed the relay_auto branch from 9d2cd2a to d99d710 Compare January 8, 2023 13:53

borzunov requested changes Jan 8, 2023

View reviewed changes

borzunov and others added 3 commits January 9, 2023 06:58

Delay reachability check, add retries to it

5e1c9fc

Merge branch 'main' into relay_auto

a8f20de

Leave --no_auto_relay argument only

a39e544

borzunov force-pushed the relay_auto branch from 14fdfc8 to a39e544 Compare January 9, 2023 13:25

Shorten info message

59f465f

borzunov force-pushed the relay_auto branch from 6415ae8 to 59f465f Compare January 9, 2023 14:00

borzunov added 2 commits January 9, 2023 14:13

Improve "GPU is not available" message

6d8322e

Perform reachability check once blocks are loaded to avoid delays

43b1997

borzunov force-pushed the relay_auto branch from 8c69eb5 to 43b1997 Compare January 9, 2023 15:24

Update constant and comment

434b630

borzunov force-pushed the relay_auto branch from 5e6064c to 434b630 Compare January 9, 2023 15:30

borzunov reviewed Jan 9, 2023

View reviewed changes

Fix imports

dbf504b

borzunov approved these changes Jan 9, 2023

View reviewed changes

borzunov merged commit 93bed7d into main Jan 9, 2023

borzunov deleted the relay_auto branch January 9, 2023 16:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support libp2p relays for NAT traversal #186

Support libp2p relays for NAT traversal #186

Vahe1994 commented Jan 8, 2023 •

edited by borzunov

Loading

mryab Jan 8, 2023

mryab Jan 8, 2023

borzunov Jan 8, 2023 •

edited

Loading

Vahe1994 Jan 8, 2023 •

edited

Loading

mryab Jan 8, 2023

borzunov Jan 8, 2023

borzunov left a comment

borzunov Jan 9, 2023

borzunov left a comment

	parser.add_argument("--auto_relay", action='store_true', help="Enabling relay for NAT traversal")
	parser.add_argument("--auto_relay", action='store_true', help="Enable relay for NAT traversal")

		logger = get_logger(__file__)


		def check_reachability(peer_id, wait_time: float = 7 * 60, retry_delay: float = 15) -> None:

Support libp2p relays for NAT traversal #186

Support libp2p relays for NAT traversal #186

Conversation

Vahe1994 commented Jan 8, 2023 • edited by borzunov Loading

mryab Jan 8, 2023

Choose a reason for hiding this comment

mryab Jan 8, 2023

Choose a reason for hiding this comment

borzunov Jan 8, 2023 • edited Loading

Choose a reason for hiding this comment

Vahe1994 Jan 8, 2023 • edited Loading

Choose a reason for hiding this comment

mryab Jan 8, 2023

Choose a reason for hiding this comment

borzunov Jan 8, 2023

Choose a reason for hiding this comment

borzunov left a comment

Choose a reason for hiding this comment

borzunov Jan 9, 2023

Choose a reason for hiding this comment

borzunov left a comment

Choose a reason for hiding this comment

Vahe1994 commented Jan 8, 2023 •

edited by borzunov

Loading

borzunov Jan 8, 2023 •

edited

Loading

Vahe1994 Jan 8, 2023 •

edited

Loading