Refactor address byte swapping #968

tcharding · 2022-04-26T01:27:54Z

Refactor address byte swapping

When encoding a network::Address two of the fields are encoded
big-endian instead of little-endian as is done by consensus_encode. In
order to achieve this we have a helper function addr_to_be that swaps
the bytes. This function is miss-named because it is not converting to a
specific endian-ness (which implies different behaviour on machines with
different endian-ness) but is reversing the byte order irrespective of
the underlying architecture.

Remove function addr_to_be
Inline the endian-ness code when encoding an address
Remove TODO and use to_be_bytes when encoding port
Add a function for reading big-endian bytes read_be_address
Use read_be_address when decoding Address and Addrv2

Refactor only, no logic changes. Code path is already covered by
unit tests.

Kixunil · 2022-04-26T08:23:12Z

Your observation about naming looks correct, the comment was talking about port though. Also I'd call the function swap_addr_bytes or something like that to distinguish it from u16::swap_bytes.

Anyway, I think the whole code looks over-complicated and confusing - why not just write to the writer in a for loop, converting individual u16s on the fly? It should be more obvious what it does.

tcharding · 2022-04-26T21:57:05Z

Your observation about naming looks correct, the comment was talking about port though. Also I'd call the function swap_addr_bytes or something like that to distinguish it from u16::swap_bytes.

I used the same name on purpose, it's the same method implemented on a different type.

Anyway, I think the whole code looks over-complicated and confusing - why not just write to the writer in a for loop, converting individual u16s on the fly? It should be more obvious what it does.

The method is used when decoding as well, in multiple places.

Kixunil · 2022-04-27T10:06:18Z

The reason I don't like swap_bytes being intentionally similar is it doesn't reorder the words, only swaps endianess of each individual word.

I think in a tricky code like this it may be worth a bit of code repetition (decoding should still be a single function) if it increases clarity. But maybe it's just me who has harder time tracking endianess. 🤷‍♂️

I wouldn't block the PR for any of my objections, it's just for consideration.

tcharding · 2022-04-28T00:30:36Z

Force push is total re-write, based on review by @Kixunil. PR description has also been re-written.

Kixunil

read(...) != 2 is definitely a bug, the rest looks OK.

Kixunil · 2022-04-28T11:22:35Z

src/network/address.rs

+    let mut buf = [0u8; 2];
+
+    for i in 0..8 {
+        if io::Read::read(&mut r, &mut buf)? != 2 {


This should be read_exact. read returning less bytes is not an error.

Thanks man, I learned a bit more about readers.

Kixunil · 2022-04-28T11:24:25Z

src/network/address.rs

+        if io::Read::read(&mut r, &mut buf)? != 2 {
+            return Err(encode::Error::ParseFailed("missing address bytes"));
+        }
+        address[i] = u16::from_be_bytes(buf);


Why not for word in &mut address and then *word = u16::from_be_bytes(buf);? (feel free to come up with better var name)

Ah nice, thanks. Implemented as suggested.

tcharding · 2022-04-29T00:15:04Z

Changes in force push:

Rewrite the read_be_address function with suggestions from review above.
Rebase on master.

tcharding · 2022-04-29T00:18:24Z

I have no idea why I did this PR other at this moment in time, converting to draft to let the edition bump stuff go in.

Kixunil

Sorry for not finding the other one sooner.

Kixunil · 2022-04-29T04:41:25Z

src/network/address.rs

+        let mut len = self.services.consensus_encode(&mut s)?;
+
+        for word in &self.address {
+            len += io::Write::write(&mut s, &word.to_be_bytes())?;


Just realized we have the same issue as with read_exact, we need write_all() here. Also while changing you could use s.write_all() instead, no need to repeat the whole trait since it can't get confused anyway thanks to generics.

Legend man, late is better than never :) Will fix as suggested.

tcharding · 2022-05-02T00:09:53Z

Changes in force push: Use write_all as suggested.

Kixunil · 2022-05-02T12:02:44Z

Looks good now, will ACK when it's un-drafted.

When encoding a `network::Address` two of the fields are encoded big-endian instead of little-endian as is done by `consensus_encode`. In order to achieve this we have a helper function `addr_to_be` that swaps the bytes. This function is miss-named because it is not converting to a specific endian-ness (which implies different behaviour on machines with different endian-ness) but is reversing the byte order irrespective of the underlying architecture. - Remove function `addr_to_be` - Inline the endian-ness code when encoding an address - Remove TODO and use `to_be_bytes` when encoding port - Add a function for reading big-endian bytes `read_be_address` - Use `read_be_address` when decoding `Address` and `Addrv2` Refactor only, no logic changes. Code path is already covered by unit tests.

apoelstra

ACK 07c7530

Kixunil

ACK 07c7530

tcharding · 2022-05-24T08:59:06Z

Good to have you back @Kixunil!

07c7530 Refactor address byte swapping (Tobin C. Harding) Pull request description: Refactor address byte swapping When encoding a `network::Address` two of the fields are encoded big-endian instead of little-endian as is done by `consensus_encode`. In order to achieve this we have a helper function `addr_to_be` that swaps the bytes. This function is miss-named because it is not converting to a specific endian-ness (which implies different behaviour on machines with different endian-ness) but is reversing the byte order irrespective of the underlying architecture. - Remove function `addr_to_be` - Inline the endian-ness code when encoding an address - Remove TODO and use `to_be_bytes` when encoding port - Add a function for reading big-endian bytes `read_be_address` - Use `read_be_address` when decoding `Address` and `Addrv2` Refactor only, no logic changes. Code path is already covered by unit tests. ACKs for top commit: apoelstra: ACK 07c7530 Kixunil: ACK 07c7530 Tree-SHA512: 186bc86512e264a7b306f3bc2e18d1619f3cd84fc54412148cfc2663e8d6e9616ea9e2fe19eafec72d76cc11367a9b39cac2b73210d9e43eb8f453bd253b33de

tcharding mentioned this pull request Apr 26, 2022

Remove MSRV todo comments #952

Merged

tcharding force-pushed the to-be-bytes branch from 8aa4e7d to d96396c Compare April 26, 2022 01:37

tcharding force-pushed the to-be-bytes branch from d96396c to e2dfcf4 Compare April 28, 2022 00:29

tcharding force-pushed the to-be-bytes branch from e2dfcf4 to 1a3c3b8 Compare April 28, 2022 00:50

Kixunil requested changes Apr 28, 2022

View reviewed changes

tcharding force-pushed the to-be-bytes branch from 1a3c3b8 to c1580c2 Compare April 29, 2022 00:14

tcharding marked this pull request as draft April 29, 2022 00:18

Kixunil requested changes Apr 29, 2022

View reviewed changes

tcharding force-pushed the to-be-bytes branch from c1580c2 to 67e055b Compare May 2, 2022 00:09

tcharding force-pushed the to-be-bytes branch from 67e055b to 07c7530 Compare May 19, 2022 06:03

tcharding marked this pull request as ready for review May 19, 2022 06:19

apoelstra approved these changes May 19, 2022

View reviewed changes

Kixunil approved these changes May 24, 2022

View reviewed changes

apoelstra merged commit 324fa0f into rust-bitcoin:master May 24, 2022

tcharding deleted the to-be-bytes branch May 24, 2022 21:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor address byte swapping #968

Refactor address byte swapping #968

tcharding commented Apr 26, 2022 •

edited

Kixunil commented Apr 26, 2022

tcharding commented Apr 26, 2022

Kixunil commented Apr 27, 2022

tcharding commented Apr 28, 2022

Kixunil left a comment

Kixunil Apr 28, 2022

tcharding Apr 29, 2022

Kixunil Apr 28, 2022

tcharding Apr 29, 2022

tcharding commented Apr 29, 2022

tcharding commented Apr 29, 2022

Kixunil left a comment

Kixunil Apr 29, 2022

tcharding Apr 29, 2022

tcharding commented May 2, 2022

Kixunil commented May 2, 2022

apoelstra left a comment

Kixunil left a comment

tcharding commented May 24, 2022

Refactor address byte swapping #968

Refactor address byte swapping #968

Conversation

tcharding commented Apr 26, 2022 • edited

Kixunil commented Apr 26, 2022

tcharding commented Apr 26, 2022

Kixunil commented Apr 27, 2022

tcharding commented Apr 28, 2022

Kixunil left a comment

Choose a reason for hiding this comment

Kixunil Apr 28, 2022

Choose a reason for hiding this comment

tcharding Apr 29, 2022

Choose a reason for hiding this comment

Kixunil Apr 28, 2022

Choose a reason for hiding this comment

tcharding Apr 29, 2022

Choose a reason for hiding this comment

tcharding commented Apr 29, 2022

tcharding commented Apr 29, 2022

Kixunil left a comment

Choose a reason for hiding this comment

Kixunil Apr 29, 2022

Choose a reason for hiding this comment

tcharding Apr 29, 2022

Choose a reason for hiding this comment

tcharding commented May 2, 2022

Kixunil commented May 2, 2022

apoelstra left a comment

Choose a reason for hiding this comment

Kixunil left a comment

Choose a reason for hiding this comment

tcharding commented May 24, 2022

tcharding commented Apr 26, 2022 •

edited