various byteSwap and bit_cast cleanups #3227

neheb · 2025-04-03T19:55:12Z

No description provided.

kevinbackhouse · 2025-04-18T12:17:44Z

src/bmffimage.cpp

+  std::array<char, sizeof(uint32_t)> p;
+  std::memcpy(p.data(), &n, sizeof(uint32_t));
+  std::string result(p.begin(), p.end());


I think this function might be a cleaner as a loop that mask the bottom byte and then does n >>= 8 on each iteration. That way, there's no need to think about endianness.

I just tried this.

godbolt output ballooned from 39 lines to 287. The code looks incredibly inefficient with operator new and memcpy being called.

Second attempt went from 39 to 49. Maybe that's good enough.

edit: now 31.

kevinbackhouse · 2025-04-18T12:32:05Z

src/image.cpp

+  uint16_t v;
+  std::memcpy(&v, buf.c_data(offset), sizeof(uint16_t));


The read_uint8 method checks for out-of-bound accesses, so it isn't a good idea to replace it with a memcpy.

It would be better to use methods like read_uint16 and read_uint32 which have a ByteOrder parameter. But this function has a bSwap parameter which doesn't mean the same thing, so a bit of refactoring would be needed.

Turns out that even though MSVC is supposed to support std::byteswap, I can't seem to get it to compile with Godbolt. Whats more, the fallback path MSVC cannot optimize. Signed-off-by: Rosen Penev <rosenp@gmail.com>

neheb · 2025-04-19T00:09:42Z

Before I go on I should really add a big endian CI here.

Avoids having to deal with endianness. Signed-off-by: Rosen Penev <rosenp@gmail.com>

We can just treat the data as big endian with bit shifting and ORing. Signed-off-by: Rosen Penev <rosenp@gmail.com>

It's the same function. Signed-off-by: Rosen Penev <rosenp@gmail.com>

Signed-off-by: Rosen Penev <rosenp@gmail.com>

neheb · 2025-04-19T04:19:35Z

removed bad commits. This was tested on the big endian CI.

Copilot

Pull Request Overview

This PR modernizes byte-swapping and binary-to-float conversions and consolidates endian-dependent routines.

Use std::bit_cast in getDouble when available
Centralize byte-swapping via Image::byteSwap and MSVC intrinsics
Refactor platform-dependent toAscii and GUID parsing into explicit byte loops

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
src/types.cpp	Added `std::bit_cast` path under `__cpp_lib_bit_cast` guard
src/pgfimage.cpp	Removed redundant `byteSwap_` overload; call `Image::byteSwap`
src/jp2image.cpp	Replaced platform-based `reverse` with explicit byte-shift loop
src/image.cpp	Added MSVC-specific `_byteswap_` intrinsics for all word sizes
src/bmffimage.cpp	Refactored `toAscii` to loop with explicit mapping (`_`, `.`)
src/asfvideo.cpp	Replaced `memcpy` + conditional swaps in `GUIDTag` with shifts

Comments suppressed due to low confidence (4)

src/types.cpp:316

Add unit tests covering the new std::bit_cast path in getDouble to verify correct behavior for both little- and big-endian inputs on compilers supporting C++20.

#ifdef __cpp_lib_bit_cast

src/asfvideo.cpp:49

[nitpick] Add a comment explaining that these shifts construct the field in little-endian order to clarify the byte index mapping.

data1_ = (static_cast<uint32_t>(bytes[3]) << 24) | (static_cast<uint32_t>(bytes[2]) << 16) | (static_cast<uint32_t>(bytes[1]) << 8) | (static_cast<uint32_t>(bytes[0]));

src/jp2image.cpp:116

[nitpick] Consider extracting this byte-to-char mapping logic into a shared helper function, since a nearly identical loop exists in bmffimage.cpp.

for (size_t i = 0; i < result.size(); ++i) {

src/bmffimage.cpp:89

[nitpick] The toAscii implementation here mirrors jp2image.cpp; extracting it into a common utility would reduce duplication and ease future updates.

std::string result(sizeof(uint32_t), '\0');

neheb force-pushed the 3 branch from 587aa0b to c967e4d Compare April 3, 2025 19:57

neheb requested a review from kevinbackhouse April 5, 2025 04:17

neheb force-pushed the 3 branch from c967e4d to cf2dea2 Compare April 10, 2025 17:33

kevinbackhouse reviewed Apr 18, 2025

View reviewed changes

byteswap: add MSVC handling

51bae6d

Turns out that even though MSVC is supposed to support std::byteswap, I can't seem to get it to compile with Godbolt. Whats more, the fallback path MSVC cannot optimize. Signed-off-by: Rosen Penev <rosenp@gmail.com>

neheb added 4 commits April 18, 2025 17:30

toAscii: replace logic with masking loop

71cb883

Avoids having to deal with endianness. Signed-off-by: Rosen Penev <rosenp@gmail.com>

asfvideo: replace memcpy and byteSwap

7d3e246

We can just treat the data as big endian with bit shifting and ORing. Signed-off-by: Rosen Penev <rosenp@gmail.com>

use Image::byteSwap

1841616

It's the same function. Signed-off-by: Rosen Penev <rosenp@gmail.com>

use more bit_cast

67bea7d

Signed-off-by: Rosen Penev <rosenp@gmail.com>

neheb force-pushed the 3 branch from cf2dea2 to 67bea7d Compare April 19, 2025 04:19

neheb requested a review from Copilot June 13, 2025 05:23

Copilot AI reviewed Jun 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

various byteSwap and bit_cast cleanups #3227

various byteSwap and bit_cast cleanups #3227

Uh oh!

neheb commented Apr 3, 2025

Uh oh!

kevinbackhouse Apr 18, 2025

Uh oh!

neheb Apr 18, 2025

Uh oh!

neheb Apr 18, 2025 •

edited

Loading

Uh oh!

kevinbackhouse Apr 18, 2025

Uh oh!

neheb commented Apr 19, 2025

Uh oh!

neheb commented Apr 19, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

		uint16_t v;
		std::memcpy(&v, buf.c_data(offset), sizeof(uint16_t));

various byteSwap and bit_cast cleanups #3227

Are you sure you want to change the base?

various byteSwap and bit_cast cleanups #3227

Uh oh!

Conversation

neheb commented Apr 3, 2025

Uh oh!

kevinbackhouse Apr 18, 2025

Choose a reason for hiding this comment

Uh oh!

neheb Apr 18, 2025

Choose a reason for hiding this comment

Uh oh!

neheb Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kevinbackhouse Apr 18, 2025

Choose a reason for hiding this comment

Uh oh!

neheb commented Apr 19, 2025

Uh oh!

neheb commented Apr 19, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

neheb Apr 18, 2025 •

edited

Loading