refactor: Rewrite AMF and property behavior logic to use smart pointers, references, and string_views over raw pointers and std::string& #1452

jadebenn · 2024-02-11T00:50:53Z

Not sure what's causing the game tests to fail right now after several hours of debugging, so marking it as draft for now. If any of you have a bit of time to look it over, it would be greatly appreciated.

Jettford

Way, way too many unrelated changes, this should be broken up.

jadebenn

Ah. Sorry about the commit history. I pulled the TryParse branch over before it was added and forgot to squash it, so all those commits were added to this branch's history.

jadebenn · 2024-02-11T00:55:28Z

Way, way too many unrelated changes, this should be broken up.

I actually stayed really focused. The commit history is just screwed.

Jettford · 2024-02-11T00:58:24Z

I am not basing my comment off of the commit history, I didn't read that.

… string_views over raw pointers and std::string&

jadebenn · 2024-02-11T01:22:20Z

Well, for what it's worth, I at least fixed the commit history now.

EmosewaMC · 2024-02-11T03:33:05Z

There is a lot here to unpack so ill bullet the points so they are clear, and also to note that i only got half way for now just so there werent too many questions to answer at once

Why is this code being changed?
a. This code is well tested (even unit tested unlike much of the code base) and has
been proven to work and be good enough for our use case.
How much performance are we really saving with a custom string hasher on small
strings?
a. AMF3 is not something used in a large portion of the codebase. Alongside that,
the strings used in it are generally small enough to fit in SSO. Also,
there are generally so few keys per AMF associative that the lookup is already
only 1 comparison so this seems like we're just adding complexity with very little gain.
Why are we using unique_ptr for the AMF values when they are just being side cast
away all the time with .get and .reset?
a. The whole purpose of unique_ptr is the object is owned and destroyed at the end
of scope. Seeing so many uses of .get, .reset and with all of the moves going on; It is fairly mis-leading what
actually owns the object at the end of scope and appears to defeat the purpose of the class.
Why are we changing the functions in the AMFDeserialize header to be static?
a. This just seems like an unnecessary change. If they are private static, they may
as well only live in the cpp file anyways without a need to be declared in the class.
Why are we using complex emplace in the insertion for the associative
part?
a. This is not a feature many people know how to use, as such this is something
others would look at and just remove and replace with a regular emplace as this would cover our use case.
Why are we using an incredibly complex string_view hasher for what will be very little
performance gain and a very difficult to maintain struct?
a. Same as point 2; What are we gaining from this? This also uses very complex
folding expressions that are hard to understand the syntax of
for those who are not fluent in meta programming. Yes you did explain the end
result and an example use case of the structure, but the way the overload struct is written implies it is used for more than
just string hashing, and more so a way to overload any lookup with dual types.
Why are we suddenly using rvalue references for container iteration?
a. The change here is a bit syntactical and is another one that is hard to understand
and maintain for others.
If we are updating all ControlBehaviorMessages to have m_ prefixes, why was
BlockDefinition skipped?
Why are variable declaration orders being shifted around?
a. Is there a particular end goal with the end format you landed on?

MergeStripsMessage.h. I stopped around this file for a reference where to begin again
after these questions are answered

jadebenn · 2024-02-11T04:34:11Z

Hey Emo: Thanks for looking over this! I'll try and address your questions point by point.

It's just a proposal as any PR is. However, I am trying to cut down on manual memory management wherever possible in the codebase, and the legwork required to do so did turn out to be more than I anticipated. My intention here is to make the ownership of resources more clear.
It's my understanding that a temporary std::string is constructed by default as part of the lookup process due to C++ legacy reasons, so it's nice to be rid of that. The main reason, though, was to allow more widespread usage of std::string_view, for reasons which I'll expand on in the next few points.
get() is being used for non-owning references. The dense and associative vectors/maps own the actual resources, which is why they're wrapped in unique pointers to begin with. If polymorphism wasn't a concern (and it unfortunately is), I would have just stored the objects in the vector directly. As-is, when those containers go out of scope, the unique pointers ensure their destructors are called as well. Reset() is being used when an element needs to explicitly be deleted or replaced - almost 1:1 with the calls to 'delete' in the unaltered code.
I'm fine with moving those static functions to the cpp file. I just noticed that a few of those functions did not actually work on 'this' objects and could be made static with no issue.
Not sure what you mean by a 'regular emplace,' here.
See point 2. To elaborate on my earlier point (and point 3 as well), the use of raw pointers in the original code made it pretty unclear when a resource was owned and when it was being provided as a reference. This is also the case for strings. Now, std::string is only used in contexts where the actual string's ownership needs to be passed around, just as unique ptr is only being used to convey the deletion responsibility is being shifted to someone else.
This can be replaced by a regular auto& in this context and I will be more than happy to if you still think it should be after I explain my reasoning. Both const auto& and auto&& are 'universal references' and will bind to anything. Given that, I prefer to use const auto& to signify a 'read only' loop and auto&& to signify a 'read-and-write' loop. In this particular case an auto& will do that job just as well, but an auto&& reference guarantees it in any context.
Oversight.
Smaller variable types (ints, floats, etc) listed first to (potentially) reduce padding. Not really a real-world concern here, but good practice and I didn't see the harm in doing it.

jadebenn · 2024-02-11T04:36:21Z

I will also note that, to my understanding, the only issue blocking the CIs currently is some strangeness with the unit tests themselves. While in game, the actual behaviors seem to load fine (at least according to the debug console output), in the game tests their IDs do not. I spent several hours trying to figure out the cause of this and I wasn't able to. Part of what motivated me to post this as a draft was the hope that one of you might have some insight.

jadebenn · 2024-02-11T08:01:01Z

assuming the CI issue was resolved in these last few commits, I'm going to pivot to splitting up this big PR into smaller ones,

…ed from the other branch commit)

EmosewaMC · 2024-02-13T02:29:52Z

the only 2 things i have time to reply to right now are

Smaller variable types (ints, floats, etc) listed first to (potentially) reduce padding. Not really a real-world concern here, but good practice and I didn't see the harm in doing it.

Unless the ordering is causing an actual problem, it can be left alone to reduce changelist size. This is not an optimization we are going for and will have next to zero effect for our use case, though i appreciate the consideration and thought behind it. Feel free to leave it in this PR and the other one, but in the future, we would rather focus on higher order optimizations than micro ones like packing outside of binary stream data.

It's just a proposal as any PR is. However, I am trying to cut down on manual memory management wherever possible in the codebase, and the legwork required to do so did turn out to be more than I anticipated. My intention here is to make the ownership of resources more clear.

We would rather, as mentioned in discord, propose the changes before the PR is put up. If a PR is put up first, then the work has already been done and we would rather not throw away work if we dont need to. In this context manual memory management was a necessary cost for abstraction and with the previous implementaion already finished and working, this is just change for the sake of change. It's not that i'm against the changes themselves, but more the code for AMF was finished as far as functionality goes and changing it may introduce bugs from a new implementation. Removing manual memory management is a good thing to work on, but in a context like this I do not see it as necessary.

…bly will revert

…n, preferred member naming conventions, and const-ref getters (#1456) * Split out BehaviorMessage class changes from PR #1452 * remove <string_view> inclusion in ActionContext.h * add the arguments nullptr check back in * remove redundant std::string constructor calls * Update AddStripMessage.cpp - change push_back to emplace_back

…Ptrs

jadebenn requested review from aronwk-aaron, EmosewaMC and Jettford February 11, 2024 00:50

Jettford requested changes Feb 11, 2024

View reviewed changes

jadebenn commented Feb 11, 2024

View reviewed changes

Rewrite AMF and behavior logic to use smart pointers, references, and…

fca5623

… string_views over raw pointers and std::string&

jadebenn force-pushed the AMFUniquePtrs branch from 4dc1cfc to fca5623 Compare February 11, 2024 01:21

jadebenn added 3 commits February 11, 2024 01:49

fix m_BehaviorID initialization

acd601f

Fix BlockDefinition member naming

c75cef2

remove redundant reset()s

07b5135

jadebenn added 6 commits February 11, 2024 02:09

Replace UB forward template declarations with header include

7e036a3

Merge remote-tracking branch 'upstream/main' into AMFUniquePtrs

08608c7

remove unneeded comment

b8498c3

remove non-const ref getters

1bb82b8

simplify default behavior id initialization

b89488b

Fix invalidated use of Getter to set a value

0f1922a

jadebenn added a commit that referenced this pull request Feb 11, 2024

Split out BehaviorMessage class changes from PR #1452

d308d58

jadebenn mentioned this pull request Feb 11, 2024

refactor: Rewrite BehaviorMessage classes to use member initialization, preferred member naming conventions, and const-ref getters #1456

Merged

jadebenn added 2 commits February 11, 2024 18:02

Update AddStripMessage.cpp - change push_back to emplace_back

ab5ed7d

fix pointer to ref conversion mistake (should not have directly grabb…

188462e

…ed from the other branch commit)

deref

797eeb5

jadebenn added 2 commits February 17, 2024 23:57

VERY experimental testing of forward declaration of templates - proba…

e381ef7

…bly will revert

Revert changes (as expected)

b2b21a8

jadebenn and others added 10 commits February 18, 2024 18:45

Merge remote-tracking branch 'upstream/main' into AMFUniquePtrs

2798ee2

Update BlockDefinition.h - remove extraneous semicolons

cee38f5

Update BlockDefinition.h - remove linebreak

c9f2b31

Merge remote-tracking branch 'upstream/main' into AMFUniquePtrs

f5e9bd8

Update Amf3.h member naming scheme

2d7165a

Merge remote-tracking branch 'upstream/AMFClassNaming' into AMFUnique…

9bb5089

…Ptrs

Merge remote-tracking branch 'upstream/main' into AMFUniquePtrs

7cfc10c

fix duplicated code

da613ad

const iterators

939eb61

const pointers

9f8cff8

jadebenn mentioned this pull request Feb 26, 2024

chore: Change AMF to use references #1479

Merged

Merge remote-tracking branch 'upstream/main' into AMFUniquePtrs

cf569f7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: Rewrite AMF and property behavior logic to use smart pointers, references, and string_views over raw pointers and std::string& #1452

refactor: Rewrite AMF and property behavior logic to use smart pointers, references, and string_views over raw pointers and std::string& #1452

jadebenn commented Feb 11, 2024

Jettford left a comment

jadebenn left a comment

jadebenn commented Feb 11, 2024

Jettford commented Feb 11, 2024

jadebenn commented Feb 11, 2024 •

edited

Loading

EmosewaMC commented Feb 11, 2024

jadebenn commented Feb 11, 2024 •

edited

Loading

jadebenn commented Feb 11, 2024 •

edited

Loading

jadebenn commented Feb 11, 2024

EmosewaMC commented Feb 13, 2024

refactor: Rewrite AMF and property behavior logic to use smart pointers, references, and string_views over raw pointers and std::string& #1452

Are you sure you want to change the base?

refactor: Rewrite AMF and property behavior logic to use smart pointers, references, and string_views over raw pointers and std::string& #1452

Conversation

jadebenn commented Feb 11, 2024

Jettford left a comment

Choose a reason for hiding this comment

jadebenn left a comment

Choose a reason for hiding this comment

jadebenn commented Feb 11, 2024

Jettford commented Feb 11, 2024

jadebenn commented Feb 11, 2024 • edited Loading

EmosewaMC commented Feb 11, 2024

jadebenn commented Feb 11, 2024 • edited Loading

jadebenn commented Feb 11, 2024 • edited Loading

jadebenn commented Feb 11, 2024

EmosewaMC commented Feb 13, 2024

jadebenn commented Feb 11, 2024 •

edited

Loading

jadebenn commented Feb 11, 2024 •

edited

Loading

jadebenn commented Feb 11, 2024 •

edited

Loading