Implement get_node with a get_node_raw #14384

Desour · 2024-02-18T16:21:26Z

Goal: Improve performance of get_node.
How: get_node is implemented in builtin with a get_node_raw(x, y, z) -> content, param1, param2, pos_ok.
Why is this beneficial: In master, get_node calls lua functions to push nodes, and to read vectors. This is much faster if done in lua.

Flamegraphs:
master:

PR:

Rough benchmark results:
master: 450-850 ms
PR: around 270-300 ms (it's much more persistent, idk why)

Benchmark function adopted from #14225 by @sfence. (Added you as co-author. :))

To do

This PR is a Ready for Review.

How to test

/bench_bulk_get_node
TODO: Do we have unit tests for get_node and get_node_or_nil?

Here's some instructions I've noted down at some point to disable cpu scaling, for better testing:

info:
	https://www.kernel.org/doc/html/v5.11/admin-guide/pm/cpufreq.html
	https://wiki.archlinux.org/index.php/CPU_frequency_scaling


check:
	sudo cpupower frequency-info
	watch grep \"cpu MHz\" /proc/cpuinfo


cpu-scaling off:
	sudo sh -c "echo 0 > /sys/devices/system/cpu/cpufreq/boost"
	sudo cpupower frequency-set -g userspace
	sudo cpupower frequency-set -f 1600MHz

cpu-scaling on:
	sudo cpupower frequency-set -g schedutil
	sudo sh -c "echo 1 > /sys/devices/system/cpu/cpufreq/boost"

Co-authored-by: SFENCE <sfence.software@gmail.com>

src/script/lua_api/l_env.cpp

src/script/lua_api/l_env.h

Desour · 2024-02-23T19:53:30Z

Added the comments.

sfan5

LGTM otherwise

builtin/common/item_s.lua

SmallJoker

	Time / ms	PR / ms
	314.16	79.64
	290.42	81.25
	254.11	80.21
	193.72	80.7
	184.71	80.32
	329.17	80.05
	251.46	80.42
	301.68	79.2
	197.04	79.23
	197.5	80.82

average	251.4	80.2
MEDIAN	252.8	80.3

I find it strange that there's this much difference. pushnode already uses Lua functions to construct the table from arguments (core.set_push_node). So apparently by moving the table constructor after "returning" from C++ makes such big difference? Why's that?

Desour · 2024-03-02T16:04:31Z

In master we call lua functions from C++ to read vector and push node values. The C->Lua call overhead is probably quite big. And also we always need to look up the functions to call first, which requires a bunch of Lua C api calls.

SmallJoker

Works well. Cannot complain.

Desour · 2024-03-03T15:35:05Z

Benchmark function adopted from #14225 by @sfence. (Added you as co-author. :))

Somehow that co-authorship didn't make it into the commit. Did github squash it away?

SmallJoker · 2024-03-03T18:23:59Z

Sorry @Desour (or rather @sfence ). I totally missed that remark and removed it alongside the other commit titles upon merge. GitHub is not to blame.

Desour added @ Script API Performance labels Feb 18, 2024

Devtest: Add /bench_bulk_get_node

7ef4fc3

Co-authored-by: SFENCE <sfence.software@gmail.com>

Desour mentioned this pull request Feb 18, 2024

Implement core.get_node() in Lua so that it can be optimized by LuaJIT #12438

Closed

Desour force-pushed the get_node_pushnodeinlua branch from 2f2bbd4 to 8bd6f5f Compare February 18, 2024 16:37

Implement get_node with a get_node_raw

bee78f4

Desour force-pushed the get_node_pushnodeinlua branch from 8bd6f5f to bee78f4 Compare February 18, 2024 16:52

sfan5 reviewed Feb 21, 2024

View reviewed changes

src/script/lua_api/l_env.cpp Show resolved Hide resolved

sfan5 reviewed Feb 21, 2024

View reviewed changes

src/script/lua_api/l_env.h Show resolved Hide resolved

Zughy added the Action / change needed Code still needs changes (PR) / more information requested (Issues) label Feb 21, 2024

sfan's comments

d52de15

Desour removed the Action / change needed Code still needs changes (PR) / more information requested (Issues) label Feb 23, 2024

sfan5 approved these changes Feb 23, 2024

View reviewed changes

builtin/common/item_s.lua Outdated Show resolved Hide resolved

sfan5 added the One approval ✅ ◻️ label Feb 23, 2024

SmallJoker self-requested a review February 24, 2024 12:06

move it to item.lua

45d6930

SmallJoker reviewed Mar 2, 2024

View reviewed changes

SmallJoker approved these changes Mar 2, 2024

View reviewed changes

SmallJoker added >= Two approvals ✅ ✅ and removed One approval ✅ ◻️ labels Mar 2, 2024

SmallJoker merged commit d4d4712 into minetest:master Mar 3, 2024
15 checks passed

Desour deleted the get_node_pushnodeinlua branch March 3, 2024 15:09

Desour mentioned this pull request Mar 3, 2024

Add bulk_get_node function #14225

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement get_node with a get_node_raw #14384

Implement get_node with a get_node_raw #14384

Desour commented Feb 18, 2024 •

edited

Desour commented Feb 23, 2024

sfan5 left a comment

SmallJoker left a comment •

edited

Desour commented Mar 2, 2024

SmallJoker left a comment

Desour commented Mar 3, 2024

SmallJoker commented Mar 3, 2024 •

edited

Implement get_node with a get_node_raw #14384

Implement get_node with a get_node_raw #14384

Conversation

Desour commented Feb 18, 2024 • edited

To do

How to test

Desour commented Feb 23, 2024

sfan5 left a comment

Choose a reason for hiding this comment

SmallJoker left a comment • edited

Choose a reason for hiding this comment

Desour commented Mar 2, 2024

SmallJoker left a comment

Choose a reason for hiding this comment

Desour commented Mar 3, 2024

SmallJoker commented Mar 3, 2024 • edited

Desour commented Feb 18, 2024 •

edited

SmallJoker left a comment •

edited

SmallJoker commented Mar 3, 2024 •

edited