optimize mvps_push_or_pull #662

lolbinarycat · 2023-12-17T21:32:47Z

this function is a big bottleneck, since it contains several layers of nested loops, and has quite a few allocations in the innermost levels.

i did my best to optimize it, and i'm only confident in about half my changes, although some imprecice testing implied as much as a 10x speedup.

if someone could benchmark this properly and see what changes are worth it, that would be greatly appreciated.

… and readability

lolbinarycat · 2023-12-17T21:38:02Z

apperently the CI uses a different lua version than minetest 5.7

since it lacks table.move.

not sure what to make of that

sfan5 · 2023-12-17T22:01:41Z

table.move was added in Lua 5.3 and appears to be supported as a non-standard extension in LuaJIT.
You can't use it.

Desour · 2023-12-19T17:38:06Z

LuaJIT adds table.move. You can check at load if it exists, and implement add_list without it if it's not there.

Desour · 2023-12-19T17:41:32Z

mesecons_mvps/init.lua

@@ -71,40 +71,49 @@ function mesecon.mvps_get_stack(pos, dir, maximum, all_pull_sticky)
 	local pos_set = {}
 	local frontiers = mesecon.fifo_queue.new()
 	frontiers:add(vector.new(pos))
+	-- micro-optimization: lift local definitions out of loop


It would be new to me that lifting local variable declarations out of their scope is cheaper. May I ask where you read about this?

(Also, imho it hinders code quality.)

this is based on personal testing, and is likely not worth it.

unless LuaJIT automatically does this, a local variable would have to have space for it allocated, which necessarily takes time.

…ove() is missing

numberZero · 2023-12-24T10:53:18Z

mesecons_mvps/init.lua


-	on_mvps_move(moved_nodes)
-
+	on_mvps_move(moved_nodes)	


What was changed here, whitespace?

yeah this must've happened when i was deleting and inserting lines. are random formatting changes considered a major problem?

numberZero · 2023-12-24T12:35:30Z

mesecons_mvps/init.lua

-						if vector.equals(link, np) then
-							frontiers:add(adjpos)
+				-- optimization: don't check blocks that are already part of the stack
+				if not pos_set[minetest.hash_node_position(adjpos)] then


What’s about r==dir?

i don't fully understand, do you mean i should add an additional optimization to skip this code when r==dir, since that block will already have been added to the queue?

i guess that's a missed optimization then, yeah. do you want me to add it?

numberZero · 2023-12-24T12:37:16Z

mesecons_mvps/init.lua

-				end
+			local nodedef = minetest.registered_nodes[nn.name]
+			if nodedef and nodedef.mvps_sticky then
+				local connected = nodedef.mvps_sticky(np, nn)


lolbinarycat · 2023-12-25T17:56:36Z

all this code review is nice, but what would really be helpful is for someone benchmark this to see if the speedup is worth me fixing everything up.

or if someone could point me in the direction of some lua/minetest benchmarking tools i could try to do it myself.

my limited testing was based off of realtime and framerate, both of which can be influenced by other processes.

SmallJoker · 2023-12-27T08:25:01Z

Minetest has an internal profiler:

Settings to enable it: https://github.com/minetest/minetest/blob/335af393f09b3629587f14d41a90ded4a3cbddcd/builtin/settingtypes.txt#L1734-L1762
Chat command to export: https://github.com/minetest/minetest/blob/335af393f09b3629587f14d41a90ded4a3cbddcd/builtin/profiler/init.lua#L45-L46

According to mesecons/actionqueue.lua, the actions are executed each globalstep, thus you should see a difference in one of the mesecons -> globalstep[??] entries after running /profiler save txt (for comparison). I however do not know what the best way to test this optimization would be - 100 pistons with sand on top of them?

lolbinarycat · 2023-12-27T22:22:18Z

Minetest has an internal profiler:
* Settings to enable it: https://github.com/minetest/minetest/blob/335af393f09b3629587f14d41a90ded4a3cbddcd/builtin/settingtypes.txt#L1734-L1762

* Chat command to export: https://github.com/minetest/minetest/blob/335af393f09b3629587f14d41a90ded4a3cbddcd/builtin/profiler/init.lua#L45-L46
According to mesecons/actionqueue.lua, the actions are executed each globalstep, thus you should see a difference in one of the mesecons -> globalstep[??] entries after running /profiler save txt (for comparison). I however do not know what the best way to test this optimization would be - 100 pistons with sand on top of them?

personally i used stacks of slimeblocks on top of sticky pistons connected to a 1-tick clock.

lolbinarycat added 9 commits December 17, 2023 14:57

add profiling debug prints

00359e7

don't check for sticky blocks in locations that are already being pushed

ba9b250

more optimization of inner loops related to mvps_push

bc9ed16

micro-optimization: eliminate index in inner loop

a7c8146

add fifo_queue.add_list() and use it to eliminate an inner loop

5be0ee9

store result of table lookup in local variable, improving performance…

43a568d

… and readability

mess with locals and hide debug prints

22eae4b

cleanup

068da0c

remove profiling stuff

4c27c65

Desour reviewed Dec 19, 2023

View reviewed changes

lolbinarycat added 2 commits December 19, 2023 15:01

revert local-variable lifting

79214b5

add fallback implementation for fifo_queue.add_list() in case table.m…

ab2df3c

…ove() is missing

numberZero reviewed Dec 24, 2023

View reviewed changes

lolbinarycat closed this Mar 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimize mvps_push_or_pull #662

optimize mvps_push_or_pull #662

lolbinarycat commented Dec 17, 2023

lolbinarycat commented Dec 17, 2023

sfan5 commented Dec 17, 2023

Desour commented Dec 19, 2023

Desour Dec 19, 2023 •

edited

Loading

lolbinarycat Dec 19, 2023

numberZero Dec 24, 2023

lolbinarycat Dec 25, 2023

numberZero Dec 24, 2023

lolbinarycat Dec 25, 2023

numberZero Dec 24, 2023

lolbinarycat commented Dec 25, 2023

SmallJoker commented Dec 27, 2023 •

edited

Loading

lolbinarycat commented Dec 27, 2023

optimize mvps_push_or_pull #662

optimize mvps_push_or_pull #662

Conversation

lolbinarycat commented Dec 17, 2023

lolbinarycat commented Dec 17, 2023

sfan5 commented Dec 17, 2023

Desour commented Dec 19, 2023

Desour Dec 19, 2023 • edited Loading

Choose a reason for hiding this comment

lolbinarycat Dec 19, 2023

Choose a reason for hiding this comment

numberZero Dec 24, 2023

Choose a reason for hiding this comment

lolbinarycat Dec 25, 2023

Choose a reason for hiding this comment

numberZero Dec 24, 2023

Choose a reason for hiding this comment

lolbinarycat Dec 25, 2023

Choose a reason for hiding this comment

numberZero Dec 24, 2023

Choose a reason for hiding this comment

lolbinarycat commented Dec 25, 2023

SmallJoker commented Dec 27, 2023 • edited Loading

lolbinarycat commented Dec 27, 2023

Desour Dec 19, 2023 •

edited

Loading

SmallJoker commented Dec 27, 2023 •

edited

Loading