feat: optimize common case of GlobPath #180

letFunny · 2024-12-11T09:58:39Z

Have you signed the CLA?

Results of the benchmark on my machine (using hyperfine with 10 rounds):

$ bash benchmark.sh HEAD main 
Creating rev: e2ee603c7396b33038e47352c0722b5b1202fbfe (main)
Creating rev: 19829794e6454f78334c9a67b74d8abb9bc66b25 (HEAD)
Benchmark 1: ./19829794e6454f78334c9a67b74d8abb9bc66b25 info --release ../chisel-releases/ubuntu-24.04 'python3.12_core'
  Time (mean ± σ):      5.341 s ±  0.115 s    [User: 5.528 s, System: 0.028 s]
  Range (min … max):    5.239 s …  5.635 s    10 runs
 
Benchmark 2: ./e2ee603c7396b33038e47352c0722b5b1202fbfe info --release ../chisel-releases/ubuntu-24.04 'python3.12_core'
  Time (mean ± σ):     18.366 s ±  0.139 s    [User: 19.677 s, System: 0.103 s]
  Range (min … max):   18.219 s … 18.675 s    10 runs
 
Summary
  ./19829794e6454f78334c9a67b74d8abb9bc66b25 info --release ../chisel-releases/ubuntu-24.04 'python3.12_core' ran
    3.44 ± 0.08 times faster than ./e2ee603c7396b33038e47352c0722b5b1202fbfe info --release ../chisel-releases/ubuntu-24.04 'python3.12_core'

Basically, HEAD takes 5.341s on my machine compared to 18.366s for main.
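
As a quick sanity check on the summary line, the speedup and its uncertainty can be recomputed from the two means and standard deviations reported above. This is only a sketch: the `speedup` helper is hypothetical, and it assumes independent measurements and standard first-order error propagation for a ratio, which may not be exactly how hyperfine computes it.

```go
package main

import (
	"fmt"
	"math"
)

// speedup is a hypothetical helper. It returns slowMean/fastMean together
// with an uncertainty estimate, assuming independent measurements and
// first-order error propagation for a ratio.
func speedup(fastMean, fastSD, slowMean, slowSD float64) (ratio, sd float64) {
	ratio = slowMean / fastMean
	relFast := fastSD / fastMean
	relSlow := slowSD / slowMean
	sd = ratio * math.Sqrt(relFast*relFast+relSlow*relSlow)
	return ratio, sd
}

func main() {
	// Means and standard deviations copied from the hyperfine output above.
	ratio, sd := speedup(5.341, 0.115, 18.366, 0.139)
	fmt.Printf("%.2f ± %.2f times faster\n", ratio, sd)
}
```

Under those assumptions the figures agree with the reported 3.44 ± 0.08.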

niemeyer

This looks good, thank you. Can we please just tune the suffix function so it looks like the prefix one, but inverted? See details below.

niemeyer · 2024-12-13T17:52:02Z

internal/strdist/strdist.go

+	ai := strings.LastIndexAny(a, "*?")
+	la := 0
+	if ai != -1 {
+		la = len(a) - ai - 1


These algorithms are exact opposites of one another, so I don't see why they weren't implemented as such.

For example, if we look at the prefix function, the opposite would be:

if ai == -1 { ai = 0 }

Right? Doing something entirely different seems to be more work, and it also adds unnecessary cognitive load.

letFunny · 2024-12-16

I may be thinking of the wrong algorithm, but I do not think the above will work. We use the indexes for the prefix because they coincide with the prefix length, which is what we really want. For the suffixes, in contrast, we want to compare the suffix lengths to get the minimum one and, as far as I know, we cannot do that with indexes without calculating the deltas first, because an index does not correspond to a length. For example:

foo*baz foobar*b

The indexes of the last wildcard are 3 and 6 respectively. If we take the minimum of the positions, I do not see how we arrive at the fact that only the last character of both strings should be checked, because the minimum suffix length is 1.
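
To make the `foo*baz` / `foobar*b` example concrete, here is a minimal sketch of the delta-based suffix check under discussion. The helper names `suffixLen` and `suffixMatch` are hypothetical and are not taken from the PR; the zero default for strings without a wildcard mirrors the diff excerpt above, and the built-in `min` assumes Go 1.21 or later.

```go
package main

import (
	"fmt"
	"strings"
)

// suffixLen is a hypothetical helper. It returns the number of bytes after
// the last wildcard in s, or 0 when s has no wildcard (mirroring the
// default used in the diff excerpt under review).
func suffixLen(s string) int {
	i := strings.LastIndexAny(s, "*?")
	if i == -1 {
		return 0
	}
	return len(s) - i - 1
}

// suffixMatch reports whether the literal tails of a and b agree over the
// shorter of the two suffix lengths.
func suffixMatch(a, b string) bool {
	minl := min(suffixLen(a), suffixLen(b))
	return a[len(a)-minl:] == b[len(b)-minl:]
}

func main() {
	a, b := "foo*baz", "foobar*b"
	// Positions of the last wildcard in each string.
	fmt.Println(strings.LastIndexAny(a, "*?"), strings.LastIndexAny(b, "*?"))
	// Lengths of the literal suffix in each string.
	fmt.Println(suffixLen(a), suffixLen(b))
	// Only the last byte of each string is compared here.
	fmt.Println(suffixMatch(a, b))
}
```

For these inputs the wildcard positions are 3 and 6 while the suffix lengths are 3 and 1, so the comparison covers only the final byte of each string (`z` against `b`).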

niemeyer · 2024-12-16

We must be talking about different things. I'm not suggesting a change in algorithm, but rather just pointing out that the algorithm was laid out differently between the suffix and prefix versions. The actual comparison is exactly the same at the end.

letFunny · 2024-12-16

I am talking about this sentence, for example:

"At first I thought this was wrong, again because I had the prefix logic in mind and didn't realize here it's the minimum of the delta, rather than the minimum of the position"

I cannot make it like the prefix logic, which uses the position, because for suffixes the position != the length of the suffix. That means I have to use the deltas, unless I am missing something.

niemeyer · 2024-12-16

Yes, I think you are missing something.

What's the difference between a[len(a)-minl:] and a[ai:]? :)
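
One possible reading of the question: for a single string, a suffix length and a wildcard position describe the same cut point, since `la = len(a) - ai - 1` implies `a[len(a)-la:] == a[ai+1:]`. The sketch below illustrates only that identity. The function names are hypothetical, and unlike the diff excerpt it applies the formula even when there is no wildcard (`ai == -1`), which is an assumption of this example.

```go
package main

import (
	"fmt"
	"strings"
)

// tailByLength slices the literal tail of s via a suffix length (the delta).
// Hypothetical helper: with no wildcard, i is -1 and l becomes len(s).
func tailByLength(s string) string {
	i := strings.LastIndexAny(s, "*?")
	l := len(s) - i - 1
	return s[len(s)-l:]
}

// tailByIndex slices the same tail directly from the wildcard position.
// Hypothetical helper: with no wildcard, i+1 is 0 and the whole string is kept.
func tailByIndex(s string) string {
	i := strings.LastIndexAny(s, "*?")
	return s[i+1:]
}

func main() {
	for _, s := range []string{"foo*baz", "foobar*b", "foobar"} {
		// Both columns are expected to be identical for every input.
		fmt.Printf("%q %q %q\n", s, tailByLength(s), tailByIndex(s))
	}
}
```

This covers only the per-string cut point. When the minimum length comes from the other string, the slice starts at `len(a)-minl` rather than at `ai+1`.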

niemeyer · 2024-12-13T17:58:01Z

internal/strdist/strdist.go

+	if bi != -1 {
+		lb = len(b) - bi - 1
+	}
+	minl := min(la, lb)


The opposite would be max here. At first I thought this was wrong, again because I had the prefix logic in mind and didn't realize here it's the minimum of the delta, rather than the minimum of the position, per logic above.

niemeyer

I'm moving this on. We can discuss semantics tomorrow in our call for alignment, but this is not important enough right now, and I'd rather have it merged before the holidays.

feat: optimize common case of GlobPath

e943cdd

letFunny requested a review from niemeyer December 11, 2024 09:59

letFunny added 2 commits December 11, 2024 20:27

also optimize general suffix

8b5c59e

docs

1982979

letFunny mentioned this pull request Dec 11, 2024

feat: parallelism to speed up conflict search #171

Closed

1 task

letFunny added the Polish Refactorings, etc label Dec 11, 2024

niemeyer approved these changes Dec 13, 2024

niemeyer approved these changes Dec 16, 2024

niemeyer merged commit 94a1c48 into canonical:main Dec 16, 2024
14 checks passed
