Make more client services into `startstop.Service` #262

brandur · 2024-03-10T19:16:11Z

Here, we aim to move in a direction where the client can treat most of
its subservices generically by starting/stopping the notifier as a
generic service, along with converting three other smaller services over
to use the startstop.Service interface:

Client monitor. (This ends up cleaning up its internal code
significantly, and has a major improvement on its test coverage and
robustness.)
The client's statistics logging loop.
The client's leadership change handling loop.

Both the latter two are very small functions, so I implement the new
function startstop.StartStopFunc which similar to river.WorkFunc,
provides an easy way to get a small service using only a function, which
keeps the code quite clean.

It's going to take a few more changes to get our full simplification
change, but you can already see the beginnings of it as many services
can be started and stopped in ~one line:

func (c *Client[TTx]) signalStopComplete(ctx context.Context) {
    // Wait for producers and elector to exit:
    c.wg.Wait()

    // Stop all mainline services where stop order isn't important.
    startstop.StopAllParallel(c.services)

By the end of it, both the elector and producers will also become
services, and we'll be able to fully eliminate the wait group, along
with the stop channels and the roundabout stop functions like
signalStopComplete. The idea is that the client itself will also
become a start/stop service, and itself become protected against being
started multiple times, or potential races where Start/Stop may be
called by multiple goroutines.

brandur · 2024-03-10T19:17:55Z

client.go

+	//
+	// Unlike other services, it's given a background context so that it doesn't
+	// cancel on normal stops.
+	if err := c.monitor.Start(context.Background()); err != nil { //nolint:contextcheck


Won't return an error, but like other services, it has the option to do so.

brandur · 2024-03-10T19:18:46Z

client.go

+			// In case of error, stop any services that might have started. This
+			// is safe because even services that were never started will still
+			// tolerate being stopped.
+			startstop.StopAllParallel(c.services)


We now try and stop anything that started even in the event of a start error, which is kind of nice.

Here, we aim to move in a direction where the client can treat most of its subservices generically by starting/stopping the notifier as a generic service, along with converting three other smaller services over to use the `startstop.Service` interface: * Client monitor. (This ends up cleaning up its internal code significantly, and has a major improvement on its test coverage and robustness.) * The client's statistics logging loop. * The client's leadership change handling loop. Both the latter two are very small functions, so I implement the new function `startstop.StartStopFunc` which similar to `river.WorkFunc`, provides an easy way to get a small service using only a function, which keeps the code quite clean. It's going to take a few more changes to get our full simplification change, but you can already see the beginnings of it as many services can be started and stopped in ~one line: func (c *Client[TTx]) signalStopComplete(ctx context.Context) { // Wait for producers and elector to exit: c.wg.Wait() // Stop all mainline services where stop order isn't important. startstop.StopAllParallel(c.services) By the end of it, both the elector and producers will also become services, and we'll be able to fully eliminate the wait group, along with the stop channels and the roundabout stop functions like `signalStopComplete`. The idea is that the client itself will also become a start/stop service, and itself become protected against being started multiple times, or potential races where `Start`/`Stop` may be called by multiple goroutines.

brandur · 2024-03-10T19:20:07Z

client.go

 		}
-	}
+	}()


This all just moves down from above where it was previously in a closure.

brandur · 2024-03-10T19:21:22Z

client_monitor.go

-
-		shutdownOnce: &sync.Once{},
-		shutdownCh:   make(chan struct{}),
-		doneCh:       make(chan struct{}),


Lets us get rid of a whole bunch of vars, which is nice. We add a StartStopStress test case also, so we're now much more certain that this code works than before.

brandur · 2024-03-10T19:27:40Z

@bgentry I'm sure you're just going to love this one lol, but I promise it's for the greater good.

Follow up changes like #253, #262, and #263 to make the producer a start/stop service, giving it a more predictable way to invoke start and stop, making it safer to run and cleaning up caller code where it's used in the client and test cases. With all these changes taken together we'll have every service in the client using the same unified service interface, which will clean up code and let us write some neat utilities to operate across all of them. Aside from that, we clean up the producer in ways to bring it more inline with other code, like making logging uniform and having the constructor return only a `*producer` instead of `(*producer, error)` that needs to be checked despite an error always being indicative of a bug in this context. We expand the test suite, adding tests like (1) verifying that workers are really stopped when `workCtx` is cancelled, (2) verifying that the max worker slots work as expected and that the producer limits its fetches, and (3) start/stop stress. Like with #263, we give the producer a poll only mode, which also gets the full test barrage using a shared test transaction instead of full database pool. Also like #263, this poll only mode is still prospective and not yet put to full use (although it will be soon).

bgentry · 2024-03-11T15:11:16Z

@bgentry I'm sure you're just going to love this one lol, but I promise it's for the greater good.

Actually wasn't too bad at all :)

brandur · 2024-03-12T00:24:13Z

Actually wasn't too bad at all :)

Hah, okay that's a relief :) Thanks!

Follow up changes like #253, #262, and #263 to make the producer a start/stop service, giving it a more predictable way to invoke start and stop, making it safer to run and cleaning up caller code where it's used in the client and test cases. With all these changes taken together we'll have every service in the client using the same unified service interface, which will clean up code and let us write some neat utilities to operate across all of them. Aside from that, we clean up the producer in ways to bring it more inline with other code, like making logging uniform and having the constructor return only a `*producer` instead of `(*producer, error)` that needs to be checked despite an error always being indicative of a bug in this context. We expand the test suite, adding tests like (1) verifying that workers are really stopped when `workCtx` is cancelled, (2) verifying that the max worker slots work as expected and that the producer limits its fetches, and (3) start/stop stress. Like with #263, we give the producer a poll only mode, which also gets the full test barrage using a shared test transaction instead of full database pool. Also like #263, this poll only mode is still prospective and not yet put to full use (although it will be soon).

) Follow up changes like #253, #262, and #263 to make the producer a start/stop service, giving it a more predictable way to invoke start and stop, making it safer to run and cleaning up caller code where it's used in the client and test cases. With all these changes taken together we'll have every service in the client using the same unified service interface, which will clean up code and let us write some neat utilities to operate across all of them. Aside from that, we clean up the producer in ways to bring it more inline with other code, like making logging uniform and having the constructor return only a `*producer` instead of `(*producer, error)` that needs to be checked despite an error always being indicative of a bug in this context. We expand the test suite, adding tests like (1) verifying that workers are really stopped when `workCtx` is cancelled, (2) verifying that the max worker slots work as expected and that the producer limits its fetches, and (3) start/stop stress. Like with #263, we give the producer a poll only mode, which also gets the full test barrage using a shared test transaction instead of full database pool. Also like #263, this poll only mode is still prospective and not yet put to full use (although it will be soon).

brandur commented Mar 10, 2024

View reviewed changes

client.go

}

}

}()

Copy link

Contributor Author

brandur Mar 10, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This all just moves down from above where it was previously in a closure.

brandur commented Mar 10, 2024

View reviewed changes

brandur force-pushed the brandur-client-services branch from b1a01f5 to 69ef596 Compare March 10, 2024 19:21

brandur requested a review from bgentry March 10, 2024 19:27

brandur mentioned this pull request Mar 11, 2024

Make producer start/stop service + poll-only mode + expanded tests #264

Merged

bgentry approved these changes Mar 11, 2024

View reviewed changes

brandur merged commit ccea9c7 into master Mar 12, 2024
10 checks passed

brandur deleted the brandur-client-services branch March 12, 2024 00:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make more client services into `startstop.Service` #262

Make more client services into `startstop.Service` #262

brandur commented Mar 10, 2024

brandur Mar 10, 2024

brandur Mar 10, 2024

brandur Mar 10, 2024

brandur Mar 10, 2024

brandur commented Mar 10, 2024

bgentry commented Mar 11, 2024

brandur commented Mar 12, 2024

Make more client services into startstop.Service #262

Make more client services into startstop.Service #262

Conversation

brandur commented Mar 10, 2024

brandur Mar 10, 2024

Choose a reason for hiding this comment

brandur Mar 10, 2024

Choose a reason for hiding this comment

brandur Mar 10, 2024

Choose a reason for hiding this comment

brandur Mar 10, 2024

Choose a reason for hiding this comment

brandur commented Mar 10, 2024

bgentry commented Mar 11, 2024

brandur commented Mar 12, 2024

Make more client services into `startstop.Service` #262

Make more client services into `startstop.Service` #262