
Document backend_config.ini #623

Closed
matsduf opened this issue Jun 29, 2020 · 9 comments · Fixed by #684
Labels
A-Documentation Area: Documentation only.
Milestone

Comments

@matsduf
Contributor

matsduf commented Jun 29, 2020

Not all settings in backend_config.ini are documented in Configuration.md, e.g. ZONEMASTER.force_hash_id_use_in_API_starting_from_id is not.

@matsduf matsduf added the A-Documentation Area: Documentation only. label Jun 29, 2020
@matsduf matsduf added this to the v2020.1 milestone Jun 29, 2020
@matsduf matsduf modified the milestones: v2020.1, v2020.2 Sep 21, 2020
@ghost ghost linked a pull request Dec 21, 2020 that will close this issue
@matsduf matsduf modified the milestones: v2020.2, v2021.1 Feb 9, 2021
@matsduf matsduf assigned ghost Apr 15, 2021
@ghost

ghost commented Apr 26, 2021

There are two remaining keys to document:

  • number_of_processes_for_frontend_testing
  • number_of_processes_for_batch_testing

Their names suggest that these keys limit the number of processes per usage type.

However, looking at the current code, I can't find anything implementing such a per-type limitation. What I did find is the following usage, where both values are added together to compute the maximum total number of processes.

https://github.com/zonemaster/zonemaster-backend/blob/5b485f7f88caa545150e011fdb835b7767941b30/lib/Zonemaster/Backend/Config.pm#L359..L410

sub new_PM {
    my $self = shift;

    my $maximum_processes = $self->NumberOfProcessesForFrontendTesting() + $self->NumberOfProcessesForBatchTesting();
    ...
    my $pm = Parallel::ForkManager->new( $maximum_processes );
    ...
}
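In effect, the two settings only feed one shared cap, so frontend and batch jobs compete for the same pool. A minimal Python sketch of that behavior (names and values are illustrative, not the actual backend code):

```python
# Illustrative values for the two backend_config.ini keys.
number_of_processes_for_frontend_testing = 20
number_of_processes_for_batch_testing = 20

# The only limit actually enforced is the sum, as in Config.pm's new_PM.
maximum_processes = (number_of_processes_for_frontend_testing
                     + number_of_processes_for_batch_testing)

def can_start_worker(running):
    """A new worker may start as long as the shared pool has a free slot.
    The job type (frontend or batch) plays no role in the decision."""
    return len(running) < maximum_processes

# 40 batch jobs alone can fill the whole pool, starving frontend jobs:
running = ["batch"] * 40
print(can_start_worker(running))  # False: pool is full even with 0 frontend jobs
```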

I then tried to find code that limits the creation of processes for each usage type (frontend or batch), without success.

Going through the git history I was able to find code where the limitation was explicit. This is quite old, but it is from around the time when NumberOfProfessesForFrontendTesting and NumberOfProfessesForBatchTesting (sic) were added (4df3d9b):

https://github.com/zonemaster/zonemaster-backend/blob/4df3d9bd8ac43e8ddfccf4df7aca9b7ed2acc699/JobRunner/execute_tests.pl#L68..L84

sub can_start_new_worker {
	my ($priority, $test_id) = @_;
	my $result = 0;
	
	my @nb_instances = split(/\n+/, `ps -ef | grep "execute_zonemaster_P$priority.pl" | grep -v "sh -c" | grep -v grep | grep -v tail`);
	my @same_test_id = split(/\n+/, `ps -ef | grep "execute_zonemaster_P$priority.pl $test_id " | grep -v "sh -c" | grep -v grep | grep -v tail`);
	
	my $max_slots = 0;
	if ($priority == 5) {
		$max_slots = $batch_slots;
	}
	elsif ($priority == 10) {
		$max_slots = $frontend_slots;
	}
	
	$result = 1 if (scalar @nb_instances < $max_slots && !@same_test_id);
}

if (can_start_new_worker($priority, $h->{id})) {
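For contrast, the old per-type limit can be paraphrased like this (a Python sketch of the shell-based check above; the slot values are hypothetical):

```python
# Hypothetical paraphrase of the old per-type check in execute_tests.pl.
BATCH_SLOTS = 20     # limit for priority 5 (batch) workers
FRONTEND_SLOTS = 20  # limit for priority 10 (frontend) workers

def can_start_new_worker(priority, test_id, running):
    """running: list of (priority, test_id) pairs for workers currently alive."""
    same_priority = [r for r in running if r[0] == priority]
    same_test = [r for r in running if r == (priority, test_id)]
    if priority == 5:
        max_slots = BATCH_SLOTS
    elif priority == 10:
        max_slots = FRONTEND_SLOTS
    else:
        max_slots = 0
    # A worker starts only if its own type has a free slot and the
    # same test is not already running.
    return len(same_priority) < max_slots and not same_test

# Batch workers filling all batch slots do not block a frontend worker:
running = [(5, f"batch-{i}") for i in range(20)]
print(can_start_new_worker(5, "batch-new", running))   # False: batch slots full
print(can_start_new_worker(10, "frontend-1", running)) # True: frontend slots free
```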

So it looks like execute_tests.pl did apply the limit per process type. That file is no longer in the codebase, and it seems that its replacement has not kept the separate logic for frontend and batch processes (see commit 55f72d9):

https://github.com/zonemaster/zonemaster-backend/blob/55f72d9ead3ab0e06f81b4a9d390d0972ebe14de/script/zm_wb_daemon#L42..L44

my $maximum_processes =
  Zonemaster::WebBackend::Config->NumberOfProfessesForFrontendTesting() +
  Zonemaster::WebBackend::Config->NumberOfProfessesForBatchTesting();

Unless I am missing something, the idea of limiting the number of processes per usage type has not been kept in the code. So I suggest we decide what to do about batch processing as proposed in #743.

The best I can come up with at this point is to document that these two values are added together and that the sum defines the total number of allowed processes. Despite their names, they do not limit the number of processes per type (frontend or batch).

@ghost

ghost commented Apr 26, 2021

@matsduf and @mattias-p, can you have a look at my comment above? (I don't know whether editing a comment to add @-mentions triggers a notification on your side.)

@matsduf
Contributor Author

matsduf commented Apr 26, 2021

I think it would be fine to deprecate both number_of_processes_for_frontend_testing and number_of_processes_for_batch_testing, and replace them with a single new key with a shorter name for limiting the number of processes. We should also set a reasonable default value for it, so that the key can be left out of the default config.ini. I guess 10 could be such a value.

I think it is unclear, however, what will happen when you reach the limit.

Before we do anything about #743 we should contact RIPE that we could be using the batch function.
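To make the suggested replacement concrete, a hypothetical backend_config.ini fragment (the key name and default value are illustrative only, not an agreed design):

```ini
[ZONEMASTER]
; Hypothetical single key replacing the two per-type keys.
; Total number of concurrent test agent worker processes.
; With a sensible default (e.g. 10), the key could be omitted
; from the shipped config.ini entirely.
number_of_processes = 10
```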

@mattias-p
Member

I think it would be fine to deprecate both number_of_processes_for_frontend_testing and number_of_processes_for_batch_testing, and replace that with a new key with a shorter name for limiting the number of processes.

I agree with this. If you want two test agent worker pools it seems more reasonable to just run two test agent daemons.

We should also set a reasonable default value on that so that the default config.ini can be without the key. I guess 10 could be such a value.

I tried to look around to see what other people use. In one page of search engine hits I found two examples. The first one, Gunicorn, defaults to 1 worker. The second one, parpool (a MATLAB parallelization feature, as far as I can tell), defaults to one worker per physical core.

Having a default value of 1 worker for the test agent makes sense to me. One worker per core would also make sense in a way, if CPU utilization during a test were close to 100%; but from watching top during a single test I saw a CPU utilization of around 50%. Another thing to consider is that if the test agent workers hog too much CPU, the rpcapi workers would also be affected.

I think it is unclear, however, what will happen when you reach the limit.

Unclear in what way? Do you mean that there might be bugs in Parallel::ForkManager?

@matsduf
Contributor Author

matsduf commented Apr 26, 2021

I think it is unclear, however, what will happen when you reach the limit.

Unclear in what way? Do you mean that there might be bugs in Parallel::ForkManager?

Well, unclear to me, I guess. :-)

@mattias-p
Member

mattias-p commented Apr 27, 2021

The way I read the design, Parallel::ForkManager grows its worker pool until it reaches the given limit. If new test requests come in at a higher rate than the worker pool can handle, they start queuing up. Each test request has a priority field that is used when consuming the queue. If more than one test request has the same priority value, they are consumed in FIFO order.
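That consumption order can be sketched with a priority queue where a monotonically increasing counter breaks ties, giving FIFO order within a priority level (a self-contained Python sketch, not the actual backend code; the backend reads its queue from the database):

```python
import heapq
import itertools

counter = itertools.count()  # tie-breaker: preserves insertion order
queue = []

def enqueue(priority, test_id):
    # heapq pops the smallest tuple first, so negate the priority to
    # consume higher priority values first; the counter makes equal
    # priorities come out in FIFO order.
    heapq.heappush(queue, (-priority, next(counter), test_id))

def dequeue():
    _, _, test_id = heapq.heappop(queue)
    return test_id

# Priority 10 (frontend) beats priority 5 (batch); equal priorities are FIFO:
enqueue(10, "frontend-1")
enqueue(5, "batch-1")
enqueue(10, "frontend-2")

print([dequeue() for _ in range(3)])
# → ['frontend-1', 'frontend-2', 'batch-1']
```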

@matsduf
Contributor Author

matsduf commented Apr 27, 2021

@mattias-p, thank you!

@ghost

ghost commented Mar 23, 2022

I think that all settings are now properly documented. Can we close this issue?

@matsduf
Contributor Author

matsduf commented Mar 23, 2022

Fine to close. If we later find something that is not properly documented, it is better to open a new issue and specify what is missing.

@matsduf matsduf closed this as completed Mar 23, 2022