Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vcl_vrt: Skip VCL execution if the client is gone (6.0) #4006

Merged
merged 6 commits into from
Oct 18, 2023

Conversation

dridi
Copy link
Member

@dridi dridi commented Oct 18, 2023

This is a port of #3998 with b4c5030 folded in for the 6.0 branch.

Not sure why 6fbb937 does not originate
from trunk.
It was particularly hard to follow once we reach client c3.
The goal is for top-level transports to report whether the client is
still present or not.
Once a client is reportedly gone, processing its VCL task(s) is just a
waste of resources. The execution of client-facing VCL is intercepted
and an artificial return(fail) is returned in that scenario.

Thanks to the introduction of the universal return(fail) proper error
handling and resource tear down is already in place, which makes this
change safe modulus unknown bugs. This adds a circuit breaker anywhere
in the client state machine where there is VCL execution.

A new Reset time stamp is logged to convey when a task does not complete
because the client is gone. This is a good complement to the walk away
feature and its original circuit breaker for the waiting list, but this
has not been integrated yet.

While the request is technically failed, it won't increase the vcl_fail
counter, and a new req_reset counter is incremented. This new behavior
is guarded by a new vcl_req_reset feature flag, enabled by default.

Refs varnishcache#3835
Refs 61a15cb
Refs e5efc2c
Refs ba54dc9
Refs 6f50a00
Refs b881699

Conflicts:
	bin/varnishd/cache/cache_vrt_vcl.c
	bin/varnishd/cache/cache_vcl_vrt.c
	bin/varnishd/mgt/mgt_param_bits.c
	include/tbl/params.h
The error check is not performed in a critical section to avoid
contention, at the risk of not seeing the error until the next
transport poll.

Conflicts:
	bin/varnishd/http2/cache_http2_session.c
@dridi
Copy link
Member Author

dridi commented Oct 18, 2023

Upon inspection, most of the failing tests were already registered in #4002 and a couple more very slow tests were added.

One failing test is related to the change set and needs further inspection.

@dridi
Copy link
Member Author

dridi commented Oct 18, 2023

The failing test that got me worried is the result of a missing back-port in #4004 of a patch that came after #3661. So I ended up troubleshooting the same bug twice with the same diagnostic, but I didn't even remember that I had already fixed it...

I pushed the missing back-port here and this should be ready to merge once CI completes.

@dridi
Copy link
Member Author

dridi commented Oct 18, 2023

This time, all of the failing tests were already registered in #4002 as slow, ready to merge.

@dridi dridi merged commit e66c57c into varnishcache:6.0 Oct 18, 2023
@dridi dridi deleted the pull_3998_6.0 branch October 18, 2023 15:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant