Recompute TTL values on each `get` #34

kppullin · 2021-12-03T23:17:35Z

TTL values must be recomputed on each get action instead
of being fixed to a constant value for the lifetime of the cached object.

Prior to this change, the constant TTL value X would cause kafka-connect
to continually reschedule connector restarts X milliseconds in the future,
effectively ensuring that the connector never actually restarts.

With this change, the reschedule actions have a timestamp that shrinks until
the TTL is reached.

Before (note contant restart time of ~30 mins):

2021-12-03 20:19:65,634 INFO   ||  Scheduling a restart of connector debezium_infra in 1799900 ms   [org.apache.kafka.connect.runtime.WorkerConfigTransformer]
2021-12-03 20:18:25,143 INFO   ||  Scheduling a restart of connector debezium_infra in 1799900 ms   [org.apache.kafka.connect.runtime.WorkerConfigTransformer]

After (note restart time decreases ~40 seconds and the log messages are ~40 seconds apart):

2021-12-03 21:09:24,228 INFO   ||  Scheduling a restart of connector debezium_infra in 2063263 ms   [org.apache.kafka.connect.runtime.WorkerConfigTransformer]
2021-12-03 21:08:45,858 INFO   ||  Scheduling a restart of connector debezium_infra in 2101945 ms   [org.apache.kafka.connect.runtime.WorkerConfigTransformer]

TTL values must be recomputed on each `get` action instead of being fixed to a constant value for the lifetime of the cached object. Prior to this change, the constant TTL value `X` would cause kafka-connect to continually reschedule connector restarts `X` milliseconds in the future, effectively ensuring that the connector never actually restarts. With this change, the reschedule actions have a timestamp that shrinks until the TTL is reached. Before (note contant restart time of ~30 mins): ``` 2021-12-03 20:19:65,634 INFO || Scheduling a restart of connector debezium_infra in 1799900 ms [org.apache.kafka.connect.runtime.WorkerConfigTransformer] 2021-12-03 20:18:25,143 INFO || Scheduling a restart of connector debezium_infra in 1799900 ms [org.apache.kafka.connect.runtime.WorkerConfigTransformer] ``` After (note restart time decreases ~40 seconds and the log messages are ~40 seconds apart): ``` 2021-12-03 21:09:24,228 INFO || Scheduling a restart of connector debezium_infra in 2063263 ms [org.apache.kafka.connect.runtime.WorkerConfigTransformer] 2021-12-03 21:08:45,858 INFO || Scheduling a restart of connector debezium_infra in 2101945 ms [org.apache.kafka.connect.runtime.WorkerConfigTransformer] ```

davidsloan · 2023-03-13T10:49:03Z

Hey @kppullin . Please accept our apologies for leaving this PR sitting here going stale.

I just have one question, and that is how to ensure the automatic scheduling:

2021-12-03 21:09:24,228 INFO || Scheduling a restart of connector debezium_infra in 2063263 ms [org.apache.kafka.connect.runtime.WorkerConfigTransformer]

I'm trying to write tests for some of these scenarios to ensure they work, but have been unable to ensure that the automatic restart ever happens (even with your fixes).

From the documentation:

A value of 'restart' indicates that Connect should restart/reload the connector with the updated configuration properties.The restart may actually be scheduled in the future if the external configuration provider indicates that a configuration value will expire in the future.

Are there any specific configuration you had to supply to ensure the connector was restarted on the TTL change? (Other than the config.action.reload that is referred to above?

kppullin · 2023-03-14T19:16:10Z

@davidsloan since it's been so long I've lost all the details and context on this change. I looked through our configs and do not see anything that'd relate or be required for the modification in this PR to work.

FWIW this change, along with #32, has been working well for a couple years now in our deployed kafka-connect clusters. However, there may be a separate need to refresh the "parent" vault credentials which expire after 32 days by default... this remains to be confirmed and something we may have seen just a couple times as we typically restart our clusters more frequently than 32 days.

davidsloan · 2023-04-05T09:04:51Z

Hi @kppullin I appreciate the feedback above!

I think the majority of these changes will be covered under #47 and #50 but we did not update the Azure secret provider yet. If you would like to rebase your PR for Azure then it will be accepted, otherwise we will try to get around to fixing this functionality in the future.

kppullin · 2023-04-14T17:45:17Z

@davidsloan - I'm unsure when I'll next have time to revisit this (as well as #32 and #33), but would hope to find time in the next few months. We are actiely looking into resolving the issue mentioned above where the parent/login token expires after 32 days in our spring boot services and then will move on to fixing this on the kafka-connect & secret-provider side next, at which point will likely include revisiting the various PRs I've pending.

davidsloan · 2023-10-26T15:32:14Z

Your Azure work has been rebased against the latest code in PR #63 .

Thanks again for the contribution!

kppullin-nt added 3 commits December 3, 2021 15:14

Fix build issues with Azure tests

0f62a81

Set the TTL to None if the duration is 0

3b2d070

kppullin mentioned this pull request Dec 21, 2021

Vault - Support default TTLs #33

Closed

davidsloan added the Rebase requested label Apr 5, 2023

davidsloan mentioned this pull request Oct 26, 2023

Recompute TTL values on each get #63

Merged

davidsloan closed this Oct 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recompute TTL values on each `get` #34

Recompute TTL values on each `get` #34

kppullin commented Dec 3, 2021

davidsloan commented Mar 13, 2023

kppullin commented Mar 14, 2023

davidsloan commented Apr 5, 2023

kppullin commented Apr 14, 2023

davidsloan commented Oct 26, 2023

Recompute TTL values on each get #34

Recompute TTL values on each get #34

Conversation

kppullin commented Dec 3, 2021

davidsloan commented Mar 13, 2023

kppullin commented Mar 14, 2023

davidsloan commented Apr 5, 2023

kppullin commented Apr 14, 2023

davidsloan commented Oct 26, 2023

Recompute TTL values on each `get` #34

Recompute TTL values on each `get` #34