fix: Reduce memory usage when publishing prediction log to kafka #525
Description
We've seen a gradual increase in memory usage when model observability is enabled for a model.
Our first hypothesis was that this is caused by asyncio, because we pass the prediction input and output to an async function. To test this, we reduced the sampling rate to 0: since the async function being called publishes to Kafka, setting the sampling rate to 0 isolates the asyncio overhead on its own. After setting the sampling rate to 0, memory usage was stable and there was no gradual increase.
Since the first hypothesis was not correct, our new hypothesis was that the increase comes from publishing the data to Kafka, so we ran a memory profiler against the model.
PS: We use memray as the profiler: https://github.com/bloomberg/memray
The profile shows that memory usage keeps increasing and that producing messages to Kafka contributes to this.
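For reference, a minimal sketch of how the publishing path could be profiled with memray's Python API; the output file name and the workload driver function here are illustrative, not part of this change:

```python
# Illustrative only: capture an allocation profile of the publishing path with
# memray's Python API, then inspect it with `memray flamegraph <output file>`.
import memray

with memray.Tracker("prediction_log_publish.bin"):
    run_prediction_logging_workload()  # hypothetical driver for the publish path
```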
Modifications
To solve this problem, the Kafka producer must call poll after publishing a message. This is necessary so that the ack buffer from the producer is drained and memory usage won't gradually increase, ref: 1, 2

After the changes
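For illustration, the produce-then-poll pattern described above looks roughly like the sketch below, assuming the publisher uses the confluent-kafka Producer; the broker address, topic name, and function name are illustrative and not taken from this PR:

```python
# Minimal sketch (not the exact production code): the key change is calling
# poll() after every produce() so that delivery reports are served and the
# producer's internal buffer is drained.
from confluent_kafka import Producer

producer = Producer({"bootstrap.servers": "localhost:9092"})  # illustrative config

def publish_prediction_log(payload: bytes, topic: str = "prediction-log") -> None:
    # Asynchronously enqueue the message for delivery.
    producer.produce(topic, value=payload)
    # Serve delivery callbacks without blocking. Without this call the queue of
    # pending delivery reports keeps growing, which shows up as a gradual
    # memory increase in the publishing process.
    producer.poll(0)
```

On shutdown, producer.flush() can additionally be called so that any messages still queued in the producer are delivered before the process exits.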
Tests
Checklist
Release Notes