
Trino Hudi connector gets stuck while attempting to read empty partitions of a partitioned Hudi table #19506

Open
uroell opened this issue Oct 24, 2023 · 2 comments


uroell commented Oct 24, 2023

Hi,

After some discussion in the Hudi Slack channel (see https://app.slack.com/client/T4D7BR6T1/C4D716NPQ/thread/C4D716NPQ-1695733714.877639), I am opening this issue to track the problem.

Some information about the issue we are facing at the moment:

We have a partitioned Hudi table in which certain partitions have no data files in S3. If we delete all of these "empty" partitions, so that the Hive Metastore no longer holds any metadata about them, the query runs successfully.
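To illustrate what "empty" means here, below is a minimal sketch of how such partitions could be identified from an object listing. It is not part of Trino or Hudi. The function name `find_empty_partitions`, the sample keys, and the rule that a data file is any object ending in `.parquet` (i.e. a copy-on-write table with Parquet base files) are all assumptions made for illustration.

```python
from typing import Iterable, List


def find_empty_partitions(
    partition_prefixes: Iterable[str],
    object_keys: Iterable[str],
    data_suffix: str = ".parquet",  # assumption: Parquet base files only
) -> List[str]:
    """Return the partition prefixes that contain no object ending in data_suffix."""
    keys = list(object_keys)
    empty = []
    for prefix in partition_prefixes:
        # Add a trailing slash so "day=1" does not also match "day=10/...".
        normalized = prefix.rstrip("/") + "/"
        has_data = any(
            key.startswith(normalized) and key.endswith(data_suffix)
            for key in keys
        )
        if not has_data:
            empty.append(prefix)
    return empty


# Hypothetical listing: the first partition holds only a metadata marker file.
partitions = [
    "exampletable/happeneddayde=2023-09-18",
    "exampletable/happeneddayde=2023-09-19",
]
keys = [
    "exampletable/happeneddayde=2023-09-18/.hoodie_partition_metadata",
    "exampletable/happeneddayde=2023-09-19/.hoodie_partition_metadata",
    "exampletable/happeneddayde=2023-09-19/part-0001.parquet",
]
print(find_empty_partitions(partitions, keys))
```

In practice the partition prefixes would come from the Hive Metastore and the keys from an S3 listing; both are passed in as plain lists here so the sketch stays independent of any client library.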

We run a simple "select count(*) from exampletable", which gets stuck and seems to run forever. For comparison, "select count(*) from exampletable where happeneddayde = '2023-09-19'" returns the expected result right away. Here happeneddayde is a partition key, and for this particular day there is data in S3. The Trino dashboard shows the stuck query as blocked.

Here is the query json for the blocked/stuck query:

20231017_131611_00008_r7bcz_anonymized.json

@nfarah86

roping in @codope

@codesorcery
Contributor

An update on @uroell's report (we work in the same department):
#20151 fixes the problem of Trino failing on empty Hudi partitions for us.
#20027 should additionally ensure that Trino no longer gets stuck when such errors occur.
