Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] IndexActionIT tests are flaky Shard [test][0] is still locked after 5 sec waiting #12408

Open
peternied opened this issue Feb 21, 2024 · 2 comments
Assignees
Labels
bug Something isn't working flaky-test Random test failure that succeeds on second run Indexing:Replication Issues and PRs related to core replication framework eg segrep Storage Issues and PRs relating to data and metadata storage

Comments

@peternied
Copy link
Member

Describe the bug

Seeing test failures due to java.lang.AssertionError: Shard [test][0] is still locked after 5 sec waiting

See test report: https://build.ci.opensearch.org/job/gradle-check/33973/testReport/

Test Name Duration Age
org.opensearch.indexing.IndexActionIT.testAutoGenerateIdNoDuplicates {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 4 min 8 sec 1
org.opensearch.indexing.IndexActionIT.testInvalidIndexName {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 5.4 sec 1
org.opensearch.indexing.IndexActionIT.testCreateIndexWithLongName {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 7.2 sec 1
org.opensearch.indexing.IndexActionIT.testCreatedFlag {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 9.2 sec 1
org.opensearch.indexing.IndexActionIT.testCreatedFlagWithExternalVersioning {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 9.4 sec 1
org.opensearch.indexing.IndexActionIT.testCreateFlagWithBulk {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 8.5 sec 1
org.opensearch.indexing.IndexActionIT.testDocumentWithBlankFieldName {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 7.6 sec 1
org.opensearch.indexing.IndexActionIT.testCreatedFlagWithFlush {p0={"cluster.indices.replication.strategy":"SEGMENT"}} 8 sec 1

Related component

Build

To Reproduce

REPRODUCE WITH: ./gradlew ':server:internalClusterTest' --tests "org.opensearch.indexing.IndexActionIT" -Dtests.method="testAutoGenerateIdNoDuplicates {p0={"cluster.indices.replication.strategy":"SEGMENT"}}" -Dtests.seed=6CB4AD2130F2C392 -Dtests.security.manager=true -Dtests.jvm.argline="-XX:TieredStopAtLevel=1 -XX:ReservedCodeCacheSize=64m" -Dtests.locale=en-PH -Dtests.timezone=Pacific/Yap -Druntime.java=21

Expected behavior

Tests are reliable

Additional Details

No response

@peternied peternied added bug Something isn't working untriaged flaky-test Random test failure that succeeds on second run Indexing Indexing, Bulk Indexing and anything related to indexing labels Feb 21, 2024
@github-actions github-actions bot added the Build Build Tasks/Gradle Plugin, groovy scripts, build tools, Javadoc enforcement. label Feb 21, 2024
@mgodwan mgodwan added Indexing:Replication Issues and PRs related to core replication framework eg segrep and removed Indexing Indexing, Bulk Indexing and anything related to indexing labels Feb 21, 2024
@mgodwan
Copy link
Member

mgodwan commented Feb 21, 2024

@mch2 @dreamer-89 Could you please check this?

@mgodwan mgodwan removed the untriaged label Feb 21, 2024
@mch2
Copy link
Member

mch2 commented Mar 15, 2024

Took a look here - this should get fixed with #11977. There is a race on shard shutdown causing some file handles to be left open. With WindowsFS this will throw an error. Will clean up the PR and get it in asap.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working flaky-test Random test failure that succeeds on second run Indexing:Replication Issues and PRs related to core replication framework eg segrep Storage Issues and PRs relating to data and metadata storage
Projects
Status: 🆕 New
Development

No branches or pull requests

4 participants