-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Edge stability bot is potentially unstable? #5947
Comments
There's an |
My current guess for #5942 is that it's some JIT bug in Edge around HTMLAllCollection. |
Pretty sure we must be allowing window.open because otherwise the harness wouldn't work. It might be possible to get video out of sauce to investigate these issues. @plh has access, and we might need to enable an option somewhere to cause it to be recorded. |
https://wiki.saucelabs.com/display/DOCS/Viewing+Screenshots%2C+Commands%2C+Logs%2C+and+Metadata+for+Test+Results is the documentatiaon about viewing the video. I think we should make Edge non-blocking for now. |
Can we do this? #5960 is also affected. |
#5942 definitely looks like an Edge bug. #5944 looks like some sort of error or timeout on the sauce side.
I don't believe there's an option to allow the screenshots/video - I think it's just there by default (I remember seeing it during my testing). I'm 👍 on adding both Edge and Safari to allow failures as long as the instability is still addressed. |
It's probably best to make them non-blocking until we have some history to see the rate of failures. However, we will occasionally have flakiness due to bugs in all browsers, and currently the only workaround is a manual merge. Should we have the equivalent of a "I confirm that this flakiness isn't my fault" button (a label?) to allow merging with flaky results in general? |
Super-admins (or whatever the GitHub permission is) like @jgraham can merge with failures. I believe in the past, iit's been a matter of escalating a PR to one of them. I haven't had to deal with that, but watching them go by on IRC, it has appeared to work. However, there may be a better process that could be put into place. |
FWIW, that process is not ideal (often comments on IRC end up not getting addressed) and requiring everyone to rebase because we misconfigured a new tool isn't great either. |
I have the permission bit in question and have been taking care of Chromium exports, but I also don't think it's ideal. If it only happens once a month it'd be OK, but if wpt contributions increase that seems unlikely. |
So I think there are two issues here:
|
I was going to recommend gating on validation from a quorum of the available |
#5942 as far as I can tell was an Edge bug fixed in 15, but Sauce Labs only has 14. :( |
Given that's a fixed bug in an outdated version, it doesn't seem to make sense to require a bug report for that? |
I could repro that in Edge 15 and filed https://developer.microsoft.com/en-us/microsoft-edge/platform/issues/12047069/ |
#5288 has https://travis-ci.org/w3c/web-platform-tests/jobs/233024736 which just timed out despite running a single test |
Got "OK: 9/10, TIMEOUT: 1/10" in #5991 (comment) |
As mentioned in IRC, it's possible that MISSING/TIMEOUT tests are due to the Sauce account limit on concurrent jobs. @plehegar can you check on that or somehow share access with me so I can do some digging? |
Edge is still an issue. We need to fix it in Sauce, see if Browserstack offers a more reliable solution, or set up our own Windows VM for the CI flakiness. |
@bobholt sorry, didn't see your question before. do you still need access to the sauce account? |
@plehegar I believe @lukebjerring is going to be taking on some of my infra responsibilities after next week. Once he's ramped up, I'll ask him to reach out to you to figure this out. |
#6327 is part of this when it comes to reftests (and that's blocked on MS). |
@mattl, do we have enough data in https://pulls.web-platform-tests.org/ to be able to work out how often there's a problem with any kind of run? At some point we want flakiness in all browsers to not go unnoticed, but it's scary to make failures blocking in Travis without knowing how often it happens. Given that we actually don't have any work planned for this soon, I think it's more honest to call this a backlog issue. If people keep noticing, or we have some stats about it, then we should raise the priority again. |
Sauce is gone now, as of #9903, and with it Edge stability runs. |
cc/ @jgraham @bobholt @domenic
#5944 and #5942 have both hit the Edge bot giving somewhat odd unstable results (and stable in every other browser); this and this are the Travis jobs for those.
#5944 I think might be random in Edge down to it trying to open third-party applications (scheme-dependent) and popups (from
window.open
, which we probably don't allow over Sauce?), and then racing as to whether one of those blocks the test harness?#5942 I suspect is just a bug in Edge?
The text was updated successfully, but these errors were encountered: