-
-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
2492 Fixes eyecite CI #140
Conversation
Thanks Alberto. Two things here:
|
@mlissner - yes - very curious |
@mlissner Sure, I'll check why the citations variation happened. About the dependencies, yeah I can test it via Github actions, the problem I had was generating the poetry.lock file on my machine since it failed. So I'll try it installing an x86 Linux virtual machine and see if it works. |
My theory about this, by the way, is that you enabled hyperscan in this PR and it's not enabled on |
@mlissner that would affect speed only, would it not? |
@albertisfu - I've had luck just creating a dockerfile to install and test hyper scan locally. fwiw. |
Hm, the underlying One thing I saw in the hyperscan change log is:
That sort of makes it sound like it's changing how hyperscan works, but it's from 2018-07-09, which is a long time ago. OTOH, the changelog of python-hyperscan says that version 0.3.0 (from late last year):
So.....maybe that's the root cause? Maybe python-hyperscan was using a very old version of hyperscan and finally upgraded to a new version that uses Chimera, which brought more regex compatibility/support? Maybe.... |
Well, in theory, a regex is a regex, and switching between Python's regex engine and hyperscan's wouldn't make for different results. That seemed to be the result in the past, but maybe it's not anymore. I've seen different regex engines support different parts of the standard. I even remember Firefox and Chrome had different support at one point (probably still do, ugh). |
79e8743
to
19a515d
Compare
The Eyecite Report 👁️Gains and LossesThere were 0 gains and 0 losses. Click here to see details.
Time ChartGenerated Files |
Well, the problem was not After multiple tests, the problem seems to be related to In this other PR, I set it to I think the problem here was the poetry.lock file was not updated in a while, the last update was on Dec 27, 2022 and Since the benchmark for Maybe including a Let me know what you think. |
Nice. Well, that's good that it's just our code that had the bug. Figures, I suppose!
Yeah, @flooie do you want to file a bug to take a look at either doing that or just making sure that we upgrade reporters-db when running the eyecite report, so we catch these things more quickly next time? And I guess we need a separate bug for figuring out what happened to the reporter DB that you or Kevin can take care of? |
For the record, I think I was wrong about everything on this issue. :)
Ah well... |
This PR fixes the issues described in #139 and #2492
fast_diff_match_patch
,exrex
was not related to a dependency problem but a CI cache issue. Seems the cache was broken, to solve it I changed the key removing the date-fixed string since now is possible to remove caches from the Actions tab, it would be easier to just remove old caches instead of changing the cache key.Additionally, I had some problems trying to install
eyecite
on my machine I couldn't install it on Mac with ARM or using a Ubuntu ARM virtual machine. The problem seemshyperscan
, maybe an incompatibility oflibhyperscan-dev
on ARM.So I had to install it on my old x86-64 Mac, the problem that I found here was using the last version of Poetry it tried to install the latest version of hyperscan 0.3.x which failed. So I had to downgrade the version to 0.2.0 in
pyproject.toml
not sure if this is only a problem on mac x86-64 due tohyperscan-dev
version for mac or if it's a widespread problem in other SO as well, I suspect it is, since the 1.2.0rc2 version of poetry (that was fixed in actions) installs by default the 0.2.0 version ofhyperscan
, so after fixed thehyperscan
version I removed the poetry1.2.0rc2
version constraint in CI actions, so now it's using the latest version of poetry.I think it might be worth it if somebody on Linux could try to update to the latest version of
hyperscan
and check if the installation succeeds in case we want to use the latest version ofhyperscan
.