-
-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue 30 find partial page citations #116
Issue 30 find partial page citations #116
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wow, this looks simple enough, thanks Matt!
@jcushman do you have time to take a look and chime in before we merge?
(A couple questions/comments in the review.)
See my thought about this above.
I forget how we use this hash. Can you elaborate and maybe that help me form an opinion?
Not that I know of. @flooie have you seen anything weird in the wild? |
It's primarily for |
Not with anything well informed. :) For Id. cites I imagine they tend to also give a parallel cite with page numbers, e.g. S.Ct. or a docket number, and you could cite to that? Like "Foo v. Bar, 1 S.Ct. 3, 4 U.S. __ (2000). ... Id. at (something)". I'm not sure how that's handled, though. It wouldn't be a terrible idea to grep through for 2 or 3 underscores and see how these are actually used. Possibly one answer is there's always a parallel cite that's better for resolution of the citation. |
Sounds good to me. |
I see you working, but let me know when this is ready for another review. |
I found this to be sometimes the case but often the Id. cite is just
Hashability changes done in 08b9625. N.B. I also had to make a change to the joke cite (5ee4145) because it was causing an issue with the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A couple little things, but looks close.
very nice, thank you! |
This enables
eyecite
to find citations that don't yet have a final page number, but just have a placeholder___
space instead. This is common with theU.S.
reporter. This simple parsing change is the first half of addressing #30; figuring out how to match these partial citations will have to be done in CL.Some other things I thought of but didn't include:
has_proper_page_number
or something? This would beFalse
if the page number is all underscores,True
otherwise. I wasn't sure if this would actually be helpful or not.1 U.S. __
to hash equivalently to1 U.S. ___
(identical expect one "placeholder page" has more underscores than the other)? Or should neither hash to the other at all, since maybe these are two completely different citations that just happen to have the same volume and reporter?