Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do not use results when PubMed has zero results #24

Open
paulalbert1 opened this issue Aug 30, 2018 · 6 comments
Open

Do not use results when PubMed has zero results #24

paulalbert1 opened this issue Aug 30, 2018 · 6 comments
Assignees
Labels

Comments

@paulalbert1
Copy link
Contributor

paulalbert1 commented Aug 30, 2018

Background

ReCiter translates juc2004 into the following query:

(Charles-rawlins J[au] OR Charles-rawlins J[au] OR Charles-rawlins J[au])

This returns zero results using the traditional search, so PubMed "helpfully" searches just for the first initial as you can see here.

See how we're importing 276 records with the first name, last initial search?

Screen Shot 2019-06-01 at 6 17 23 PM

Here's an example of record that's being imported.

Screen Shot 2019-06-01 at 6 19 37 PM

This is the XML.

<eSearchResult>
<Count>264</Count>
<RetMax>1</RetMax>
<RetStart>0</RetStart>
<QueryKey>1</QueryKey>
<WebEnv>NCID_1_144258394_130.14.18.34_9001_1535657066_726671994_0MetA0_S_MegaStore</WebEnv>
<IdList>
<Id>30153686</Id>
</IdList>
<TranslationSet />
<TranslationStack>
<TermSet>
<Term>J[au]</Term>
<Field>au</Field>
<Count>264</Count>
<Explode>N</Explode>
</TermSet>
<OP>GROUP</OP>
</TranslationStack>
<QueryTranslation>J[au]</QueryTranslation>
<ErrorList>
<PhraseNotFound>Charles-rawlins</PhraseNotFound>
</ErrorList>
</eSearchResult>

Requirements

Do not import the resulting records if the attribute QueryTranslation is 5 characters and only 5 characters, and the final 4 characters are [au].

Then, do not import any records.

@sarbajitdutta sarbajitdutta self-assigned this Apr 19, 2019
@sarbajitdutta
Copy link
Contributor

Lets discuss this approach. Is this even needed at this point? @paulalbert1

@paulalbert1
Copy link
Contributor Author

It is. I'm pretty sure the use case I gave above is still relevant.

@paulalbert1
Copy link
Contributor Author

Another example...

Screen Shot 2019-09-27 at 7 35 30 AM

@paulalbert1
Copy link
Contributor Author

Another example...
suggested because alternate name returned no results

@paulalbert1
Copy link
Contributor Author

paulalbert1 commented Jan 19, 2020

Another example...

shouldnt have been looked up -27080591

Another example...

Screen Shot 2021-09-14 at 8 55 05 AM

@paulalbert1
Copy link
Contributor Author

paulalbert1 commented Sep 15, 2021

The alternative is to put names in quotes. This returns zero results, but we may get more false negatives when people are being creative with their names.

Screen Shot 2021-09-15 at 12 04 07 PM

Screen Shot 2021-09-15 at 12 05 53 PM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants