Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SCHEMA] Add postal codes to the locations field #258

Open
adambuttrick opened this issue Aug 20, 2024 · 1 comment
Open

[SCHEMA] Add postal codes to the locations field #258

adambuttrick opened this issue Aug 20, 2024 · 1 comment
Labels
data model/schema Changes to ROR data model/schema

Comments

@adambuttrick
Copy link
Contributor

Version
v2

Describe the problem you would like to solve
In developing new affiliation matching, OpenAlex found that the inclusion of postal codes associated with the organization's address can significantly improve the confidence scores of new matches for previously unmatched affiliation strings. However, postal codes are not currently included as a field in our location metadata, limiting their ability to leverage ROR for this improvement.

Describe the schema change that you would like in order to solve the problem
OpenAlex proposes adding a "postal_code" field to the ROR schema for institutions. This field would contain the main address postal code for each institution.

Who would benefit from this change?
This change would benefit:

  1. Organizations and services that rely on accurate institutional identification and disambiguation using more granular location metadata, such as bibliometric databases, funding agencies, and research information management systems
  2. Developers working on new affiliation matching systems

Additional information
ROR would need to investigate how to derive for all, most, or the most commonly referenced subset of organizations in our data. We would also need to change our approach to curating location metadata more generally to support. For all records now, the only value we identify and index is the Geonames ID for the city in which the organization is located.

@adambuttrick adambuttrick added the data model/schema Changes to ROR data model/schema label Aug 20, 2024
@adambuttrick
Copy link
Contributor Author

adambuttrick commented Aug 20, 2024

Improvement to affiliation matching is consistent with possible degradation to our affiliation matching as a result of less location information in v2 vs. v1, as reported in #243

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data model/schema Changes to ROR data model/schema
Projects
None yet
Development

No branches or pull requests

1 participant