-
Notifications
You must be signed in to change notification settings - Fork 0
GSoC 2017 Ideas
This page lists the Library.Link project ideas for Google Summer of Code 2017. GSoC is a program run every year by Google where students are paid to work on open source projects during their summer break. If you are a student and interested in taking part then please read the Advice for Students section below. Everybody (not just students) is free to edit this page and add project ideas.
Libraries are publicly funded entities with rich resources to offer their communities. A primary factor in funding for libraries is their ability to show that these resources are actually used to forge connections with users. Today such users seek books, movies, and other information resources through search engines, mobile applications and Web-based services. Libraries are largely invisible in such places due to outdated technical infrastructure. On the other hand, the history of information management in libraries goes back to the 60s, and libraries happen to have some of the highest quality metadata in existence about content, media and more.
Library.Link is a collaboration among library service providers, led by Zepheira, to provide solutions which make library resources visible through Web applications, including search engines. The basis of this project is in open-source projects, including pybibframe (BIBFRAME is a Web-friendly format that's been developed for Library resources). The output of this work, called the Library.Link Network, in its first year (2016) published 100 million+ heavily cross-linked Web pages representing information from the catalogs of 2000+ libraries in 8 countries, including from 28 US states with 3 statewide implementations.
This depth and breadth of newly valuable information newly available on the Web, through the Library.Link Network can benefit to an even greater extent through integration with large-scale information being curated at Wikidata and other Wikimedia projects, and vice versa.
Even though libraries have specialized data management and curation practices, participating students are not expected to be specialized in these areas. The Library.Link project mentors will be able to provide guidance and handy tools in such areas.
Analyze the core BIBFRAME (Bibliographic Framework) for the linked data published on the Library.Link Network (and other library information systems projects), and develop declarative mappings to Wikidata vocabularies. The main focus will be the BIBFRAME Lite dialect.
Optional:
- Annotations to additional dialects of BIBFRAME. to schema.org and across the versions of BIBFRAME)
- Annotations to schema.org
Task profile:
- Required knowledge: data design basics, python
- Nice-to-have knowledge: linked data, RDF
- Difficulty level: intermediate
- Potential mentors: Gloria, Eric
Wikipedia services are special behaviors available through the Wikimedia software to provide specialized information linking and elucidation. An example is https://en.wikipedia.org/wiki/Wikipedia:Book_sources, which allow Wikipedia page authors to merely state an ISBN, and have this become an automated link which provides the user rich information about the book specified by that ISBN ( example ).
Create new services for other identifiers such as ISSN and LCCN, or enhance existing ones, such as connecting additional data from the Library.Link Network to the existing Book Sources service to expand its coverage.
Task profile:
- Required knowledge: php
- Nice-to-have knowledge: python, linked data, RDF, Wikipedia editing experience
- Difficulty level: intermediate
- Potential mentors: Mark, Gloria
Create data migration tools to improve the representation of libraries as organizations and institutions on Wikipedia, through enhanced Wikidata. For example Cuyahoga County Public Library is an entry in Wikipedia/Wikidata, but details such as branch names, locations, and facilities are missing, whereas these are available in Creative Commons licensed Library.Link Network data.
Task profile:
- Required knowledge: python
- Nice-to-have knowledge: php, linked data, RDF, Wikipedia editing experience
- Difficulty level: intermediate
- Potential mentors: Uche, Mark
The above projects are just suggestions --- it is also very good to suggest a project idea of your own if you have something in mind that you want to do.
Anyone can edit this wiki. Use this template to add your own:
### Title of the idea
Write a description of the idea in 3-5 sentences, your "elevator pitch".
- Required knowledge: python
- Difficulty level: intermediate
- Potential mentors:
If you are a student and interested in working on Library.Link Network as part of GSoC, please read the information below, as well as the GSoC program information provided by Google, including the student manual and timeline.
Potential candidates should browse key projects pybibframe and bibframe, and if possible install and test the code. Peruse any issues on those projects as well.
Experience and familiarity with most/all of these:
- Python
- Git and GitHub
- C
- linked data / RDF
General understanding of any of these:
- Data design
- Web architecture
- Wikipedia and Wikidata
Advice on writing a proposal (written with the Mailman project in mind, but generally applicable)
Format and some introductory content taken from AppImageKit Ideas Page
The project of connecting Libraries' rich media, cultural, research and media metadata to the Web is one that promises great benefits for society, but one that also requires early collaboration between the library informatics and general informatics spheres. We believe that the tools and data made available through the Library.Link Network presents a great opportunity for students to gain experience in big data and smart data processing in a pragmatic, immediately-applicable context. We are also excited about the potential to engage students with their fresh perspectives, regardless of their library systems experience. We see significant benefits for all parties in this.