Special Session (90 Min)

Linked Data for Production

Title: Linked Data for Production (LD4P): Technical services workflow evolution through Tracer Bullets
Day/Time: Thursday, 1:30-3:00
Venue: TBA

Abstract: Linked Data for Production (LD4P) is a Mellon-supported collaboration between six institutions (Columbia, Cornell, Harvard, Library of Congress, Princeton, and Stanford) to begin the transition of technical services production workflows to ones based in Linked Open Data (LOD). This first phase of the transition focuses on the development of the ability to produce metadata as LOD communally, the enhancement of the BIBFRAME ontology to encompass the multiple resource formats that academic libraries must process, and the engagement of the broader academic library community to ensure a sustainable and extensible environment. As its name implies, LD4P is focused on the immediate needs of metadata production such as ontology coverage and workflow transition. The focus of LD4P is on the identification, evaluation and adaption of existing viable tools to immediate production needs. A related project, LD4L-Labs, focuses on solutions that can be implemented in production at research libraries within the next three to five years. Their efforts focus on the enhancement and development of existing or new linked data creation and editing tools, exploration of linked data relationships, analysis of the graph to directly improve discovery, BIBFRAME ontology development, piloting efforts in URI persistence, and metadata conversion tool development needed by LD4P and the broader library community.

As part of LD4P, Stanford is leading the development of a Performed Music Ontology and is converting four key technical services production pathways from MARC-based to RDF-based in a project called the Tracer Bullets. In this panel, we will discuss our work on these projects, highlighting achievements and difficulties of current efforts, as well as plans for future work. In this panel, we plan to discuss our work on these projects, highlighting achievements and difficulties of current efforts, as well as plans for future work. On the Performed Music Ontology, we will discuss our work on extending BIBFRAME 2 with community input to better support description of music artifacts. With regards to the “Tracer Bullets”, we will go through the progress on our four designated end-to- end pathways: vendor-supplied copy-cataloging (Tracer Bullet 1); original cataloging (Tracer Bullet 2); deposit of a single item to the Digital Repository (Tracer Bullet 3); and ingestion of a collection into the Digital Repository (Tracer Bullet 4). We have examined each of these pathways, from acquisition to discovery. Based on that analysis, we are converting all key elements in those workflows to a process rooted in linked data, balanced with the current needs and resources of the systems interacting with those pathways. Our emphasis is on the completeness of the pathway, and we plan for the workflows themselves to be expanded in the future to account for additional complexities and fully leveraging the capabilities of the RDF data models once our initial pathway has been established.

For these tracer bullet pathways, Stanford is developing parallel processing streams. Resources flowing through these pathways will be processed in the traditional way with MARC or MODS-based metadata. A parallel, linked data workflow will be created for LD4P and duplicative metadata created. This metadata currently feeds into a parallel discovery environment so that we mimic the entire processing workflow. The metadata can also be sent to various library vendors and programs so that they can begin to adjust their businesses to incorporate linked data. Although this solution requires duplicative effort, it will allow Stanford to experiment with an alternative pathway without being dependent on the results for discovery. It also has the benefit of testing the new pathway with actual library resources and staff so that a true measure of effort and cost to implement the new paradigm can be evaluated.

LD4P has completed the first year of its two-year grant and has made substantial process on the Tracer Bullets. In our panel presentation, we’d like to focus on five main areas:

Introduction: General information on the goals of LD4P and its context in the current library technical services paradigm.

Philip Schreur, Associate University Librarian for Technical and Access Services

Workflow Analysis: Workflow analysis for Tracer Bullets 1 & 2 including the testing of Tracer Bullet 1 with actual library data.

Arcadia Falcone, Metadata Coordinator

MARC Data Enhancement and Conversion: Suggestions for enhancements to MARC data to make their conversion to RDF cleaner and our testing of MARC to BIBFRAME 2.0 conversion.

Josh Greben, Systems Programmer/Analyst
Nancy Lorimer, Head, Metadata Department

Tooling: Experimentation with current tools available to support Tracer Bullets 1 & 2 along with their enhancement and new tool development.

Josh Greben, Systems Programmer/Analyst
Nancy Lorimer, Head, Metadata Department

Digital Repository: Initial exploration of Tracer Bullets 3 & 4 and their implications for the Stanford Digital Repository.

Christina Harlow, Digital Repository, Data Operations

Stanford University Panel Presenters:

Arcadia Falcone (Metadata Coordinator), Stanford University
Arcadia is responsible for coordinating metadata creation, application, and maintenance for content being deposited in the SDR, assessing, developing, and documenting standards for metadata, and brokering acceptance of standards across the libraries.

Josh Greben (Systems Programmer/Analyst), Stanford University
Josh is a programmer/analyst on the Library Systems team with several year’s experience transforming MARC data to BIBFRAME, and all the attendant system requirements for managing, hosting and serving the data. With the other technical team members, he will be a key participant in both the analysis and engineering to support the Tracer Bullets.

Nancy Lorimer (Head, Metadata Department), Stanford University
Nancy is the Head of the Metadata Department at Stanford and is the project coordinator for the Performed Music Ontology. Formerly a music cataloger for over 15 years, she has extensive experience with music metadata creation and standards creation, including leading various committees and task forces for the Music Library Association, and has been active in coordinating the development of controlled vocabularies for music.

Christina Harlow (Digital Repository, Data Operations)
Christina works on the Stanford Digital Repository in the role of data operations. This involves analysis, management, and development of infrastructure and tooling to manage data (metadata, binaries, and logs) that interacts with our digital repository ecosystem. On LD4P, Christina is primarily involved in managing the work of integrating the generated Linked Data into our digital repository workflows, as well as to explore streaming work to make these dataflows better managed and deployed.

Philip Schreur (Associate University Librarian for Technical and Access Services), Stanford University
Philip is the PI for LD4P and has overall responsibility for the project. He has been the Chair of the Program for Cooperative cataloging and deeply involved in the implementation of the new cataloging rules Resource, Description and Access (RDA) in the United States. With a mid-career move to HighWire Press, he developed an interest in the automated taxonomic analysis of digital texts. Currently, he is in charge of coordinating linked-data project development for the Stanford University Libraries (SUL).




DCMI logo DCMI's work is supported, promoted and improved by « Member organizations » around the world:

The National Library of Finland The National Library of Korea The National Library Board Singapore
Shanghai Library Simmons College GSLIS (US) Information School of the University of Washington
MIMOS Berhad Research Center for Knowledge Communities, Tsukuba University Infocom Corporation (Japan)
UNESP (Brazil) Universisty of Edinburgh SUB Goettingen

DCMI logo DCMI's annual meeting and conference addresses models, technologies and applications of metadata

Join logo
Become a DCMI