Workshops – Open Repositories 2019

You will need to be registered for the conference before you can register for the workshops. Places for the workshops are limited.

Full-Day Workshops

Invenio User Group Workshop 2019

Lars Holm Nielsen, Jose Benito Gonzalez Lopez, Kai Wörner

Monday, 9:00-12:30 & 13:30-17:00, room East 121

Invenio User Group Workshop (IUGW) is a biennial workshop where the Invenio repository community meet among users and developers from around the world. The workshop consists of a series of tour de table service presentations, talks from attendees and a brainstorming session related to the Invenio digital repository framework. We exchange knowledge and experiences and drive forward the forthcoming developments of the Invenio platform. Previous workshops have included diverse presentations from the attendees on topics like IIIF, author disambiguation in Invenio, reference extraction, ILS for Invenio , statistics and reporting in Invenio, modelling copyright and licensing in MARC, just to name a few.

Research Data Packaging – Research Objects, Frictionless Data Packages, Datacrates and beyond

Peter Sefton, Carole Goble, Stian Soiland-Reyes

Monday, 9:00-12:30 & 13:30-17:00, room East 222

Packaging research data for discovery, distribution and reuse is a key concern for repositories; research data sets almost always consist of more that one kind of file and it is a challenge for repository managers to make data available in appropriate formats.

This two-part workshop will cover the state of the art in research data packaging – building on the outcomes from a successful and well attended workshop on Research Objects at eScience 2018 in Amsterdam but with a more tutorial approach. The morning part of the workshop will cover an introduction to data packaging and metadata for data discovery via search engines with practical advice about available tooling and examples of repositories using them. The afternoon will be an interactive session which will allow participants to get advice, discuss further work needed in this space and convey their requirements to leaders in the data packaging and repository technology space.

Morning Workshops

Getting Started with DSpace 7 (Basic Training)

Tim Donohue, Art Lowel, Andrea Bollini

Monday, 9:00-12:30, room West 121

DSpace 7 is a major step in the evolution of the DSpace platform and repositories in general. While retaining its ease-of-use, out-of-the-box goals, DSpace 7 features a brand new, client-side, responsive user interface (built on Angular), a full-featured, self-describing REST API, a powerful new configurable object model (featuring typed Items and relations between Items), and alignment with the COAR Next Generation Repositories recommendations (via a new ResourceSync interface and Signposting support).

This workshop, which provides the first comprehensive training on DSpace 7.0, will be split into two half day sessions. This is the first session:

Basic Training: The first half day will concentrate on installing or upgrading to DSpace 7 (Audience: Anyone). Activities will include overview of install/upgrade process, new features, new configuration options, and hands-on UI branding.

Repository/CRIS Workshop (I):
Realising Technical Interoperability and/or Integration

Michele Mennielli, Anna Clements, Pablo de Castro

Monday, 9:00-12:30, room East 221

This 3-hr session will look into the technical challenges posed by the need to make repositories and CRIS systems interoperable or, in some cases, fully integrated. Several case studies on technical information exchange workflows will be presented from a platform-agnostic perspective, and a discussion will be held afterwards among the presenters, the workshop organisers and the event attendees. This session will be of particular interest for developers and research information management practitioners.

A user journey in OpenAIRE services through the lens of repository managers (I – OpenAIRE interoperability guidelines, the content acquisition policy and the graph expansion)

Pedro Príncipe, Paolo Manghi, Leonard Mack, André Vieira, Jochen Schirrwagen

Monday, 9:00-12:30, room West 120

OpenAIRE is the European Union initiative for an Open Access Infrastructure for Research in Europe that aims to establish an open and sustainable scholarly communication infrastructure responsible for the overall management, analysis, manipulation, provision, monitoring and cross-linking of all research outcomes (publications, related datasets, software and services) across existing, planned and future repositories. In the new phase of OpenAIRE, among the project’s goals, there is the one to repackage OpenAIRE services providing them as complete products to the final users.

This workshop puts forward an interactive session aiming to provide detailed information on the main services and tools targeting content providers and in which OpenAIRE team foresee to collect contributions from the repository manager’s community to further develop the portfolio of services offered by OpenAIRE.

The workshop will provide an interactive demo of the dashboard for content providers tools (repositories registration and validation, collection monitor and content enrichments), detailing mainly the catch-all broker service functionalities, such as the metadata enrichments and the usage statistics service. Additionally, this all day workshop will introduce attendees the new content acquisition policy, the Guidelines for Literature Repository Managers version 4.0 and the OpenAIRE graph.

Islandora for All: ISLE Workshop

David Keiser-Clark,Bethany Seeger

Monday, 9:00-12:30, room East 123

This workshop is for people interested in migrating from a proprietary digital repository to an open-source collections management platform; those already on Islandora looking to reduce maintenance costs and improve security; and those seeking to create a new digital repository for their institution. ISLE is an open-source solution and offers a quick installation and timely automated updates. This workshop is designed for people who work closely with their digital repository, including repository administrators, developers, metadata specialists, archivists, systems administrators, or directors.

This friendly yet comprehensive hands-on instruction uses ISLE to install and update an institution’s Islandora digital repository platform.

This workshop will:

provide an overview of the ISLE project and technical make up;
provide a public demonstration;
walk through creating a local development environment of Islandora on your laptop;
provide basic instructions on how to ingest/edit/delete a sample set of digital objects in your repository;
show how to create permanent customizations that will persist through ISLE updates;
show how to use ISLE to update your Islandora platform and run supplied automated tests to verify success;
provide a checklist for migrating to ISLE;
provide Q&A; if time permits, we’ll discuss future roadmap.

Everyone will succeed in this workshop!

Closing the gap – connection points between DMPs and repositories

Sarah Jones, Sam Rust, Magdalena Dafiova

Monday, 9:00-12:30, room East 122

This proposal is for a 3 hour workshop session to discuss the potential connection points between Data Management Plans and repository platforms. DMPs capture a lot of useful information that could assist repositories in capacity planning and improving deposit workflows. At the IDCC conference in 2017, we ran a workshop that gathered requirements based around a number of use cases, including those of the repository community. [1] In the White Paper that resulted, we highlighted a number of potential avenues for exploration.

In the intervening period, a series of activities have taken place internationally. The California Digital Library has an NSF Eager grant in which they have been collaborating with data centres. [3] Meanwhile DCC is developing export features to common repository and journal platforms. Within the Research Data Alliance, two Working Groups are developing Common Standards for DMPs [4] to facilitate information exchange and addressing use cases for ‘exposing DMPs’ – specifically what information can be shared and how, when and by whom can it be used. [5]

The workshop will provide an update on recent activities and engage with the OR community to develop work further.

Hands on User Experience. Concepts and methods that takes you from idea to prototype in 3 hours.

Kristin Olofsson

Monday, 9:00-12:30, room West 122

This workshop will lead you in the process of going from zero, to an idea, to a lo-fi prototype in three hours, using a Lean UX startup framework in a fun and easy way.

The overall aim of the workshop is to explore a method that helps you embrace that process in a very short time. We will cover things like ideation, making a proto-persona, exploring user values, feature prioritizing, quick decision-making technique. Intermixed, you will also learn some Lean UX concepts and methods.

It’s a very hands-on workshop where everyone will work independently with their own cases, lead step by step by the workshop leader.

The audience is typical developers, librarians, product owners, designers and people working with Institutional repositories or other systems in general. No laptop or coding skills are required. The attendees are expected to participate actively.

The workshop is for everyone that wants to learn a method that helps them with ideation, prioritization, “package a service” and turning an idea into a testable concept. In short – create products that users want!

Samvera – an introduction to the community and sustaining its hosted and custom digital repository solutions

Chris Awre, Robin Ruggaber, Julie Allinson

Monday, 9:00-10:30, room East 120

Samvera is a community, a set of tools, and a collection of ready-to run and hosted applications to help build a digital repository for your institution. The community is open source in its practice and sustainable in its focus. This 90 minute workshop will provide an on-boarding and general entrée to the Samvera community and solutions for non-coders.

The workshop will provide an overview of Samvera solutions, hosting options and the community – what is it, why is it different? It will showcase applications solving a diverse set of needs and organizations, and discuss the how the community at large works to enable these.

The introduction will be followed by a general technical overview designed for a non-technical audience. The resources needed to maintain and contribute to a hosted or custom Samvera solution will be discussed, resources that exist to get started will be highlighted plus how to contribute to the community technically and non-technically.

The workshop will conclude with a discussion on how to pitch Samvera and get institutional support. It will look at the advantages of being part of the community and how that strengthens the sustainability of the tools, the applications, and the community overall.

Building a shared repository service using Samvera Hyku

Sara Gould, Jenny Basford, Brian Hole, Tom Mowlam

Monday, 11:00-12:30, room East 120

Samvera is an open source repository system with a growing user base. Since launch, two Samvera flavours have emerged: Hyrax, now with around 50 active repositories; and more recently Hyku, specifically designed to support multiple repositories on a single instance.

By June 2019, the British Library will be reaching the end of a pilot project to develop shared repository services using Hyku as the central platform for itself and four partners, British Museum, Tate, National Museums Scotland and MOLA (Museum of London Archaeology).

This workshop will cover:

The Samvera Hyku system, focusing on how it’s being developed to meet expectations of the British Library’s project
The successes and challenges for the Library, our users and partners of developing a shared repository service.

The workshop aims to be useful for both business/repository managers and repository system colleagues. We aim to pitch it at a level that makes it accessible for the ‘simply interested and not too technical’ but with enough detail to provide genuine insight into the potential of Samvera Hyku. After introductory presentations, the audience will divide to focus on the two areas above, with the chance to dig into the detail and throw questions at our expert presenters.

Afternoon Workshops

Getting Started with DSpace 7 (Advanced Training)

Tim Donohue, Art Lowel, Andrea Bollini

Monday, 13:30-17:00, room West 121.

This is the second session of the workshop:

Advanced Training: The second half day will concentrate on advanced customization of DSpace 7 (Audience: Developer oriented). Hands-on activities will include advanced enhancement of the UI (beyond basic branding), how to use the REST API, and contributing back.

Repository/CRIS Workshop (II):
Practical Applications for Interoperability and/or Integration

Michele Mennielli, Anna Clements, Pablo de Castro

Monday, 13:30-17:00, room East 221

The second 3-hr slot for this two-part workshop will explore the multiple applications of repository/CRIS platforms in areas like metadata management for publications and data, the provision of funding information across platforms or the introduction of a growing range of persistent identifiers. Same as for the first part, the session will include a number of case studies and will be followed by a discussion and a Q&A. This session will mainly be addressed at repository managers.

A user journey in OpenAIRE services through the lens of repository managers (II – OpenAIRE dashboard for content providers, usage statistics and the catch-all broker service)

Pedro Príncipe, Paolo Manghi, Leonard Mack, André Vieira, Jochen Schirrwagen

Monday, 13:30-17:00, room West 120

Workshop description to follow.

Docker for DSpace – Lowering the Barrier of Entry for New Contributors

Terrence Brady, Pascal-Nicolas Becker

Monday, 13:30-17:00, room East 120

In 2018, the DSpace project began publishing Docker images for each major branch of DSpace. These published images have already provided the DSpace developer community with greater flexibility in running and testing DSpace. Additionally, these published images provide a tremendous opportunity for on-boarding new DSpace contributors. By easing the setup of DSpace, Docker helps people to test DSpace configuration changes, DSpace versions, or even code changes without the need to fully understand how to install DSpace on their own. This workshop has been designed to reach all of these audiences: DSpace developers, new DSpace contributors, and people that want to test DSpace easily.

Docker provides users with a consistent and predictable runtime environment. Docker allows a user to manage multiple (and incompatible) environments from a single desktop or test server. For example, you can run different versions of DSpace, or DSpace with different versions of Java with ease.

Attendees will leave the session with the ability to run DSpace 6 and DSpace 7 in Docker on their machine, to tweak the configuration of these installations, and to recompile and restart DSpace in case of code changes or additions.

Introduction to Fedora 5.0 and the API Specification

David Wilcox, Andrew Woods, Daniel Bernstein

Monday, 13:30-17:00, room East 122

Fedora is a flexible, extensible, open source repository platform for managing, preserving, and providing access to digital content. The Fedora community recently released version 5.0, which brings the software into alignment with the recently published API specification. Both new and existing Fedora users will be interested in learning about and experiencing new Fedora features and functionality first-hand.

Using pre-configured virtual machines, participants will learn how to create and manage content in Fedora in accordance with linked data best practices and the Portland Common Data Model. Particular attention will be paid to new features and functionality in the 5.0 release. Participants will also learn how to search and run SPARQL queries against content in Fedora using the included Solr index and triplestore, and how to import resources into Fedora and export resources from Fedora to external systems and services as part of a digital curation workflow.

The human infrastructure: building community to support repository collaborations

Leila Sterman, Nick Shockey, Joseph McArthur

Monday, 13:30-17:00, room West 221

Two decades into the repository movement, there have been huge advances in repository technology and the efforts to establish repositories as important research infrastructure. Still, librarians and repository managers on many campuses struggle to encourage a local research culture that promotes the use and support of repositories. This workshop will use interactive exercises to highlight existing resources, plan new initiatives, and build community to support the human infrastructure of repositories.

There is a tendency to collaborate on technological infrastructure more visibly than we collaborate around scholarly communication services. This session aims to facilitate targeted collaboration opportunities with colleagues across regions, platforms, and experience levels and explore support structures that might enable further community building. By encouraging the use of shared resources and the repository community to build cross-repository practices and solutions we hope this workshop will promote ongoing conversations between repository managers and foster a culture of collaboration that supports these projects.

Introducing Algorithmic Awareness: Thinking about Algorithms and Digital Literacy in Open Repositories

Jason Anthony Clark, Julian Kaptanian

Monday, 13:30-17:00, room East 120

Why does your technology seem to know what you want before you do? Increasingly, our digital experiences are mediated by obscure algorithms. But what are algorithms and how can we teach about them? This workshop introduces “algorithmic awareness” and modes of teaching the rules governing software and shaping digital experiences.

Using YARD to Curate for Reproducibility

Limor Peer, Thu-Mai Christian, Florio Arguillas

Monday, 13:30-17:00, room West 122

Scientific reproducibility has captured the attention of academics, technologists, government agencies, private funders, and the public. We focus on computational reproducibility — the ability to obtain the same results from the data and code used in the original study — for two reasons. One, computational reproducibility is essential for understanding the complete scholarly record. Two, as data managers and archivists, we strongly feel that a test of computational reproducibility should factor into decisions about preserving and sharing these materials. Repositories have a responsibility to ensure the materials comprising the scholarly record can be used as expected in the long term. We advocate for curating for reproducibility (CURE), which involves activities that ensure that statistical and analytic claims about given data can be reproduced with that data. This 3-hour workshop is intended for librarians, data curators, and researchers of diverse professional backgrounds and experience. Participants will be introduced to the topic of curating for reproducibility, hear perspectives from three institutions practicing curating for reproducibility; learn about the CURE workflow and how to curate for reproducibility using YARD, a curation tool, using examples and hands-on activities. Participants will be invited to test YARD and deposit their own data and code.

Evening Workshops

DataCite DOI Services for Repositories – making all research outputs persistent

Robin Dasler

Monday, 17:30-19:00, room West 120

With research data increasingly being acknowledged as a first-class research output, it’s important for repositories to think about persistent identification of datasets. DataCite is a global non-profit organization that provides services that enable repositories to create and manage DOIs. Through DataCite, repositories can assign DOIs to their materials and expose their metadata to the world through services like DataCite Search.

This workshop will provide an overview of these services to explain to repositories how they can make the most of the existing services for datasets and other non-traditional research outputs. In addition, we will gather community input to inform further development of future services to ensure we enable repositories to meet the needs of their users.

Getting started with Haplo

Ben Summers, Tom Renner

Monday, 17:30-19:00, room East 221

This session provides developers with the opportunity to get to know Haplo. A relative new comer to the institutional repository community, this ‘new kid on the block’ has been used to build applications managing collections of semi-structured information for over ten years with particular strengths in workflow design, managing restricted files securely, and practice-based portfolio collections, all built on an innovative metadata model. The open source Haplo Repository is offered as a stand-alone repository or as an integrated part of the Haplo Research Management System.

Repository data management with GLAMpipe

Ari Häyrinen

Monday, 17:30-19:00, room East 222

Data management can be a frustrating process. Typically this involves using spreadsheets, writing adhoc Python scripts, or using R, Catmandu or similar tools. Albeit these tools have their own strengths, they also have various problems when used in managing repository content. Spreadsheets can be very labour intensive and clumsy for example for data with multiple authors. Using scripts requires coding good skills and often adhoc scripts are not documented and therefore not very useful for other team members.

GLAMpipe is an open source, flexible-by-design, document processing tool meant for GLAM (Galleries, Libaries, Archives, Museums) data. GLAMpipe can be used for data importing, exporting, processing and browsing. It has a graphical user interface and a full REST api.

The workflow of GLAMpipe is based on nodes. There are import nodes, processing nodes, export nodes and view nodes. Import nodes can import data from REST apis (like Dspace 6 or Wikidata) or CSVfiles.Processing files can make lookups, transfomations, metadata mappings, string operations and so on. Export nodes can push data and files to different REST apis (like Dspace 6) or CSV file.

The workshop is intended to everyone who is doing data management or who is creating a tools for content management.

Open your structured data with Wikibase — install your own instance of the technology behind Wikidata

Jens Ohlig

Monday, 17:30-19:00, room East 121

Wikibase lets you share your data in a form that can be read and edited by humans and machines. Read to query, link to other knowledge bases and open for everyone.

In this workshop you will learn how your organization can establish its own instance of a structured data repository and become a neighbor in the Linked Open Data Web. You will learn about Wikibase, the software behind Wikidata. Wikidata is a project by Wikimedia — the organization behind Wikipedia — to build a repository for the sum of all human knowledge.

Measuring data reuse with the COUNTER Code of Practice for Research Data

Helena Cousijn, Daniella Lowenberg, Lars Holm Nielsen, Kristian Garza, Martin Fenner

Monday, 17:30-19:00, room West 121

Many institutional repositories enable researchers to make their research data publicly available. However, few repositories have implemented ways to assess reuse of those datasets and share this information with their researchers. Within the Make Data Count project, standards and infrastructure have been developed to enable repositories to count and share views and downloads of datasets in the repository. This information feeds into a central hub where, together with citation information, the usage statistics are shared with the community.

In this workshop, we will explain the new COUNTER Code of Practice for Research Data and walk through all the steps needed to implement the code for your repository. Early implementers will be present to share their experiences with implementation and show examples of how these usage statistics are being displayed within their repository.