About Robin Rice

Robin on Twitter: Sparrowbarley

Dealing with Data 2019 (January 2020): Collaboration Across the Nations

Picture the scene: A cold January day, the wind blowing the scarves of the passers-by through the large windows of the Informatics Forum meeting room. The group inside listens, takes notes, tweets, and asks questions of the speakers, representing a range of disciplines across the University…

Dealing with Data is an annual event hosted by the Research Data Service. Its aim is to engage the University community of researchers and support professionals around a theme, to share success stories and challenges in the myriad, everyday issues involved with data-driven research. The theme this year reflected the difficulty of managing research data in large, collaborative projects. Due to industrial action, the original November event was postponed to January. Around a hundred researchers – staff and students – participated, along with support staff who gave lightning talks about research-focused services. Full presentations and videos are now available.

So Benjamin Bach, our keynote speaker, inspired us with state of the art data visualisation software and techniques for both exploration and presentation. But he also illustrated the difficulties of portraying all of the data in all of its facets of a rich dataset, and the consequences of making necessary choices for its interpretation.
The first session began with Tamar Israeli’s study of researchers’ use of collaborative and institutional tools showed the challenges of making local infrastructure user friendly enough to attract new users familiar with slick cloud-based services. Then Mark Lawson demonstrated his ingenuous ‘ethical hacking’ to piece together a set of APIs to create a research workflow for samples and images for histology research. Minhong Wang conveyed a higher level view of data management focused not just on data-driven, but knowledge-driven phenotyping.

Next were the lively lightning talks, in which Mike Wallis of Research Services warned of a new Digital Dark Age, and David Creighton-Offord spoke of the dillemmas in Information Security user support where shiny doesn’t always equal safe. Lisa Otty spoke of innovative training and text mining projects bringing data science to the Humanities, and Rory MacNeil demonstrated how the RSpace electronic lab notebook can connect to a host of popular open science tools.

Following a lively lunch with chat between delegates and with hosts of the service exhibitions, Alex Hutchison showed a highly programmatic view of data management and ethics control from the UNICEF collaboration, in collecting and analysing real world data about children in need. Caileen Gallagher offered a case study of how food courier data could be used to empower workers. Sanja Badanjak shared her data integration problems of peace agreements around the world, conveying both innovative solutions and time-consuming workarounds.

In the final session Edward Wallace brought in the Edinburgh Carpentries to the rescue of poor data skills within Biological Sciences and the wider University – itself a great example of cross-community collaboration building a community of trainers. Gillian Raab showed us how any data problem however intractable can be solved by resourcefulness and determination, making use of DataShield for multi-party computation when datasets are too sensitive to be shared. Johnny Hay and Tomasz Zielinski demo’d their Plasmo ‘boutique repository’ for plant-systems biology modelling and Holly Tibble described tackling an international collaboration in linking administrative datasets via ‘ridiculously detailed’ statistical analysis plans. Representing the Research Data Service, I wrapped up proceedings with some of these very observations.
Both presentations and videos are available.

Welcome

  • Jeremy Upton, Director of Library and University Collections. [Presentation]

Keynote

  • Data Visualization for Exploration and Presentation, Prof. Benjamin Bach. Lecturer in Design Informatics and Visualization. [Presentation] [Slides]

Session 1 – Chair: Theo Andrew

  • “Data Something”: Assessing Tools, Services and Barriers for Research Data Collaboration at the University of Edinburgh – a small-scale study carried out by Dr Tamar Israeli with support from the Research Data Support team. Robin Rice – Data Librarian & Head of Research Data Support Services. [Presentation] [Slides]
  • Integrated secure web application to deliver centralised management of research samples, histology services and imaging data. Mark Lawson, Data & Project Manager, MRC Centre for Reproductive Health, QMRI. [Presentation] [Slides]
  • Building the Knowledge Graph for UK Health Data Science Minhong Wang et. al, Deanery of Molecular, Genetic and Population Health Sciences. [Presentation] [Slides]

Session 2 – Chair: Kerry Miller

  • The Data Opportunities & Challenges when Collaborating across Organisations
    Alex Hutchison, Delivery Director – Data for Children Collaborative with UNICEF. [Presentation] [Slides]
  • Restoring Gig Workers to Power: Personal Data Portability, Supply of Digital Content and Free Flow of Data in the European Data Economy. Cailean Gallagher, Scottish Trades Union Congress, & St Andrews University Institute of Intellectual History. [Presentation] [Slides]
  • Dealing with data in peace and conflict research. Sanja Badanjak, Postdoctoral Research Fellow, School of Law. [Presentation] [Slides]

Session 3 – Chair: Robin Rice

  • Bringing researchers to data: computing skills training with Edinburgh Carpentries.
    Edward Wallace, Sir Henry Dale Fellow, Institute of Cell Biology. [Presentation] [Slides]
  • Running an analysis of combined data when the individual records cannot be combined. Gillian M Raab and Chris Dibben, Scottish centre for Administrative Data Research. [Presentation] [Slides]
  • The grant is dead, long live the data. Johnny Hay and Tomasz Zieliński, School of Biology, University of Edinburgh. [Presentation] [Slides]
  • International collaborations using linked administrative data: Lessons from the MARIC study. Holly Tibble, Usher Institute, University of Edinburgh. [Presentation] [Slides]

Robin Rice
Data Librarian and Head, Research Data Support
Library & University Collections

Share

A visit from the data jungle: My internship in research data management

This is a guest post from Dr. Tamar Israeli, who completed a work/study internship with the Research Data Support team last Autumn. A link to her report is available below.

Recently, there has been a rumor in Israel that research data should be managed. As a librarian and information specialist working in an academic institution, I decided to check if this was true.

When looking for a place for an internship on the role of the library in research data management (RDM), I was happy to find out that the University of Edinburgh RDM support team has a good reputation. I remember enjoying very much my visit to Edinburgh 30 years ago so I was very happy to get Robin Rice & Martin Donnelly’s kind invitation so I could boldly go where… I had already been before.

During September 2019, I worked with the RDM support team, attended some of the staff meetings and participated in one of the RDM trainings.  As part of my internship we carried out a small scale study. The purpose of the study was mainly to understand what are the barriers that prevent researchers from using tools and services provided to them by the university when collaborating with data.

For that purpose, I interviewed six researchers from different schools and disciplines. The researchers were open and cooperative and the interviews were very interesting and insightful. If you’d like to learn about the way researchers collaborate and what influences their decision to use a particular tool or service, here is a link to our report: http://dx.doi.org/10.7488/era/2

Many thanks to the support team for their invitation and warm hospitality. It was one of the most pleasant months of my life.

Tamar Israeli
Librarian and information specialist
Western Galilee College

Share

New research data management tool on one-year trial: protocols.io

Information Services aims to offer a research data service that meets most of the data lifecycle needs of the majority of UoE researchers without interfering with their freedom to choose tools and technologies which suit their work. In some cases cloud tools that are free to individual users are offered commercially as enterprise versions, allowing groups of researchers (such as lab groups) to work together efficiently.

The service’s steering group has agreed a set of criteria to apply when a tool is put forward by a research group for adoption. The criteria were developed after our two-year trial of the electronic lab notebook software, RSpace, and have been most recently applied to protocols.io. The protocols.io trial begins this month and will run for one year. An evaluation will determine whether to continue the enterprise subscription and how to fund it.

protocols.io is an online platform for the creation, management, and sharing of research protocols or methods. Users can create new protocols within the system, or upload existing methods and digitise them. Those with access to a protocol can then update, annotate, or fork it so that it can be continually improved and developed. There is interoperability with Github and RSpace, and long-term preservation of protocols through CLOCKSS.

Users can publish their protocol(s) making them freely available for others to use and cite or, with the enterprise version, keep them private. The tool supports the Open Science / Open Research agenda by helping to ensure that methods used to produce data and publications are made available, assisting with reproducibility.

Subscribing to the University plan will allow research groups to organize their methods and ensures that knowledge is not lost as trainees graduate and postdoctoral students move on. There are currently over 70 University of Edinburgh researchers registered to use protocols.io. You may follow these instructions to move your current protocols.io account to the premium university version. For more information contact data-support@ed.ac.uk.

Kerry Miller and Robin Rice
Research Data Support team

Share

Research Data Service achieves ISO 27001 accreditation for Data Safe Haven facility

Following a five day on-site audit by Lloyd’s Register, the Information Security Management System (ISMS) which forms the basis for the Data Safe Haven facility for University of Edinburgh researchers has been officially certified to the ISO/IEC 27001:2013 standard. In a few weeks we will receive a certificate from UKAS (United Kingdom Accreditation Service).

The Data Safe Haven (DSH) team, comprised of members of Research Data Support in L&UC and Research Services in ITI, and with input from the Information Security team and external consultants, has been working toward certification since 2016. The system, designed by ITI’s Stephen Giles, has been extensively and successfully ‘white box penetration tested’ by external experts, one of the many forms of proof provided to the auditor. (White box means the testers were given access to certain layers of the system, as opposed to a black box test where they are not.)

The steel cage surrounding Data Safe Haven equipment in one of the University data centres.

In addition to infrastructure, a proper ISMS is made up of people who perform roles and manage procedures, based on organisational policies. The Research Data Support team work with research project staff to ensure their practices comply with our standard operating procedures. The ISMS is made up of all the controls needed to ensure that it is sensibly protecting the confidentiality, availability, and integrity of assets from threats and vulnerabilities. Over 150 managed and versioned documents covering every aspect of the ISMS were written, discussed, practiced, reviewed and signed off before being examined and questioned by the auditor.

The auditor stated in the final report, “The objectives of the assessment were achieved and with consideration to any noted issues or raised findings, the sampled areas of the management system demonstrated a good level of conformance and effectiveness. The management system remains supportive of the organisation and its business and service management objectives.” On a slightly more upbeat note, Gavin Mclachlan, Vice-Principal and Chief Information Officer, and Librarian to the University said by email, “Congratulations to you and the whole team on the ISO 27001 certification. That is a great achievement.”

The Digital Research Services programme has invested in the Data Safe Haven to allow University researchers to conduct cutting edge research, access sensitive data from external providers and facilitate new research partnerships and innovation. Researchers are expected to include Data Safe Haven costs in funded grant proposals to achieve some cost recovery for the University. To find out if your project is a candidate for use of the Data Safe Haven contact data-support@ed.ac.uk or the IS Helpline.

Robin Rice
Data Librarian and Head, Research Data Support
L&UC

Share