Dealing with Data 2019 (January 2020): Collaboration Across the Nations

Picture the scene: a cold January day, the wind blowing the scarves of the passers-by, visible through the large windows of the Informatics Forum meeting room. The group inside listens, takes notes, tweets, and asks questions of the speakers, representing a range of disciplines across the University…

Dealing with Data is an annual event hosted by the Research Data Service. Its aim is to engage the University community of researchers and support professionals around a theme, to share success stories and challenges in the myriad, everyday issues involved with data-driven research. The theme this year reflected the difficulty of managing research data in large, collaborative projects. Due to industrial action, the original November event was postponed to January. Around a hundred researchers – staff and students – participated, along with support staff who gave lightning talks about research-focused services. Full presentations and videos are now available.

Benjamin Bach, our keynote speaker, inspired us with state-of-the-art data visualisation software and techniques for both exploration and presentation. But he also illustrated the difficulty of portraying every facet of a rich dataset, and the consequences for its interpretation of the choices that must be made.

The first session began with Tamar Israeli’s study of researchers’ use of collaborative and institutional tools, which showed the challenges of making local infrastructure user-friendly enough to attract new users familiar with slick cloud-based services. Then Mark Lawson demonstrated his ingenious ‘ethical hacking’ to piece together a set of APIs to create a research workflow for samples and images for histology research. Minhong Wang conveyed a higher-level view of data management focused not just on data-driven, but knowledge-driven phenotyping.

Next were the lively lightning talks, in which Mike Wallis of Research Services warned of a new Digital Dark Age, and David Creighton-Offord spoke of the dilemmas in Information Security user support, where shiny doesn’t always equal safe. Lisa Otty spoke of innovative training and text mining projects bringing data science to the Humanities, and Rory MacNeil demonstrated how the RSpace electronic lab notebook can connect to a host of popular open science tools.

Following a lively lunch with chat between delegates and with hosts of the service exhibitions, Alex Hutchison showed a highly programmatic view of data management and ethics control from the UNICEF collaboration, in collecting and analysing real-world data about children in need. Cailean Gallagher offered a case study of how food courier data could be used to empower workers. Sanja Badanjak shared the problems she faced in integrating data on peace agreements around the world, conveying both innovative solutions and time-consuming workarounds.

In the final session Edward Wallace brought in the Edinburgh Carpentries to the rescue of poor data skills within Biological Sciences and the wider University – itself a great example of cross-community collaboration building a community of trainers. Gillian Raab showed us how any data problem, however intractable, can be solved by resourcefulness and determination, making use of DataSHIELD for multi-party computation when datasets are too sensitive to be shared. Johnny Hay and Tomasz Zieliński demo’d their Plasmo ‘boutique repository’ for plant-systems biology modelling, and Holly Tibble described tackling an international collaboration in linking administrative datasets via ‘ridiculously detailed’ statistical analysis plans. Representing the Research Data Service, I wrapped up proceedings with some of these very observations.

Both presentations and videos are available.

Welcome

  • Jeremy Upton, Director of Library and University Collections. [Presentation]

Keynote

  • Data Visualization for Exploration and Presentation, Prof. Benjamin Bach. Lecturer in Design Informatics and Visualization. [Presentation] [Slides]

Session 1 – Chair: Theo Andrew

  • “Data Something”: Assessing Tools, Services and Barriers for Research Data Collaboration at the University of Edinburgh – a small-scale study carried out by Dr Tamar Israeli with support from the Research Data Support team. Robin Rice – Data Librarian & Head of Research Data Support Services. [Presentation] [Slides]
  • Integrated secure web application to deliver centralised management of research samples, histology services and imaging data. Mark Lawson, Data & Project Manager, MRC Centre for Reproductive Health, QMRI. [Presentation] [Slides]
  • Building the Knowledge Graph for UK Health Data Science. Minhong Wang et al., Deanery of Molecular, Genetic and Population Health Sciences. [Presentation] [Slides]

Session 2 – Chair: Kerry Miller

  • The Data Opportunities & Challenges when Collaborating across Organisations
    Alex Hutchison, Delivery Director – Data for Children Collaborative with UNICEF. [Presentation] [Slides]
  • Restoring Gig Workers to Power: Personal Data Portability, Supply of Digital Content and Free Flow of Data in the European Data Economy. Cailean Gallagher, Scottish Trades Union Congress, & St Andrews University Institute of Intellectual History. [Presentation] [Slides]
  • Dealing with data in peace and conflict research. Sanja Badanjak, Postdoctoral Research Fellow, School of Law. [Presentation] [Slides]

Session 3 – Chair: Robin Rice

  • Bringing researchers to data: computing skills training with Edinburgh Carpentries.
    Edward Wallace, Sir Henry Dale Fellow, Institute of Cell Biology. [Presentation] [Slides]
  • Running an analysis of combined data when the individual records cannot be combined. Gillian M Raab and Chris Dibben, Scottish Centre for Administrative Data Research. [Presentation] [Slides]
  • The grant is dead, long live the data. Johnny Hay and Tomasz Zieliński, School of Biology, University of Edinburgh. [Presentation] [Slides]
  • International collaborations using linked administrative data: Lessons from the MARIC study. Holly Tibble, Usher Institute, University of Edinburgh. [Presentation] [Slides]

Robin Rice
Data Librarian and Head, Research Data Support
Library & University Collections


‘Protecting sensitive data’: new MANTRA unit now available

The Research Data Support team are pleased to announce a new update to MANTRA, the free and open online research data management training course.

The new ‘Protecting sensitive data’ module has been created from scratch, replacing the previous ‘Data protection, rights and access’ unit to provide an up-to-date guide for researchers working with sensitive and personal data.

MANTRA is designed to give postgraduate students, early career researchers, and information professionals the knowledge and skills needed to work effectively with research data.

The ‘Protecting sensitive data’ unit considers the ethical and legal contexts for conducting research with sensitive data, including information and guidance on:

  • What makes data sensitive?
  • What UK and European data protection laws mean for research
  • Research ethics, informed consent and consent documentation
  • Approvals processes and accessing restricted data
  • Safeguarding sensitive data, including data retention limits, access controls and anonymisation
  • Data protection training

MANTRA is designed to be approachable and informative, and incorporates text and video content plus quizzes, interactive exercises and a ‘further reading’ section.

We hope you find the new content interesting and useful, and we welcome comments on the new unit, as well as feedback on the other seven MANTRA modules.

Finally, the RDS team are currently working on a series of further updates to MANTRA, which will be rolled out over the coming weeks. Information about these releases will be posted on this blog and the RDS Twitter account.

Bob Sanders

MANTRA, Lead Editor


Training researchers for a software and data-intensive world with Edinburgh Carpentries

This is a guest post from Giacomo Peru and the EdCarp Committee (https://edcarp.github.io/committee/). Sections of this post were published previously on the EPCC blog.

The Edinburgh Carpentries (EdCarp) is a training initiative that offers the Carpentries computing and data skills curriculum in Edinburgh. The workshops train researchers in the fundamental skills needed for conducting efficient, open, and reproducible research. The EdCarp team comprises staff and student volunteers from across disciplines, academic units, and career stages.

Since 2018, EdCarp has organised 25 workshops across the academic institution, training over 300 staff and students in data cleaning, manipulation, visualisation and version control methods using tools such as R, Python, Unix shell, Git, SQL and OpenRefine. Courses are free to participants and are oversubscribed very quickly. We are now rolling out our 2020 schedule and announcing workshops.

EdCarp are working to establish collaborations with other organisations, external and internal to the university: the Scottish Funding Council, the Institute for Academic Development and the Data Driven Innovation programme.

EdCarp can work with your academic unit or doctoral training program to help promote the fundamental data skills that your colleagues need.

A crucial aspect of EdCarp and their training model is the participation and voluntary commitment of the community, where trainees go on to become helpers, helpers go on to become instructors, and so on. EdCarp are always looking for new people willing to help, in any capacity; please sign up here if you would like to be kept updated and/or get involved: https://eepurl.com/gl4MsX.

 


A visit from the data jungle: My internship in research data management

This is a guest post from Dr. Tamar Israeli, who completed a work/study internship with the Research Data Support team last autumn. A link to her report is available below.

Recently, there has been a rumor in Israel that research data should be managed. As a librarian and information specialist working in an academic institution, I decided to check if this was true.

When looking for a place for an internship on the role of the library in research data management (RDM), I was happy to find out that the University of Edinburgh RDM support team has a good reputation. I remember very much enjoying my visit to Edinburgh 30 years ago, so I was very happy to receive Robin Rice and Martin Donnelly’s kind invitation, which let me boldly go where… I had already been before.

During September 2019, I worked with the RDM support team, attended some of the staff meetings and participated in one of the RDM training sessions. As part of my internship we carried out a small-scale study. Its main purpose was to understand the barriers that prevent researchers from using the tools and services the university provides when they collaborate with data.

For that purpose, I interviewed six researchers from different schools and disciplines. The researchers were open and cooperative and the interviews were very interesting and insightful. If you’d like to learn about the way researchers collaborate and what influences their decision to use a particular tool or service, here is a link to our report: http://dx.doi.org/10.7488/era/2

Many thanks to the support team for their invitation and warm hospitality. It was one of the most pleasant months of my life.

Tamar Israeli
Librarian and information specialist
Western Galilee College
