Digital Curation: maintaining and adding value to a trusted body of digital information for future and current use; specifically, the active management and appraisal of data over the entire life cycle
Digital Preservation: the series of actions and interventions required to ensure continued and reliable access to authentic digital objects for as long as they are deemed to be of value
Data Curation Tools
(see Data Tools for a more general list of useful tools)
- Open Data Tools: Turning Data into ‘Actionable Intelligence’ (2013) – A comprehensive list of “More than 349 Subject Specific Open Data Tools.”
- Information Space: 86 Helpful Tools for the Data Professional PLUS 45 Bonus Tools – Very useful anthology of tools and resources for data professionals, data dabblers, or data scientists from the iSchool at Syracuse.
- Digital Curation Resources outside the DCC – Catalog of tools for data creators and digital curators.
- DCC (Digital Curation Centre) Tools – A suite of data management and curation tools created by the UK's Digital Curation Centre.
- Digital Curation Glossary – Glossary of data curation and data preservation terminology from the Digital Curation Centre (UK).
- OpenRefine – OpenRefine (ex-Google Refine) is a powerful tool for working with messy data, cleaning it, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase.
- ORCID – An open community-based effort to create and maintain a registry of unique researcher identifiers and a transparent method of linking research activities and outputs to these identifiers.
Data Curation Publications
- Data Curation, SPEC Kit 354 (2017) – The Association of Research Libraries' SPEC Kit "explores the infrastructure that ARL member institutions are using for data curation, which data curation services are offered, who may use them, which disciplines demand services most, library staffing levels, policies and workflows, and the challenges of supporting these activities. It includes examples of data repository web pages, descriptions of services, infrastructure, workflows, metadata schemas, and policies, and job descriptions."
- 10 Simple Rules for the Care and Feeding of Scientific Data (2014) – Collaborative article offering a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized.
- Research Data Management: Principles, Practices and Prospects (2013) – This CLIR publication contains chapters pertaining to various aspects of Research Data Management that cover the full life cycle of curation.
- A Workflow Model for Curating Research Data in the University of Minnesota Libraries: Report from the 2013 Data Curation Pilot (2014) – Report of the University of Minnesota Libraries' 2013 Data Curation project investigating the libraries' programmatic and technical capacities for supporting the campus's RDM services with a fixed term data curation pilot.
- Research Data Curation Bibliography: Version 2 (2013) – This selective bibliography includes over 200 English-language articles and technical reports that are useful in understanding the curation of digital research data in academic and other research institutions.
- Data Curation is for Everyone! The Case for Master’s and Baccalaureate Institutions Engagement with Data Curation (2012) – Much of the discussion around data curation is framed around large research institutions. This article discusses reasons why master’s and baccalaureate institutions should engage with data curation too, and explains how one primarily undergraduate institution went about it.
- Curating for Quality: Ensuring Data Quality to Enable New Science (2012) – Final report from NSF sponsored workshop focusing on defining data quality issues and possible solutions. Includes outline of key points raised in the workshop and position papers submitted by participants. Key challenges outlined in the report include data selection strategies, understanding what context to include in data curation, tools and techniques that support painless data curation across disciplines, and cost models.
- Managing Research Data (2012) – Written for librarians, this book covers a wide variety of topics related to managing research data, such as why manage research data, explanation of the research data lifecycle, data management planning, and roles librarians can play to serve their faculty. The book is a compilation by many authors, all authorities in managing research data, and represent US, UK, and Australian academic and research institutions.
- Beyond The Low Hanging Fruit: Archiving Complex Data and Data Services at University of New Mexico (2012) – Describes history of data use that has lead to discovery and two specific test cases of data curation at the University of Mexico. Discusses issues related to repository software, and file size, and planning services that simplify the data documentation process for researchers and librarians.
- Linking to Scientific Data: Identity Problems of Unruly and Poorly Bounded Digital Objects (2011) – Explores the need for establishing standards for identity construction for scientific datasets.
- Managing Research Data Lifecycles through Context (2011) – Outlines Rutgers University Libraries’ approach to supporting research data lifecycle management.
- Communicating Scientific Data from the Present to the Future – Position paper from Princeton’s 2011 Research Data Lifecycle Management workshop advocates use of HDF5, Hierarchical Data Format Version, a generic scientific data format with supporting software, for long-term preservation of heterogeneous research data.
- Data Curation: An ecological perspective by Sayeed Choudhury (2010) – College & Research Library News. – Sayeed Choudhury draws inspiration from the natural world to illustrate the need for different library communities to contribute to an overall data curation network.
- Retooling Libraries for the Date Challenge by Dorothea Salo (2010) – Libraries need to recognize the unique characteristics of research data in order to implement effective work practices for data curation.
- Learning by Doing: Cases of Librarians Working with Faculty Research Data for the First Time (2010) – Purdue librarians conducted an exercise to learn about data curation in practical terms by identifying and engaging potential data contributors on campus. Subject specialist librarians engaged with six data creators from different disciplines to obtain data set contributions. The librarians reported descriptions of the data, the rationale for its selection and narratives of how they engaged with the data creators and questions and insights that emerged from these interactions.
Data & Intellectual Property Rights
- Introduction to Intellectual Property Rights in Data Managment – Provided by the Cornell University Research Data Management Services Group
- Copyright, Licensing and Intellectual Property Issues for Data – Guide by Duke University
- Intellectual Property Rights and Research Data: Focus on Copyright – Guide by University of Cambridge
- Research Data Management and Intellectual Property – Guide by University of Oregon