English Arabic Chinese (Simplified) French Hindi Japanese Korean Persian Portuguese Russian Spanish

A Library-based Data Retrieval and Management Training Program for Clinical Researchers

Enterprise Data Warehouses (EDWs) play an increasingly important role on academic medical campuses, housing clinical and other enterprise-wide data and making it available for research and strategic purposes in the learning health system. Here, we will create an end-to-end training program that introduces clinical researchers to clinical database architecture and clinical coding standards, teaches them how to translate their research questions into queries that will allow them to extract data properly, and how to do so in a way that supports transparency and reproducibility while still respecting guidelines for proper data sharing. This work will build on longstanding partnerships with the leadership and data analysts of the Northwestern Medicine Enterprise Data Warehouse (NMEDW). [1,2] We will work together to promote improved communication and collaboration between data analysts and clinical researchers to make them better partners in research projects. To promote reusability of research reports and database queries within Northwestern’s research community, we will provide workflows for preservation through our next-generation research data management (RDM) system to make these resources discoverable. Our ultimate goal is to bolster support for our local research community to use clinical research data from the NMEDW and also parlay this experience to develop a blueprint of best practice workflows for clinical research data education and training that could be applied in libraries at other institutions. [1] Available at https://www.nucats.northwestern.edu/resources/data-science-and-informatics/nmedw/index.html [2] Starren, JB, Winter, AQ, Lloyd-Jones, DM. “Enabling a Learning Health System through a Unified Enterprise Data Warehouse: The Experience of the Northwestern University Clinical and Translational Sciences (NUCATS) Institute.” Clin Transl Sci. 2015 Aug;8(4):269-71. doi: 10.1111/cts.12294
Identify any specific population(s) this project will serve: 
Medically Underserved Areas/Populations
Behavioral/Social Determinants of Health
Maternal Health
African Americans or Black
Alaska Natives
American Indian
Latino or Hispanic
Native Hawaiians
Pacific Islanders
Identify roles of participants this project will serve: 
Data resource or tool developer
Data scientist
Educator, college & post-grad
Health care provider
Library or information professional
Public health professional
Student, college & post-grad
Project Lead:
Matt Carson
Funding Source: 
Project Funding:
Federal Fiscal Year: 
Funding Amount: 
Funding Period:
May 1, 2020 to Apr 30, 2021
Project Status:
In Progress