CLiREN-LMS

Clinical Research Data Management

Manual-backed clinical research data management course synchronized from the completed course manual.

12 chapters / 12-week blended course

Course Path

133 lessons across 12 modules.

Log in to track completion and resume your course.

Create Account to Enroll

Objectives

  • Apply clinical research data management principles across the study lifecycle.
  • Design REDCap databases and CRFs from protocol requirements.
  • Use R for data cleaning, descriptive analysis, visualization, and reporting.
  • Document, validate, govern, and prepare clinical research data for reuse.
Module 1

Foundations of Clinical Research Data Management

Clinical research is one of the major routes through which societies generate evidence about health, disease, prevention, diagnosis, treatment, and care delivery. It may involve clinical trials of new medicines or vaccines, observational studies of disease progression, surveillance systems that monitor public health events, diagnostic evaluations, implementation studies, registries, or operational research designed to improve health services. Although these studies differ in design and scale, they share a common dependency: their conclusions are only as reliable as the data on which they are based.

  1. Foundations of Clinical Research Data Management: Orientation
  2. Introduction to Clinical Research Data Management
  3. The Evolution of Clinical Data Management
  4. Why Data Management Matters
  5. The Clinical Research Lifecycle
  6. The Role of the Clinical Data Manager
  7. Characteristics of High-Quality Data
  8. Data Flow in Clinical Research
  9. Introduction to Good Clinical Practice
  10. Introduction to FAIR Data Principles
  11. Data Protection, Ethics, and Confidentiality
  12. Foundations of Clinical Research Data Management: Summary and Assessment
Module 2

Protocol Translation and CRF Design

The study protocol is the central scientific, operational, and regulatory document for a clinical research study. It describes why the study is being conducted, what question it intends to answer, who will be included, what procedures will be performed, how participant safety will be protected, what outcomes will be measured, and how the data will be analyzed. For the clinical data manager, the protocol is more than a narrative description of a study. It is the blueprint from which data requirements, case report forms, database structures, validation checks, monitoring reports, and analysis datasets are derived.

  1. Protocol Translation and CRF Design: Orientation
  2. Understanding the Study Protocol
  3. Translating Research Objectives into Data Requirements
  4. Primary and Secondary Outcomes
  5. Developing a Data Collection Matrix
  6. Principles of CRF Design
  7. Visit Schedules and Longitudinal Data Collection
  8. Variable Naming and Coding Standards
  9. Data Dictionaries and Metadata
  10. CRF Version Control
  11. Common CRF Design Errors
  12. Protocol Translation and CRF Design: Summary and Assessment
Module 3

Database Design in REDCap

REDCap, which stands for Research Electronic Data Capture, is a secure, web-based application widely used for building and managing research databases and online surveys. It is particularly well suited to clinical and translational research because it allows study teams to design data collection instruments, manage records, define user permissions, create reports, export datasets, and preserve audit trails without requiring every data manager to become a software developer. In many research institutions, REDCap has become a standard platform for electronic case report forms, registries, clinical trial databases, surveillance systems, and operational research projects.

  1. Database Design in REDCap: Orientation
  2. Introduction to REDCap
  3. REDCap Project Types
  4. Project Setup and Online Designer
  5. Instruments, Fields, and Variable Types
  6. Validation Rules
  7. Branching Logic
  8. Calculated Fields
  9. Longitudinal Projects and Events
  10. Repeating Instruments
  11. Data Access Groups
  12. User Rights and Permissions
  13. Moving from Development to Production
  14. Testing and User Acceptance Testing
  15. Database Design in REDCap: Summary and Assessment
Module 4

Data Entry, Validation, and Access Control

Data entry is the process through which clinical research observations, measurements, assessments, and documents become structured data in a research database. Although the phrase may sound simple, data entry is a critical stage in the research lifecycle. It is the point at which protocol-defined information is transferred from clinical practice, laboratory work, participant interviews, field activities, or source records into a system that will eventually support monitoring, analysis, reporting, and archival.

  1. Data Entry, Validation, and Access Control: Orientation
  2. Data Entry Workflows
  3. Source Documents and Source Data Verification
  4. Single Entry, Double Entry, and Direct Electronic Capture
  5. Data Validation During Entry
  6. Required Fields and Completion Guidelines
  7. Role-Based Access Control
  8. Data Access Groups in Multisite Studies
  9. Audit Trails
  10. User Training and Competency
  11. Common Data Entry Errors
  12. Data Entry, Validation, and Access Control: Summary and Assessment
Module 5

Data Quality Management and Query Resolution

Data quality is the degree to which data are fit for their intended use. In clinical research, intended use includes participant safety oversight, protocol compliance, statistical analysis, regulatory reporting, publication, data sharing, and long-term archival. Data quality is therefore not a single property and cannot be judged only by whether a dataset contains values. A dataset may be complete but inaccurate, accurate but late, valid but poorly documented, or internally consistent but not suitable for answering the study question.

  1. Data Quality Management and Query Resolution: Orientation
  2. Understanding Data Quality
  3. Dimensions of Data Quality
  4. Sources of Data Quality Problems
  5. Data Quality Planning
  6. REDCap Data Quality Module
  7. Query Management Workflow
  8. Writing Effective Queries
  9. Query Metrics
  10. Central Monitoring
  11. Risk-Based Data Monitoring
  12. Database Freeze and Database Lock
  13. Data Quality Management and Query Resolution: Summary and Assessment
Module 6

Introduction to R for Clinical Data Management

Clinical data management is increasingly dependent on the ability to move between data collection systems, statistical software, reporting tools, and documentation workflows. In many studies, the primary database may be implemented in REDCap, OpenClinica, Medidata Rave, Castor, or another electronic data capture system. However, data managers often need to perform tasks that go beyond the point-and-click interface of the database. They may need to compare exports, check data consistency across instruments, generate query lists, reconcile laboratory files, prepare monitoring reports, summarize missingness, inspect patterns across sites, or document a cleaning decision in a way that can be repeated later. R is valuable because it allows these tasks to be written as reusable scripts rather than performed manually each time [@rcore2024r; @wickham2023r4ds].

  1. Introduction to R for Clinical Data Management: Orientation
  2. Why R Matters for Clinical Data Management
  3. Installing R and RStudio
  4. Understanding the RStudio Interface
  5. R Projects and Folder Structure
  6. Scripts, Console, Environment, and Working Directory
  7. Objects, Vectors, and Data Frames
  8. Installing and Loading Packages
  9. Introduction to the tidyverse
  10. Importing REDCap Exports, CSV, and Excel Files
  11. Inspecting Data with glimpse, summary, head, names, and dim
  12. Basic Data Quality Checks in R
  13. Introduction to R for Clinical Data Management: Summary and Assessment
Module 7

Data Cleaning and Preparation in R

Data cleaning is the organized process of identifying, investigating, documenting, and resolving problems in research data. It is not a casual activity performed after data collection is complete. It is part of the quality system of a study. In clinical research, cleaning begins before the first participant is enrolled, because the protocol, CRFs, database design, validation rules, completion guidelines, and monitoring plan all determine what kinds of errors are likely to occur and how they will be handled. R becomes useful when those expectations can be translated into transparent, repeatable checks and preparation steps.

  1. Data Cleaning and Preparation in R: Orientation
  2. The Purpose of Data Cleaning and Preparation
  3. Raw, Cleaned, and Analysis-Ready Datasets
  4. Importing and Exporting Data Through the REDCap API
  5. Handling Missing Data in Cleaning Workflows
  6. Recoding Categorical Variables
  7. Creating Derived Variables
  8. Writing Readable Cleaning Scripts
  9. Cleaning Logs and Query Outputs
  10. Preparing Datasets for Analysis and Reporting
  11. Common Cleaning Risks and How to Avoid Them
  12. Data Cleaning and Preparation in R: Summary and Assessment
Module 8

Data Analysis in R

Descriptive analysis is the process of summarizing data so that a study team can understand what has been collected. In clinical research, it is often associated with final reports, manuscripts, or statistical analysis plans. However, descriptive analysis is also central to data management. A data manager needs to know how many participants have been enrolled, how many records are incomplete, whether follow-up outcomes are missing, whether sites have similar patterns of data entry, whether adverse events are being reported consistently, and whether numeric values fall within plausible clinical ranges. These questions are descriptive before they are inferential.

  1. Data Analysis in R: Orientation
  2. Why Descriptive Analysis Matters in Clinical Data Management
  3. Preparing Data Before Analysis
  4. Summarizing Categorical Variables
  5. Summarizing Numeric Variables
  6. Grouped Summaries for Monitoring and Reporting
  7. Cross-Tabulations and Proportions
  8. Creating Report-Ready Tables in R
  9. Interpreting Descriptive Outputs
  10. Exporting Summary Outputs
  11. Common Errors in Descriptive Analysis
  12. Data Analysis in R: Summary and Assessment
Module 9

Data Visualization and Dashboards

Tables are essential in clinical research, but they are not always the best way to detect patterns. Visualization allows a study team to see distributions, trends, outliers, site differences, missingness patterns, and operational bottlenecks. A good graph can show that one site has delayed follow-up entry, that a laboratory value has an implausible cluster, that enrollment slowed after a protocol amendment, or that query resolution improved after retraining. Visualization is therefore not merely a presentation tool; it is a data management and monitoring tool.

  1. Data Visualization and Dashboards: Orientation
  2. Why Visualization Matters in Clinical Data Management
  3. Principles of Good Clinical Data Visualization
  4. Creating Basic Charts with ggplot2
  5. Visualizing Missingness and Data Quality
  6. Enrollment and Follow-Up Monitoring Plots
  7. Introduction to Shiny Dashboards
  8. Dashboard Governance and Privacy
  9. Data Visualization and Dashboards: Summary and Assessment
Module 10

Reporting and Reproducibility

Clinical research reports are often updated repeatedly. A weekly data quality report may be generated every Friday. An enrollment report may be reviewed at every trial management meeting. A query summary may be shared with sites monthly. A manuscript table may be updated whenever the database changes. If these reports are produced manually, the risk of inconsistency is high. Reproducible reporting reduces that risk by connecting the report directly to the code and data used to generate it.

  1. Reporting and Reproducibility: Orientation
  2. Why Reproducible Reporting Matters
  3. R Markdown and Quarto Basics
  4. Designing a Data Management Report
  5. Workflow Documentation
  6. Version Control and Review
  7. Reporting and Reproducibility: Summary and Assessment
Module 11

Data Documentation and Metadata

A dataset without documentation is fragile. Even if the data are accurate, future users may not understand what variables mean, how values were coded, what population is represented, which records were excluded, which dates were derived, or what missing codes mean. Clinical research data are often used long after collection ends. They may support manuscripts, audits, secondary analyses, data sharing, regulatory review, or future pooled analyses. Documentation makes these uses possible.

  1. Data Documentation and Metadata: Orientation
  2. Why Documentation Matters
  3. Codebooks and Data Dictionaries
  4. FAIR Principles and Clinical Research Data
  5. Archival and Governance
  6. Preparing a Documentation Package
  7. Data Documentation and Metadata: Summary and Assessment
Module 12

Final Project Preparation, Presentation, and Course Integration

The final project is the capstone of the course. Its purpose is to bring together the major skills developed across the previous chapters: protocol interpretation, CRF design, REDCap database development, validation rules, data entry workflow, data quality management, R-based cleaning, descriptive analysis, visualization, reporting, documentation, and governance. The project should demonstrate not only that the learner can use tools, but that they can use them responsibly within a clinical research data management workflow.

  1. Final Project Preparation, Presentation, and Course Integration: Orientation
  2. Purpose of the Final Project
  3. Defining Project Scope
  4. Suggested Project Themes
  5. Preparing Final Deliverables
  6. Presentation and Peer Review
  7. Assessment Rubric
  8. Course Integration and Professional Reflection
  9. Final Project Preparation, Presentation, and Course Integration: Summary and Assessment