How to Learn RWD Research: Insights Beyond the Textbooks

One question I’ve been repeatedly asked is: how do you truly learn to conduct robust and meaningful real-world data (RWD) studies? The answer, in my experience, goes far beyond formal training or academic courses. Unlike traditional clinical trials, which follow well-established and regulated frameworks, real-world research is often shaped by context, data availability, and practical constraints, and learning to navigate this space requires more than just technical skill. It demands curiosity, critical reading, and a habit of engaging deeply with existing literature, protocols, and regulatory feedback. In this article, I’ll share a few strategies and resources that I’ve found particularly helpful along the way.

 

One of the most valuable ways to build competence in RWD research is by reading, not casually, but analytically. I strongly recommend investing time in the methods sections of real-world evidence (RWE) publications presenting a study using the same data source you are interested in or the same outcome or disease investigated. The method section is where the real lessons often lie. Pay close attention to how researchers define their study populations, select data sources, construct variables, and manage confounding. Ask yourself: what kind of data was used? How was exposure defined? Was it claims, electronic health records (EHRs), or registry data? Each decision reveals trade-offs that shape the validity and generalizability of the results. This kind of reading isn’t passive; it’s an active inquiry into the architecture of a study.

 

Equally important is a critical review of the limitations described by the authors. This section often serves as a window into the real challenges of working with imperfect, heterogeneous data. Understanding what constrained a study, the lack of granularity in outcome definitions, missing data, residual confounding, or temporal ambiguity, helps sharpen your methodological intuition. More importantly, it reveals where the field still needs to evolve, and where thoughtful intervention in study design could yield better evidence. Rather than merely accepting these limitations, try to imagine how you would overcome them. Would it require a different data source, a different temporal design, or a more sophisticated analytic approach?

 

Beyond journal articles, regulatory resources are an underutilized treasure trove for RWD learning. The European Medicines Agency (EMA) maintains an extensive catalogue of RWD sources across the EU (DATA SOURCES). Currently, there are 258 data sources listed, each offering insight into the type, structure, data lags, frequency of database refresh, and utility of data available for research and regulatory decision-making. Spending time on this platform will help you understand what data are available in different regions, how they have been used in the past, and what types of studies are feasible. Alongside this, the EMA’s inventory of RWE studies now exceeds 3,132 entries (STUDIES). Many of these include protocols, which are essential to read. Protocols reveal not just what was studied, but how the study was designed to satisfy regulatory expectations, what was considered adequate in terms of population definition, follow-up time, covariate control, and endpoint ascertainment. These are living examples of applied methodology under regulatory scrutiny.

 

The U.S. Food and Drug Administration (FDA) provides another valuable learning opportunity. Beyond the studies themselves, what I find especially instructive are the commentaries and assessments from reviewers and rapporteurs of the different drug applications. These documents, often included in the publicly available submission dossiers, provide a rare glimpse into the regulatory reasoning process (DOSSIERS). How did they interpret the data source’s reliability? Why an External Control Arm was not considered robust enough? Were the endpoints deemed robust? Did the reviewers flag unmeasured bias, or request sensitivity analyses? These insights allow you to step into the mindset of regulatory science, where methodological rigor and decision utility intersect.

 

Ultimately, learning how to conduct successful RWD research doesn’t come from a single textbook or training course. It comes from immersing yourself in the research literature, deconstructing real protocols, interrogating the limitations of existing studies, and staying attuned to the standards of regulatory science. It requires humility, persistence, and a deep respect for the complexity of generating credible evidence from data not originally designed for research.

 

There is no shortcut, but the path is there for those who take the time to follow it with intention and dare to follow their intuitions.

By Nadia Barozzi

Passionate about data-driven insights and the advancement of Real World Evidence research, drug safety and pharmacovigilance.