USMLE Step 1 Biostatistics Epidemiology: Study Guide

By Jim Breese·Updated 2026-04-30

Biostatistics and epidemiology are critical components of the USMLE Step 1 exam. These subjects test your ability to understand disease patterns, study designs, and statistical analysis in clinical practice.

These topics form the foundation for interpreting research, understanding public health data, and making evidence-based clinical decisions. Mastering these concepts requires understanding both theory and practical applications in real clinical scenarios.

This guide covers essential topics including study designs, statistical measures, hypothesis testing, and epidemiological concepts that appear frequently on Step 1. With focused preparation using flashcards, you can efficiently memorize key formulas, definitions, and clinical examples.

Key Takeaways

•Sensitivity and specificity are test characteristics independent of disease prevalence, while PPV and NPV depend on how common disease is in the study population
•Study design hierarchy from weakest to strongest evidence: case reports/series, cross-sectional, case-control, cohort, and randomized controlled trials
•Internal validity threats include selection bias, information bias, and confounding. External validity determines whether results apply beyond the study population
•Master key formulas including NNT equals 1/ARR, Relative Risk from cohort studies, and Odds Ratio from case-control studies to interpret research quantitatively
•Causation requires establishing temporal relationship, biological plausibility, dose-response relationship, and consistent findings across studies. Correlation alone doesn't establish cause
•Incidence measures new cases over time while prevalence measures existing cases at a point in time. Prevalence approximately equals incidence multiplied by disease duration

Core Biostatistics Concepts You Must Master

Biostatistics forms the quantitative backbone of Step 1's biostatistics and epidemiology section. You need to master key statistical measures that evaluate test performance.

Key Diagnostic Measures

Sensitivity represents the true positive rate. Calculate it as TP/(TP+FN). If someone has the disease, what's the probability the test is positive? This is what sensitivity answers.

Specificity tells you the true negative rate. Calculate it as TN/(TN+FP). If someone doesn't have the disease, what's the probability the test is negative?

These measures don't change with disease prevalence. This makes them crucial for evaluating test performance in any population.

Positive predictive value (PPV) and negative predictive value (NPV) tell you what clinicians actually care about. PPV calculates as TP/(TP+FP) and answers: if my patient tests positive, what's the chance they actually have the disease? NPV calculates as TN/(TN+FN).

These values are prevalence-dependent. In populations with higher disease prevalence, a positive test is more likely to indicate true disease.

Additional Critical Concepts

You must also understand:

Number needed to treat (NNT): How many patients to treat to prevent one adverse event
Number needed to harm (NNH): How many patients to treat until one experiences harm
Relative risk (RR): Risk ratio between two groups
Odds ratio (OR): Comparison of odds between groups
Attributable risk: Excess risk from exposure

Standard deviation, confidence intervals, and p-values determine statistical significance. You must understand type I errors (false positives, significance level alpha) and type II errors (false negatives, related to power).

The normal distribution, z-scores, and t-tests appear frequently on exams. Study these concepts systematically, focusing on clinical examples rather than abstract mathematics.

Study Designs: The Foundation of Epidemiological Evidence

Understanding epidemiological study designs is essential for Step 1 success. Study designs range from weakest to strongest evidence.

Study Design Hierarchy

Case reports and case series
Cross-sectional studies
Case-control studies
Cohort studies
Randomized controlled trials (RCTs)

Understanding Each Design

Case reports and case series describe individual patient experiences without comparison groups. They provide no causation evidence but identify new phenomena.

Cross-sectional studies measure disease and exposure simultaneously. They provide prevalence data but no causation inference. They're useful for identifying associations but can't establish temporality.

Case-control studies start with diseased and non-diseased individuals. Then researchers look backward at exposure history. They're efficient for rare diseases and calculate odds ratios. However, they can't establish causation definitively.

Cohort studies follow exposed and unexposed individuals forward in time. They calculate relative risk directly and establish causation better than case-control studies. However, they take longer and cost more.

Randomized controlled trials (RCTs) randomly assign participants to intervention or control. Random assignment minimizes bias and establishes causation most definitively. They provide the strongest evidence.

Key Study Design Characteristics

You should know the advantages and disadvantages of each design. Common threats include:

Selection bias: Systematic differences between groups at baseline
Information bias: Measurement error affecting outcome assessment
Confounding: Third variables affecting results

Remember that each design calculates different statistics: prevalence for cross-sectional, odds ratio for case-control, relative risk for cohort, and risk reduction for RCTs.

Interpreting Research: Validity, Bias, and Causation

Step 1 tests your ability to critically evaluate research studies. Understanding validity and bias is essential for this skill.

Internal and External Validity

Internal validity refers to whether a study actually measures what it claims to measure within that specific population. Threats to internal validity include:

Selection bias: Systematic differences between groups at baseline
Information bias: Systematic measurement error
Confounding: Unmeasured third variables affecting results

External validity refers to whether results apply to populations outside the study. A study with high internal validity might have low external validity if the participants aren't representative of broader populations.

Recognizing Common Biases

You need to recognize sources of bias in research:

Recall bias: Participants misremember past events
Observer bias: Study personnel treat groups differently
Detection bias: Outcome measurement differs between groups

Establishing Causation

Establishing causation requires meeting Bradford Hill criteria:

Temporal relationship (exposure precedes disease)
Dose-response relationship (more exposure increases risk)
Biological plausibility (mechanism makes sense)
Strength of association (strong effect)
Consistency across studies (repeated findings)
Reversibility (removing exposure reduces risk)

Correlation does not equal causation. This fundamental principle appears frequently on Step 1.

Statistical vs. Clinical Significance

Statistical significance (p<0.05) means results weren't due to chance. Clinical significance means the effect size matters in practice.

A study might be statistically significant with p=0.04 but have a clinically insignificant effect size. Confidence intervals provide more useful information than p-values alone because they show the range of plausible effect sizes.

Understanding these concepts helps you interpret study quality and recognize when conclusions extend beyond what data supports.

Epidemiological Concepts: Risk, Rates, and Disease Surveillance

Epidemiology applies statistical methods to understand disease patterns in populations. Key terms distinguish different ways to measure disease occurrence.

Understanding Risk and Rates

Risk represents the probability that someone develops disease over a specific time period. Calculate it as new cases divided by population at risk.

Incidence rate equals new cases divided by (population at risk times time period). It measures new disease occurrence.

Prevalence equals total cases divided by total population. It represents a snapshot at one point in time.

Understanding the relationship between incidence and prevalence is critical. Prevalence depends on how many new cases develop (incidence) and how long people survive with the disease. In a stable situation, prevalence approximately equals incidence multiplied by average disease duration.

Additional Epidemiological Measures

Attack rate: Equals ill people divided by exposed population (used in outbreak investigations)
Case fatality rate: Equals deaths from disease divided by total diagnosed cases
Mortality rate: Equals deaths divided by total population at risk

These distinctions matter for different epidemiological questions.

Risk Reduction and Treatment Impact

Relative risk reduction (RRR) shows the proportional decrease in risk. Calculate it as (control event rate minus experimental event rate) divided by control event rate.

Absolute risk reduction (ARR) shows the actual difference: control event rate minus experimental event rate.

Number needed to treat (NNT) equals 1 divided by ARR. It indicates how many patients you must treat to prevent one adverse event.

Epidemiologists also use odds ratios when calculating from case-control studies. Disease surveillance systems track disease occurrence to identify outbreaks and trends. You should understand how diseases spread through populations using epidemic curves and basic outbreak investigation steps including establishing case definitions and calculating attack rates.

Practical Step 1 Preparation Strategies for Biostatistics and Epidemiology

Successfully mastering biostatistics and epidemiology for Step 1 requires deliberate, focused preparation. Apply these evidence-based study strategies.

Building Conceptual Understanding

First, create concept maps showing relationships between study designs, statistical measures, and clinical applications. The exam tests applied knowledge, not pure statistics. Questions always connect concepts to clinical scenarios.

Practice calculating sensitivity, specificity, PPV, and NPV repeatedly with different disease prevalences. This internalizes how prevalence affects predictive values. Create flashcards for formulas with clinical examples on the back side.

Systematic Study Approach

Study bias types systematically by writing scenario descriptions for each type and identifying them in practice questions. For study designs, create comparison tables showing:

When to use each design
What statistics they generate
Advantages and disadvantages
Time requirements and costs

Time management matters. Biostatistics and epidemiology questions require careful reading but don't demand extreme computational complexity. Practice interpreting graphs showing disease trends, survival curves, and dose-response relationships.

Question Practice and Active Learning

Review board-style questions that present studies and ask about validity threats or appropriate statistical analyses. Join study groups to explain concepts aloud. Teaching others reinforces your understanding. Use spaced repetition to revisit difficult topics at increasing intervals.

Since many concepts build on each other, ensure you master basics before advancing. Resources like USMLE-focused textbooks, question banks (NBME, UWorld), and review courses provide comprehensive coverage.

Clinical Application Focus

Remember that Step 1 emphasizes clinical applicability. Always think about how concepts apply to patient care and disease management. This perspective helps you remember information and answer questions correctly.

Start Studying USMLE Step 1 Biostatistics and Epidemiology

Master critical concepts with interactive flashcards designed for USMLE success. Create custom decks focusing on study designs, statistical formulas, and clinical interpretation. Use spaced repetition to efficiently retain information and boost your Step 1 score.

Create Free Flashcards

Frequently Asked Questions

What's the difference between sensitivity, specificity, and positive predictive value on USMLE Step 1?

Sensitivity and specificity are test characteristics that don't change with disease prevalence. Sensitivity (true positive rate) answers: if someone has the disease, what's the probability the test is positive? Calculate it as TP/(TP+FN).

Specificity (true negative rate) answers: if someone doesn't have the disease, what's the probability the test is negative? Calculate it as TN/(TN+FP).

Positive predictive value (PPV) answers what clinicians actually need to know: if my patient tests positive, what's the probability they actually have the disease? Calculate PPV as TP/(TP+FP).

The critical difference: PPV changes with disease prevalence. In populations with higher disease prevalence, a positive test is more likely to indicate true disease.

Step 1 tests whether you understand these distinctions and can apply them clinically. A highly sensitive test is useful for ruling out disease. A negative result excludes disease. A highly specific test is useful for ruling in disease. A positive result confirms disease.

How should I study different epidemiological study designs for the USMLE?

Create a systematic comparison table for each study design. Include these elements:

Definition
Direction of inference (forward or backward in time)
Starting point (disease or exposure)
Statistics calculated
Time requirements
Cost
Advantages and disadvantages

Case-control studies start with disease and look backward at exposure. They're efficient for rare diseases but calculate odds ratios, not relative risks.

Cohort studies start with exposure and follow forward. They calculate relative risk directly and establish causation better. However, they take longer.

Randomized controlled trials minimize bias through random assignment.

Practice identifying which design applies to specific clinical questions. For example, investigating a foodborne illness outbreak requires a cohort study because you identify exposed and unexposed people and follow them forward. Investigating a rare cancer by looking at medical records for past exposure is a case-control study.

Create flashcards with study design characteristics on one side and clinical scenarios on the other. Use spaced repetition to review these repeatedly because distinguishing between designs is fundamental to understanding epidemiology.

What biostatistics formulas must I memorize for Step 1?

Key formulas you absolutely must memorize:

Sensitivity = TP/(TP+FN)
Specificity = TN/(TN+FP)
PPV = TP/(TP+FP)
NPV = TN/(TN+FN)
Relative Risk = Risk in exposed / Risk in unexposed
Odds Ratio = (TP x FN) / (TN x FP) for case-control studies
Number Needed to Treat = 1 / Absolute Risk Reduction
Absolute Risk Reduction = Control Event Rate minus Experimental Event Rate
Attributable Risk = Risk in exposed minus Risk in unexposed
Standard Error = Standard Deviation / √n
95% Confidence Interval = estimate ± 1.96 × Standard Error

Understanding what each formula means clinically matters more than memorization. Create flashcard decks with formulas on front and clinical interpretations on back. Practice calculating these with real numbers from sample questions.

Most Step 1 questions won't require extensive calculations. They'll test conceptual understanding of what these statistics mean and how to interpret them in clinical context.

How do I recognize bias and validity threats in USMLE Step 1 questions?

Step 1 tests your ability to identify threats to study validity in clinical scenarios.

Selection bias occurs when systematic differences exist between comparison groups at baseline. Example: a diet study comparing healthy volunteers (diet group) with hospitalized patients (control group) has selection bias because groups differ fundamentally.

Information bias involves systematic measurement error. A study asking participants to recall diet from 20 years ago has recall bias.

Observer bias occurs when those measuring outcomes treat groups differently. For example, staff might measure outcomes more carefully in one group.

Confounding happens when an unmeasured third variable explains the association. If a study shows coffee drinkers have more heart disease but doesn't account for smoking, smoking is a confounder because coffee drinkers smoke more.

When reading Step 1 questions, ask yourself: Could unmeasured variables explain results? Did the groups differ at baseline? Were measurements systematic and unbiased? Did researchers follow participants long enough to establish temporal relationships?

Questions often describe studies with obvious flaws and ask what validity threat is present. Practice identifying these flaws by reviewing incorrect studies in question banks and noting which validity threats appear most frequently.

Why are flashcards particularly effective for learning biostatistics and epidemiology?

Flashcards leverage spaced repetition and active recall, which are scientifically proven study methods. Biostatistics and epidemiology require memorizing definitions, formulas, study design characteristics, and statistical interpretations. This is exactly what flashcards excel at teaching.

With flashcards, you actively retrieve information from memory rather than passively rereading material. This strengthens neural connections and improves retention.

Spaced repetition ensures you review material at optimal intervals before forgetting. This makes long-term retention more efficient than cramming.

You can create multi-layered flashcards effectively. Front side shows a formula or concept name. Back side includes the definition, calculation method, and a clinical example. This structure mirrors how Step 1 tests material by presenting clinical scenarios that require you to identify and apply concepts.

Flashcards also enable quick review sessions during busy schedules, supporting consistent learning. Digital flashcard apps track which concepts you know well and which need more review, optimizing study time.

Additionally, creating your own flashcards forces you to synthesize information and identify key concepts. This itself improves learning and retention before you ever review them.