Confidence intervals: a useful statistical tool to estimate effect sizes in the real world

Patino, Cecilia Maria; Ferreira, Juliana Carvalho

doi:10.1590/S1806-37562015000000314

PRACTICAL SCENARIO

A prospective cohort study evaluated the association between the presence of asthma and the risk of developing obstructive sleep apnea (OSA) in adults. Adults were randomly recruited from a population-based list of state employees and were followed for four years. Participants with asthma, when compared with those without, had a higher risk of developing OSA in four years (relative risk [RR] = 1.39; 95% CI: 1.06-1.82; p = 0.03)

BACKGROUND

When conducting clinical research, we usually recruit a subgroup of the population of interest in order to increase study efficiency (fewer costs and less time). This subgroup of individuals, thestudy population, are those individuals who meet the inclusion criteria and agree to participate in the study (Figure 1). We then complete the study and calculate an effect size (e.g., a mean difference or a relative risk) to answer our research question. This process (inference) involves using data collected from the study population to estimate the true effect size in the population of interest, i.e., the source population. In our example, investigators recruited a random sample of state employees (source population) who were eligible and agreed to participate in the study (study population) and reported that asthma increases the risk of developing OSA in the study population (RR = 1.39). To take into account a sampling error due to recruiting only a subgroup of the population of interest, they also calculated a 95% confidence interval (around the estimate) of 1.06-1.82, indicating a 95% probability that the true RR in the source population would be between 1.06 and 1.82.

DEFINITION

A confidence interval is a measure of imprecision of the true effect size in the population of interest (e.g., difference between two means or a relative risk) estimated in the study population. That imprecision is due to the sampling error caused by taking subsamples of the population of interest. However, the estimate calculated in the study population is always the best estimate of the effect size in the source population.

Figure 1
Research populations.

WHY DO WE NEED CONFIDENCE INTERVALS?

We need confidence intervals to indicate the amount of uncertainty or imprecision around the effect size calculated, using the study sample to estimate the true effect size in the source population. Calculating the confidence interval is a strategy that takes into account sampling error: the study effect size and its´ confidence interval represent plausible values for the source population, and the narrower the confidence interval is, the more certain we are that the estimate from the study population represents the true effect size in the source population.

CONFIDENCE INTERVALS: INTERESTING FACTS

The most common width of confidence intervals reported in the literature is the 95% confidence interval. However, if we are interested in more or less confidence, 90% or 99% confidence intervals can be used.

The confidence interval represents the uncertainty of the effect sizein the source population, not in the study population. When calculating a confidence interval, the width of the interval is determined by the sample size (i.e., the individuals who agreed to be studied), the amount of measurement error of the study, and the degree of confidence required.

There is a unique relationship between the 95% confidence interval and a two-sided 5% level of significance. When the 95% confidence interval for differences in effect does not include 0 for absolute measures of association (e.g., mean differences) or 1 for relative measures of association (e.g., odds ratios), it can be inferred that the association is statistically significant (p < 0.05). The advantage of the 95% confidence interval over the p value is that it provides information about the size of the effect, the uncertainty of the population estimate, and the direction of the effect.

Confidence intervals should always be used in order to describe the major findings of a research study. The relevant confidence intervals should be shown not only in the text of the paper but also in the abstract.

Publication Dates

Publication in this collection
Nov-Dec 2015

This is an open-access article distributed under the terms of the Creative Commons Attribution License

[1] ¹
Teodorescu M, Barnet JH, Hagen EW, Palta M, Young TB, Peppard PE. Association between asthma and risk of developing obstructive sleep apnea. JAMA. 2015;313(2):156-64. http://dx.doi.org/10.1001/jama.2014.17822
» https://doi.org/10.1001/jama.2014.17822

[2] ²
Sedgwick P. Understanding confidence intervals. BMJ. 2014;349:g6051. http://dx.doi.org/10.1136/bmj.g6051
» https://doi.org/10.1136/bmj.g6051

[3] ³
Sedgwick P. Confidence intervals: predicting uncertainty. BMJ. 2012;344:e3147. http://dx.doi.org/10.1136/bmj.e3147
» https://doi.org/10.1136/bmj.e3147

[4] ⁴
Ferreira JC, Patino CM. What does the p value really mean? J Bras Pneumol. 2015;41(5):485. doi: 10.1590/S1806-37132015000000215.
» https://doi.org/10.1590/S1806-37132015000000215

[5] ⁵
Gardner MJ, Altman DG. Statistics with confidence: confidence intervals and statistical guidelines. British Medical Journal. Belfast: The Universities Press Ltd; 1989.

Brasil