{"id":101,"date":"2017-04-05T20:34:18","date_gmt":"2017-04-05T10:34:18","guid":{"rendered":"http:\/\/www.torusresearch.com.au\/?p=101"},"modified":"2020-05-20T09:21:19","modified_gmt":"2020-05-19T23:21:19","slug":"sampling-power-sample-size","status":"publish","type":"post","link":"https:\/\/www.torusresearch.com.au\/?p=101","title":{"rendered":"Sampling, power and sample size"},"content":{"rendered":"<h1>Introduction<\/h1>\n<p>Here we\u2019ll take a brief, but hopefully practical, overview of sampling in the context of designing a research project, and how to select a suitable sample size.<\/p>\n<p>The participants you include in your study are an important consideration, to ensure that the results of your study can be extended or generalised to a larger group than your actual study participants. This is known as \u2018generalisability\u2019, or \u2018external validity\u2019.<\/p>\n<p>Sample size is important to make sure that you include enough subjects to have a reasonable chance of detecting a difference between groups, for example difference in response rates to a new treatment compared with an old treatment, if there is one, and that you do not unnecessarily include too many subjects which might be unethical or an unjustifiable use of scarce research funding or resources. Having said that, a larger group is usually very desirable to improve the precision of the findings of your study and enable more detailed analyses to be run.<\/p>\n<p>Study protocols and ethics applications invariably require a justification for sample size.<\/p>\n<h1>What is a sample?<\/h1>\n<p>If we want to know the exact \u2018truth\u2019 of something, we can measure every person (or item) in a whole population. This is a \u2018census\u2019 \u2013 \u2018a study that involves the observation of every member of a population\u2019. Measuring an entire population, for example in the periodic Australian census, is expensive and inconvenient, and not usually feasible in the context of medical research.<\/p>\n<p>Instead we select a much smaller number of subjects who we hope are representative of the population we\u2019d like to study.<\/p>\n<p>We assume that this reflects the population, and that the results are estimates of the \u2018truth\u2019. Hence, summary statistics such as mean, median or proportion are given with standard deviation, standard error or a confidence interval, to indicate the precision of our estimate.<a href=\"#_ENREF_1\"><sup>1<\/sup><\/a><\/p>\n<p>We do a research study, in the biomedical or veterinary context, in order to apply the result to some &#8216;target population&#8217;. For example, if you are a respiratory clinician, you might like to know\u00a0whether dornase alfa administered to\u00a0children and adolescents aged 5 \u2013 18 with cystic fibrosis was effective in improving lung function in this group.<\/p>\n<p>The target population in this example is \u2018children and adolescents aged 5 \u2013 18 with cystic fibrosis\u2019. It is self-evident that it would not be feasible to test the drug in ALL people with cystic fibrosis aged 5 \u2013 18 in the world; however we would like to apply the results to this group in general. Thus, the research needs to be conducted in a group which <strong>represents<\/strong> all people with cystic fibrosis aged 5 \u2013 18 years.<\/p>\n<p>The research process can be pictured as an idea, followed by development of a structured research question, typically along the PICO guideline of Population, Intervention, Comparator. This is followed by development of specific statistically tested hypothesis or hypotheses. We then go ahead and conduct out study, think about our results and draw our conclusions.<\/p>\n<figure id=\"attachment_149\" aria-describedby=\"caption-attachment-149\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-149\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide4.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-149\" class=\"wp-caption-text\">Figure 1. The research cycle from the idea to application.<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<p>Hopefully, the conclusions will be applied to our original wider population of interest. This is the essence of the process of translation of research into practice.<\/p>\n<p>There is a big \u2018step down\u2019 from the target population to the actual study sample. It is very useful to conceptualise this using a diagram like this on \u2013 you can see how important the selection of the study sample is.<\/p>\n<p>You have a large \u2018target population\u2019, the population to which you wish to apply the conclusions of your study.<\/p>\n<p>Everitt defines a target population as<\/p>\n<p><em>\u2018The collection of individuals, items, measurements, etc. about which is is required to make inferences. Often the population actually sampled differs from the target population, and this may result in misleading conclusions being made\u2019.<\/em> <a href=\"#_ENREF_1\"><sup>1<\/sup><\/a><\/p>\n<h1>The sampling frame<\/h1>\n<p>Then you have a \u2018sampling frame\u2019. This is the group of subjects to which you have access \u2013 essentially a list &#8211; from which you hope to select your study sample.<\/p>\n<p>It could be quite specific, for example, you may have an actual list of all the patients in your clinic. You may have a list for potential control subjects.<\/p>\n<p>It might be less specific; you may not have a list of patients in your clinic, but plan just to invite everyone who comes to clinic over a specified time period, say 3 months or 6 months.<\/p>\n<p>It may not be practical to invite every single person in your sampling frame to participate in the study. For example, if you are recruiting over a 3 month time period, those attending over the other nine months of the year will not be invited. Thus, not everyone in the sampling frame is likely to be invited to participate in your study.<\/p>\n<p>You might have a list of 6,000 people from whom to select your study subjects \u2013 it is not practical to invite all of them to participate in your study.<\/p>\n<p>So the invited study subjects are a subset of the sampling frame. (See selecting a random sample in later slides).<\/p>\n<h1>The study sample<\/h1>\n<p>Of those who are invited to participate, not all will agree or consent to being part of your study. So in turn, the initial study sample will be a subset of the invited subjects.<\/p>\n<p>Finally, of those people who consent to participate in your study, some may prove ineligible for the study, if there is any kind of screening for eligibility, and some are likely to drop out.<\/p>\n<p>The subjects left at the end of the study, for whom outcome information is available, constitute the final <strong>study sample<\/strong>.<\/p>\n<figure id=\"attachment_150\" aria-describedby=\"caption-attachment-150\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-150\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide5.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-150\" class=\"wp-caption-text\">Figure 2. Target population to study sample.<\/figcaption><\/figure>\n<h1>Dropout<\/h1>\n<p>The reason for drop out is important. Participants may drop out of studies because of simply electing not to continue with a study, by moving away, or because of illness or death.<\/p>\n<p>Any study dropout is undesirable, but drop out because of illness or death must be taken into account in the statistical analysis. For example, in our cystic fibrosis example, the study was conducted over 96 month. If, hypothetically, some had dropped out because of illness (with very poor lung function) or death, especially if there were more dropouts in the active treatment group, those surviving until the end of the study will be those with the best lung function. This might produce a spurious \u2018result\u2019 of improved lung function in the treatment group.<\/p>\n<p>The importance of the reason for dropout will vary depending on the nature of the research that you are doing, but it is very important to consider the implications of dropout at the study design stage.<\/p>\n<figure id=\"attachment_151\" aria-describedby=\"caption-attachment-151\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-151\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide6.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-151\" class=\"wp-caption-text\">Figure 3. Dropout from the initial study sample.<\/figcaption><\/figure>\n<h1>Summary and practical example \u2013 target population to study sample<\/h1>\n<p>Figure 4 below is a summary of the population to which the study results are intended to be applied, and the flow of patients through a study. Note that essentially all of the \u2018standards of reporting\u2019 such as CONSORT .<a href=\"#_ENREF_2\"><sup>2<\/sup><\/a>and STROBE,<a href=\"#_ENREF_3\"><sup>3<\/sup><\/a> by which editors and reviewers are guided, strongly recommend the use of a flowchart in your manuscript. If you are writing a Cochrane Review a flowchart is essential.<a href=\"#_ENREF_4\"><sup>4<\/sup><\/a><\/p>\n<figure id=\"attachment_152\" aria-describedby=\"caption-attachment-152\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-152\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide7.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-152\" class=\"wp-caption-text\">Figure 4. Selection process through a study.<\/figcaption><\/figure>\n<p>Hypothetically, suppose we want to run a trial of some drug, or an observational study, in people with cystic fibrosis. We want our conclusions to be applicable to all people with cystic fibrosis. We are based in Queensland and have access to patients of the Queensland clinic. Are our potential study subjects representative of all people with CF?<\/p>\n<p>Consider the flow of subjects through the study. We are restricted by feasibility to the Queensland clinics, probably restricted to the Brisbane clinic. This might give us a sampling frame of 300 people. From this, we consider the inclusion and exclusion criteria, which will render a proportion of subjects ineligible for the study at this step. Additionally, we may have budgetary constraints that limits the number of subjects we can include in the study. We have to \u2018guestimate\u2019 the likely study sample, the potential dropouts to the study, and work backwards to invite a suitable number of participants. Let&#8217;s say we invite 80 people to participate.<\/p>\n<p>Of those invited to participate, not all will consent. Some patients will be found to be ineligible during the patient information step of obtaining informed consent. Here, we have 20 patients who either refuse consent or are ineligible when we initially invite them to participate.<\/p>\n<p>Additionally, sometimes it is not possible to assess subjects for eligibility before consent, for example if some screening test, like a blood test, needs to be applied. In this hypothetical example, we lost some subjects who are ineligible at this point, and others who may drop out for reasons either unrelated or related to the study. We lose another 15 subjects here.<\/p>\n<p>Thus our final study sample is 45 people (Figure 5).<\/p>\n<figure id=\"attachment_153\" aria-describedby=\"caption-attachment-153\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-153\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide8.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-153\" class=\"wp-caption-text\">Figure 5. Flowchart for a hypothetical study in cystic fibrosis.<\/figcaption><\/figure>\n<h1>What sample size do I need to select?<\/h1>\n<p>This brings us nicely to the topic of sample size \u2013 \u2018how many subjects do we need in our study\u2019, because the statistical analysis will be performed on the study sample, i.e. those subjects who have actually participated in the study.<\/p>\n<h1>Estimates and measures of uncertainty<\/h1>\n<p>To expand a little on estimates and confidence intervals:<\/p>\n<p>Any statistic (mean, median, proportion) calculated from the study sample is only an estimate of the \u2018true\u2019 statistic we would get if we measured every person in the target population<\/p>\n<p>If we started again and selected a different sample, we would get a slightly different statistic. For this reason, statistics calculated from samples should be reported with a confidence interval. For a 95% confidence interval \u2013 if you repeated your study 100 times, 95 times out of 100 the statistic would lie within the confidence intervals.<a href=\"#_ENREF_5\"><sup>5-7<\/sup><\/a><\/p>\n<p>For example for 20 subjects, the mean FEV<sub>1<\/sub>\u00a0and 95% confidence interval might be 60% \u2013 80%. As the number of subjects increases the confidence interval decreases. If you measured 40 subjects, the mean FEV<sub>1<\/sub>\u00a0might still be 70% or something close to it, but the confidence interval would be narrower, say 65% &#8211; 75%.<\/p>\n<figure id=\"attachment_155\" aria-describedby=\"caption-attachment-155\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-155\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide10.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-155\" class=\"wp-caption-text\">Figure 6. The confidence interval narrows (estimate becomes more precise) as the number of study subjects increases.<\/figcaption><\/figure>\n<h1>Clustering<\/h1>\n<p>Valid conclusions assume that the study sample constitutes a random sample. Everitt<a href=\"#_ENREF_1\"><sup>1<\/sup><\/a> defines a random sample as:<\/p>\n<p><em>&#8216;\u2026.a sample of n individuals selected from a population in such a say that each sample of the same size is equally likely&#8217;<\/em>. <a href=\"#_ENREF_1\"><sup>1<\/sup><\/a><\/p>\n<p>For example, in the diagram, if the blue circle represents the sampling frame, and the red dots represent subjects in your study, your comparisons assume the first picture, not the second.<\/p>\n<figure id=\"attachment_156\" aria-describedby=\"caption-attachment-156\" style=\"width: 960px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-156\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/Slide12.jpg\" alt=\"\" width=\"960\" height=\"720\" \/><figcaption id=\"caption-attachment-156\" class=\"wp-caption-text\">Figure 7. Random sample and clustered sample<\/figcaption><\/figure>\n<p>If there is clustering:<\/p>\n<p>Firstly, the results will be biased \u2013 because the sample is not representative. You may have selected subjects who are sicker, older, younger, richer, poorer, or different in some other attribute, than the average in the target population. Similar comments apply to any control or reference group.<\/p>\n<p>OR it may be possible to adjust for clustering in analysis \u2013 you can use techniques like stratifying your study subjects, or more sophisticated techniques which are outside the scope of this article.<\/p>\n<h1>Practical calculation of sample size<\/h1>\n<p>Calculating sample size is a whole field of study in itself, but in its basic form is easy. For example, in a cross sectional study, you might wish to measure the proportion of mothers at a particular hospital who deliver by caesarean section. How many mothers do you need to assess for mode of delivery to obtain a reasonably accurate or precise estimate?<\/p>\n<p>You might want to know if term babies born by caesarean section are of \u2018normal\u2019 mean birth-weight. You may already know what a normal birth-weight is for Australian born babies.<\/p>\n<p>Probably the most common type of sample size calculation is for a comparison of two means. Most sample size calculations come down to two main types:<\/p>\n<h1>Planning your study: sample size questions<\/h1>\n<p>How many subjects do I need to show a statistically significant effect (if there is one) of my intervention\/risk factor? An alternate way of phrasing this question is \u2018how many subjects to I need to show a clinically significant difference of X between two groups?\u2019 \u2013 in this latter example you need to decide yourself what you regard as clinically important, for example you may decide that an average in birth-weight between two groups of 100g is clinically significant.<\/p>\n<h1>Resources for calculating sample size (or power): online calculators and statistics software<\/h1>\n<p>There are many on-line calculators for sample size. Two good ones are:<\/p>\n<p><a href=\"http:\/\/sampsize.sourceforge.net\/iface\/\" target=\"_blank\" rel=\"noopener\">http:\/\/sampsize.sourceforge.net\/iface\/<\/a><\/p>\n<p><a href=\"http:\/\/www.stat.ubc.ca\/~rollin\/stats\/ssize\">www.stat.ubc.ca\/~rollin\/stats\/ssize<\/a><\/p>\n<p>Statistical software<\/p>\n<p>STATA (www.stata.com)<\/p>\n<p>SAS (www.sas.com)<\/p>\n<p>SPSS (www.ibm.com\/software\/au\/analytics\/<strong>spss<\/strong>\/products\/<strong>statistics<\/strong>\/)<\/p>\n<p>R (<a href=\"http:\/\/www.r-project.org\/\">www.<strong>r<\/strong>&#8211;<strong>project<\/strong>.org\/<\/a>)<\/p>\n<p><strong>Commonly used abbreviations<\/strong><\/p>\n<p>Statistical notation often uses greek letters. Mean is often represented mu (m) and standard as sigma (s). The Type I error is represented as alpha (\u03b1) and Type II error as beta (b).<\/p>\n<h1>Sample size for prevalence (proportion)<\/h1>\n<p>You will need to \u2018guess\u2019 the expected prevalence, based perhaps on prior studies or pilot studies, and you need to specify how precise you want your estimates in terms of precision and confidence interval. Typically researchers use 5% precision and 95% confidence interval.<\/p>\n<ul>\n<li>Prevalence \u2013 expected prevalence from literature or previous experience<\/li>\n<li>Precision \u2013 how accurately you wish to measure your prevalence<\/li>\n<li>Confidence interval \u2013 (1 \u2013 precision)<\/li>\n<li>Population size \u2013 if unknown estimate will not be adjusted for small population size<\/li>\n<\/ul>\n<p>We will use a hypothetical example where we wish to estimate the proportion of women in a particular hospital who deliver by caesarean section. We think, from our clinical experience, that around 27% of women delivery by caesarean section. We decide on 5% precision and 95% confidence interval.<\/p>\n<p>In the &#8216;sampsize&#8217; web calculator (<a href=\"http:\/\/sampsize.sourceforge.net\/iface\/\">http:\/\/sampsize.sourceforge.net\/iface\/<\/a>), enter the desired precision, prevalence and confidence level, click on \u2018calculate\u2019 and your sample size will come up. The calculation shows that we need to ascertain the mode of delivery in 303 women.<\/p>\n<p><em>Note: the website specifies that if the prevalence is unknown, enter 50%. This is because measurement of a prevalence of 50% requires the largest sample size, so is a very conservative or &#8216;safe&#8217; estimate of sample size.<\/em><\/p>\n<h1>Sample size for two means or two proportions<\/h1>\n<p>You might want to calculate a sample size for comparing two means, or two proportions.<\/p>\n<p>For example, you might want to compare the mean fetal weight of infants of women with excessive gestational weight gain with those with acceptable gestational weight gain (this example is adapted from Walsh<a href=\"#_ENREF_8\"><sup>8<\/sup><\/a>).<\/p>\n<p>You might want to compare the proportion of women in Australia having caesarean section with the proportion of women in England having caesarean section (adapted from Prosser<a href=\"#_ENREF_9\"><sup>9<\/sup><\/a>).<\/p>\n<h2>Power and sample size \u2013 Type I and Type II error<\/h2>\n<p>You need to consider the acceptable level of type 1 or type II error.<\/p>\n<p>The type I or alpha error is often set at 0.05. This is the probability of wrongly accepting a difference if there is really no difference between groups. The &#8216;alpha&#8217; is the p-value which is the researcher chooses\u00a0at the beginning of the study, at the design stage. There are many types of statistical tests, all of which result in a &#8216;test statistic&#8217;. For example, a t-test results in a &#8216;t&#8217; statistic; a chi-square test results in a &#8216;Chi-square&#8217; statistic; an analysis of variance (ANOVA) results in an &#8216;F&#8217; statistic.<\/p>\n<p>The p-value is the probability of obtaining a statistic as extreme, or more extreme, than the test statistic, whatever that test statistic might be. In practice, this is the value on the x-axis of a distribution.<\/p>\n<p>For example, the probability of obtaining a t statistic equal to, or more extreme, than plus or minus 2.042, (with 30 degrees of freedom), is 5% or less. The p-value would be 0.05 or less. Look at a table of Student&#8217;s t-distribution in any statistics text book to see how this works. Conversely, the &#8216;critical value&#8217; of t if alpha is set at p-0.05 is plus or minus 2.042.<\/p>\n<p>[Image to be inserted]<\/p>\n<p>The \u2018power\u2019 of a study is the chance of detecting a difference between groups, if there is in fact a difference. For biological studies, power is often set at a minimum of 80%, i.e. the study has an 80% chance of detecting a difference between two groups. Power is related to a quantity called \u2018beta\u2019. Beta is often set at 0.20 or 20%. Power is calculated as:<\/p>\n<p>power = 1- \u03b2<\/p>\n<p>so if beta is set at 0.2, power is correspondingly equal to 0.8 or 80%. This is probability of falsely rejecting the alternate hypothesis, i.e. concluding there is NO difference between groups when there is in fact a difference. An <em>under-powered<\/em> study might find no difference between groups when in fact there IS a difference.<\/p>\n<p>For example, if we found that estimated fetal weight at 34 weeks was 2681 g compared with 2574 g, and we conclude this is different, but in fact is due to chance (sampling variation), this would be a type I error. At an pre-set alpha of 0.05, there is a 5% chance this will happen.<\/p>\n<figure id=\"attachment_107\" aria-describedby=\"caption-attachment-107\" style=\"width: 1280px\" class=\"wp-caption aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-107\" src=\"http:\/\/www.torusresearch.com.au\/wp-content\/uploads\/2017\/04\/typeI.jpg\" alt=\"\" width=\"1280\" height=\"720\" \/><figcaption id=\"caption-attachment-107\" class=\"wp-caption-text\">Type I &amp; Type II error<\/figcaption><\/figure>\n<p>For example, if we found that estimated fetal weight was 2681 g compared 2574 g and we conclude this is not different, but in fact really is different, this would be a type II or beta error.<\/p>\n<p>Power can be interpreted as the probability of finding a difference between groups, if there is one.<\/p>\n<h2>Sample size for two means<\/h2>\n<p>To calculate the sample size for comparing two means you need:<\/p>\n<ul>\n<li>Your estimate of mean 1 (\u03bc1) and mean 2 (\u03bc2)<\/li>\n<li>Your estimate of the standard deviation (SD) of the means<\/li>\n<li>Your decision about alpha<\/li>\n<li>Your decision about beta<\/li>\n<\/ul>\n<p>The difference between means is known as the \u2018effect size\u2019. If the means are close together, as in the picture on the left, you will need many subjects. If the means are far apart, with a large effect size, you will not need so many study subjects.<\/p>\n<p>We will use the stat.ubc website (<a href=\"http:\/\/www.stat.ubc.ca\/~rollin\/stats\/ssize\">www.stat.ubc.ca\/~rollin\/stats\/ssize<\/a>) to calculate a sample size to detect the difference between two means. You need to know the power you want your study to have, typically 0.8 (80%), the alpha value or p-value, typically 0.05, and \u2018guestimate\u2019 of the pooled standard deviation of your two groups.<\/p>\n<p>Notice you can solve for either power or sample size<\/p>\n<ul>\n<li>Enter mean one and mean two (often known as mu 1 and mu2)<\/li>\n<li>Enter values for alpha and power<\/li>\n<\/ul>\n<p>Click on calculate.<\/p>\n<p>For this example, I used the fetal weights and standard deviations from Jennifer Walsh\u2019s paper.<a href=\"#_ENREF_8\"><sup>8<\/sup><\/a> If mean 1 is 2681 grams, mean 2 is 2574 grams, with standard deviation 345 (standard deviation is often represented by the greek letter sigma), you can see the sample size needed is 164 in each group.<\/p>\n<p>If you want a greater than 80% chance of detecting a difference if there is one, you can increase the power of your study. Similarly if you want a smaller chance of falsely concluding there is a difference when there is really none, you can decrease you alpha value.<\/p>\n<p>Try this with power of 90% and alpha of 0.01!<\/p>\n<p>What was the power of the study actually conducted by Walsh?<\/p>\n<h2>Sample size for two proportions<\/h2>\n<p>To calculate the sample size for comparing two proportions you need:-<\/p>\n<ul>\n<li>Your estimate of proportion 1 and proportion 2 (p1 and p2)<\/li>\n<li>Your decision about alpha<\/li>\n<li>Your decision about beta<\/li>\n<\/ul>\n<p>Again using the &#8216;stat.ubc&#8217; website (<a href=\"http:\/\/www.stat.ubc.ca\/~rollin\/stats\/ssize\">www.stat.ubc.ca\/~rollin\/stats\/ssize<\/a>), we enter our data. For this example, data from Prosser\u2019s paper comparing caesarean rates in Queensland and England<a href=\"#_ENREF_9\"><sup>9<\/sup><\/a> was used. To detect a difference in a proportion of 0.36 compared with 0.25, we need 274 subjects in each group. Note that the &#8216;real&#8217; report includes many more subjects and analysed many more variables than merely the proportions of women having cs.<\/p>\n<h1>Other sample size calculations<\/h1>\n<p>There are many more situations where a calculation for power or sample size is required, such as comparing a sample mean to a known population mean, or a sample proportion to a known population proportion.\u00a0 More complex calculations are required for studies involving ANOVA, crossover studies, regression, and diagnostic tests.<\/p>\n<p>&nbsp;<\/p>\n<ol>\n<li>Everitt BS. <em>The Cambridge Dictionary of Statistics in the Medical Sciences<\/em>. Cambridge, UK: Cambridge University Press, 1995.<\/li>\n<li>CONSORT Website.<\/li>\n<li>STROBE Website.<\/li>\n<li>Collaboration C. Cochrane Handbook for Systematic Reviews of Interventions, 2011.<\/li>\n<li>Altman DG, Bland JM. How to obtain the confidence interval from a P value. <em>BMJ (Clinical research ed.)<\/em> 2011;343:d2090.<\/li>\n<li>Altman DG, Bland JM. Uncertainty and sampling error. <em>BMJ (Clinical research ed.)<\/em> 2014;349:g7064.<\/li>\n<li>Doll H, Carney S. Statistical approaches to uncertainty: P values and confidence intervals unpacked. <em>Equine veterinary journal<\/em> 2007;39(3):275-6.<\/li>\n<li>Walsh JM, McGowan CA, Mahony RM, Foley ME, McAuliffe FM. Obstetric and metabolic implications of excessive gestational weight gain in pregnancy. <em>Obesity (Silver Spring, Md.)<\/em> 2014;22(7):1594-600.<\/li>\n<li>Prosser SJ, Miller YD, Thompson R, Redshaw M. Why &#8216;down under&#8217; is a cut above: a comparison of rates of and reasons for caesarean section in England and Australia. <em>BMC pregnancy and childbirth<\/em> 2014;14:149.<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction Here we\u2019ll take a brief, but hopefully practical, overview of sampling in the context of designing a research project, and how to select a suitable sample size. The participants you include in your study are an important consideration, to ensure that the results of your study can be extended or generalised to a larger &hellip; <a href=\"https:\/\/www.torusresearch.com.au\/?p=101\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Sampling, power and sample size<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3,10],"tags":[],"_links":{"self":[{"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=\/wp\/v2\/posts\/101"}],"collection":[{"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=101"}],"version-history":[{"count":8,"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=\/wp\/v2\/posts\/101\/revisions"}],"predecessor-version":[{"id":215,"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=\/wp\/v2\/posts\/101\/revisions\/215"}],"wp:attachment":[{"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=101"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=101"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.torusresearch.com.au\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=101"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}