亚洲免费av电影一区二区三区,日韩爱爱视频,51精品视频一区二区三区,91视频爱爱,日韩欧美在线播放视频,中文字幕少妇AV,亚洲电影中文字幕,久久久久亚洲av成人网址,久久综合视频网站,国产在线不卡免费播放

        ?

        The differences and similarities between two-sample t-test and paired t-test

        2017-11-29 03:44:40ManfeiXUDrewFRALICKJuliaZHENGBokaiWangXinTUChangyongFENG
        上海精神醫(yī)學(xué) 2017年3期
        關(guān)鍵詞:均值樣本檢驗

        Manfei XU*, Drew FRALICK, Julia Z. ZHENG, Bokai Wang, Xin M. TU, Changyong FENG,4

        ?Biostatistics in psychiatry (39)?

        The differences and similarities between two-sample t-test and paired t-test

        Manfei XU1*, Drew FRALICK1, Julia Z. ZHENG2, Bokai Wang3, Xin M. TU5, Changyong FENG3,4

        independent t-test; paired t-test; pre- and post- treatment; matched paired data

        1. Introduction

        In clinical research, we usually compare the results of two treatment groups (experimental and control). The statistical methods used in the data analysis depend on the type of outcome.[1]If the outcome data are continuous variables (such as blood pressure), the researchers may want to know whether there is a significant difference in the mean values between the two groups. If the data is normally distributed, the two-sample t-test (for two independent groups) and the paired t-test (for matched samples) are probably the most widely used methods in statistics for the comparison of differences between two samples.Although this fact is well documented in statistical literature, confusion exists with regard to the use of these two test methods, resulting in their inappropriate use.

        The reason for this confusion revolves around whether we should regard two samples as independent(marginally) or not. If not, what’s the reason for correlation? According to Kirkwood: ‘When comparing two populations, it is important to pay attention to whether the data sample from the populations are two independent samples or are, in fact, one sample of related pairs (paired samples)’.[2]In some cases, the independence can be easily identified from the data generating procedure. Two samples could be considered independent if the selection of the individuals or objects that make up one sample does not influence the selection of the individuals or subjects in the other sample in any way.[3]In this case,two-sample t-test should be applied to compare the mean values of two samples. On the other hand, if the observations in the first sample are coupled with some particular observations in the other sample,the samples are considered to be paired.[3]When the objects in one sample are all measured twice (as is common in “before and after” comparisons), when the objects are related somehow (for example, if twins,siblings, or spouses are being compared), or when the objects are deliberately matched by the experimenters and have similar characteristics, dependence occurs.[2]

        This paper aims to clarify some confusion surrounding use of t-tests in data analysis. We take a close look at the differences and similarities between independent t-test and paired t-test. Section 2 illustrates the data structure for two-independent samples and the matched pair samples. We discuss the differences and similarities of these two t-tests in Sections 3.

        In section 4, we present three examples to explain the calculation process of the independent t-test in independent samples, and paired t-test in the time related samples and the matched samples, respectively.The conclusion and discussion are reported in Section 5.

        2. Independent samples and matched-paired samples

        The t-tests are used for data with continuous outcomes.We first discuss the data structure.

        2.1 Two independent samples

        Let Xij, i = 0; 1; j = 1, …., nibe the observations from two independent samples (i = 0 or 1 denotes control or experimental group). The mean and variances of Xijare μiand(i = 0, 1). There are two levels of independence in the data from two independent samples. The data from two different subjects within the same sample are independent, i.e. Xijand Xikare statistically independent if j ≠ k. The data of two subjects from different samples are also independent, i.e. X0jand X1kare independent for j = 1,…, n0and k = 1, …, n1.

        The sample means and sample variances of these two samples are

        If the variance of those two samples are the same,a more efficient estimator of the variance ofis

        2.2 Matched pair data

        Suppose two samples are matched pair with outcomes Xj= (X0j, X1j), i = 1,…, n. Data from different pairs are independent, i.e. Xjand Xkare independent if j ≠ k.However, within each pair i, X0iand X1iare correlated.Hence the data in the control group (X01, …, X0n) and in the treatment group (X11, …, X1n) are correlated.Assume the correlations are the same within all pairs and denote the common correlation coefficient by ρ.

        The variance of Xdcan be estimated by

        2.3 The difference between independent samples and matched-pair samples

        We discuss the difference between independent samples and matched-pair samples based on the sample mean difference. To simplify our discussion, we assume n0= n1= n. From the above we know that the formulas to calculate the sample mean difference are always the same, which equals the sample mean of the treatment group minus the sample mean of the control group. One of the differences is their variances, which can be easily seen from (1) and (3). For the matchedpair data, if two observations within the same pair are positively (negatively) correlated, i.e. ρ> 0(< 0), the variance of the mean difference is smaller (larger) than that in the case of independent samples. They are equal if two samples are uncorrelated (ρ= 0).

        Another difference is in the estimation of the variance of the sample mean values. In the independent samples, we need the sample variances of both samples in order to estimate the variance of(see [2]). In the matched-pair data, we only need the difference within each pair to estimate the variance of, as indicated in (4).

        3. T-tests

        Suppose we want to test the hypothesis that two samples have the same mean values, i.e. H0: μ0= μ1.In the following discussion we assume the data follows bivariate normal distribution. The t-test is of the form

        sample mean difference

        sample standard deviation of the sample mean difference

        3.1 Two-sample t-test

        The two-sample t-test is of the form

        Under the null hypothesis H0, if σ0= σ1, T1follows student’s t-distribution with degrees of freedom (df) n0+ n1- 2. If σ0≠ σ1, the exact distribution of T1is very complicated. This is the well-known Behrens-Fisher problem in statistics[4,5], which we will not discuss here.When n0and n1are both large enough, the distribution of T1can be safely approximated by standard normal distribution.

        3.2 Paired t-test

        The paired t-test is of the form

        It’s obvious that the paired t-test is exactly the onesample t-test based on the difference within each pair. Under the null hypothesis, T2always follows t-distribution with df = n-1.

        3.3 Differences between the two-sample t-test and paired t-test

        As discussed above, these two tests should be used for different data structures. Two-sample t-test is used when the data of two samples are statistically independent, while the paired t-test is used when data is in the form of matched pairs. There are also some technical differences between them. To use the twosample t-test, we need to assume that the data from both samples are normally distributed and they have the same variances. For paired t-test, we only require that the difference of each pair is normally distributed.An important parameter in the t-distribution is the degrees of freedom. For two independent samples with equal sample size n, df = 2(n-1) for the two-sample t-test. However, if we have n matched pairs, the actual sample size is n (pairs) although we may have data from 2n different subjects. As discussed above, the paired t-test is in fact one-sample t-test, which makes its df =n-1.

        4. Examples

        In this section we present some numerical examples to show the differences between the two tests.

        4.1 Example 1: two independent samples

        To illustrate how the test is performed, we present the data shown in table 1 which compares positive symptom scores on the Positive and Negative Syndrome Scale (PANSS) between the experimental group and the control group, each of which had 10 patients each. We want to test if the mean scores of the two groups are the same.

        The sample mean values of these two groups are 11.2 and 14.3, respectively. The sample variances are 2.40 and 1.70, respectively. The two-sample t-test statistic equals 4.54. From the t-distribution with df= 18, we obtain the p-value of 0.0001, which shows strong evidence to reject the null hypothesis.

        Table 1. Positive symptom scores in Positive and Negative Syndrome Scale (PANSS)

        4.2 Example 2: Pre- and post-treatment

        To illustrate how the test is performed, we still use the data shown in table 1, except for changing the two variables to one group having positive symptom scores of PANSS at baseline and one group having positive symptom scores of PANSS after treatment. Hence there are only 10 subjects in this example. The sample mean difference is the same as that in Example 1. However,the example variance of the sample mean difference is 2.45. The paired t-test statistic equals 6.33. From the t-distribution with df = 9, we obtain the p-value of 0.00007, which shows strong evidence to reject the null hypothesis.

        4.3 Example 3: Matched pair data

        In addition to the time related samples, paired t-test is also introduced in the data analysis of matched sampling. Such sampling is a method of data collection and organization which helps to reduce bias and increase precision in observational studies.[6]For example, consider a clinical investigation to assess the repetitive behaviors of children affected with autism.A total of 10 children with autism enroll in the study.Then, 10 controls are selected from healthy children with matched age and gender which may be the confounding factors in the study. Each child is observed by the study psychologist for a period of 3 hours.Repetitive behavior is scored on a scale of 0 to 100 and scores represent the percent of the observation time in which the child is engaged in repetitive behavior(see table 2). Thus, we present the calculation process of paired t-test and independent t-test in the data analysis, respectively, under the assumption that both samples come from normally distributed populations with unknown but equal variances.

        Table 2. Repetitive behavior scores in the groups of children with autism and the healthy controls

        In this example, there are 20 subjects. However,each subject in the experimental group is matched with a subject in the control group. We also need to use the matched pair t-test to compare the mean values of the two groups. The paired t-test statistic equals 2.667.From the t-distribution with df = 9, we obtain the p-value of 0.01, which shows strong evidence to reject the null hypothesis.

        5. Discussion

        Although two-sample t-test and paired t-test have been widely used in data analysis, misuse of them is not uncommon in practice. In this paper, we show the differences and similarities of those tests. Two-sample t-test is used only when two groups are marginally independent. To say more about matching, let us suppose that age is a possible confounding factor of the outcome. During randomization, we first match subjects by age. For two subjects with the same age,they are assigned to two treatment groups by block (of size 2) randomization. Why should we use paired t-test in this case? This is related to the technical notation of conditional independence in statistics. For each pair,their outcomes are independent given the (same) age.However, they are not independent marginally. That’s why the two-sample t-test cannot be used. However,perfect matching is very difficult to implement in practice especially when the factor of matching is a continuous variable (the probability that two subjects have the exact same age is always 0!).

        Funding statement

        No funding support was obtained for preparing this article.

        Conflicts of interest statements

        The authors declare no conflict of interests.

        Authors’ contributions

        Manfei XU wrote the draft; Andrew FRALICK helped with the writing of the article; Dr. Xin Tu established the outline of the article; Dr. Changyong Feng, Julia Z. ZHENG, and Bokai Wang provided comments and revisions to the article.

        1.Daya S. The t-test for comparing means of two groups of equal size. Evidence-based Obstetrics & Gynecology.2003; 5(1): 4-5. doi: https://doi.org/10.1016/S1361-259X(03)00054-0

        2.Kirkwood BR, Sterne JAC. Essential Medical Statistics, 2nded. United Kingdom, Oxford: Blackwell; 2003. pp: 58-79

        3. Peck R, Olsen C, Devore J. Introduction to Statistics & Data Analysis, 4thed. MA, Boston: Brooks/Cole; 2012. pp: 639-640

        4.Fisher RA. The asymptotic approach to behrens integral with further tables for the d test of significance. Annals of Eugenics. 1941; 11: 141-172

        5. Chang CH, Pal N. A revisit to the Behrens-Fisher problem:Comparison of ve test meth-ods. Commun Stat Simul Comput. 2008; 37(6): 1064-1085. doi: https://doi.org/10.1080/03610910802049599

        6.Rubin DB. Matching to remove bias in observational studies. Biometrics. 1973; 29(1):159-183. doi: https://doi.org/10.2307/2529684

        雙樣本t檢驗和配對檢驗的異同性

        徐曼菲, Fralick D, Zheng JZ, Wang B, Tu X, Feng C

        獨立樣本t檢驗,配對t檢驗,治療前后,配對數(shù)據(jù)

        Summary: In clinical research, comparisons of the results from experimental and control groups are often encountered. The two-sample t-test (also called independent samples t-test) and the paired t-test are probably the most widely used tests in statistics for the comparison of mean values between two samples.However, confusion exists with regard to the use of the two test methods, resulting in their inappropriate use. In this paper, we discuss the differences and similarities between these two t-tests. Three examples are used to illustrate the calculation procedures of the two-sample t-test and paired t-test.

        [Shanghai Arch Psychiatry. 2017; 29(3): 184-188.

        http://dx.doi.org/10.11919/j.issn.1002-0829.217070]

        1Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China

        2Department of Immunology and Microbiology, McGill University, Montreal, QC, Canada

        3Departments of Biostatistics & Computational Biology and4Anesthesiology, University of Rochester, Rochester, NY, USA

        5Department of Family Medicine and Public Health, University of California San Diego School of Medicine, La Jolla, CA, USA

        *correspondence: Manfei Xu; Mailing address: 600 South Wanping RD, Shanghai, China. Postcode: 200030; E-Mail: manfeixu@163.com

        概述:臨床研究中經(jīng)常遇到比較實驗組和對照組之間的結(jié)果。雙樣本t檢驗(又稱為獨立樣本t檢驗)和配對t檢驗可能是運用于比較兩個樣本之間均值的最廣泛的統(tǒng)計方法。然而,這兩種方法的運用會產(chǎn)生混淆,從而導(dǎo)致使用不當(dāng)。本文中,我們討論了這兩種t檢驗之間的異同性,并運用三個范例來闡述雙樣本t檢驗和配對t檢驗的計算過程。

        Manfei Xu obtained a bachelor’s degree in Biomedical engineering from the medical college,Shanghai Jiao Tong University in 2002, and a Master’s degree in Public Health from the University of South Florida, USA in 2010. The same year she started working as a researcher at the Shanghai Mental Health Center in China. Since 2013, she has been the full-time technical editor for the Shanghai Archives of Psychiatry. Her work involves preliminary assessment of manuscripts,consulting on biostatistical analysis, and research into the application of statistical methods in mental health studies.

        猜你喜歡
        均值樣本檢驗
        序貫Lq似然比型檢驗
        用樣本估計總體復(fù)習(xí)點撥
        2021年《理化檢驗-化學(xué)分冊》征訂啟事
        對起重機(jī)“制動下滑量”相關(guān)檢驗要求的探討
        推動醫(yī)改的“直銷樣本”
        隨機(jī)微分方程的樣本Lyapunov二次型估計
        關(guān)于鍋爐檢驗的探討
        均值不等式失效時的解決方法
        均值與方差在生活中的應(yīng)用
        村企共贏的樣本
        亚洲性综合网| 国产专区一线二线三线码| 精品亚洲一区二区三区在线观看| 尤物99国产成人精品视频| 亚欧免费无码AⅤ在线观看| 中文字幕一区二区黄色| 三级做a全过程在线观看| 成av人大片免费看的网站| 18国产精品白浆在线观看免费| 人妻在线日韩免费视频| 久久久国产精品ⅤA麻豆| 亚洲一级无码AV毛片久久| 亚洲不卡av一区二区三区四区 | 国产精品一级av一区二区| 久草福利国产精品资源| 国产va免费精品高清在线观看 | 免费人成在线观看播放国产| 亚洲图文一区二区三区四区| 最近免费中文字幕中文高清6| 在线精品国产一区二区三区| 日韩欧美在线播放视频| 精品人妻午夜中文字幕av四季| 亚洲精品乱码久久久久蜜桃| 午夜精品久久久久久中宇| 国产视频不卡在线| 久亚洲精品不子伦一区| 女的扒开尿口让男人桶30分钟| 国际无码精品| 99国产精品欲av麻豆在线观看| 亚洲中文字幕精品乱码2021| 亚洲а∨天堂久久精品2021| 日韩最新在线不卡av| 亚洲熟妇一区二区蜜桃在线观看| 婷婷射精av这里只有精品| 亚洲永久无码动态图| 中文在线最新版天堂av| 亚洲 欧美 偷自乱 图片| 越南女子杂交内射bbwbbw| 人妻无码AⅤ中文系列久久免费| 亚洲熟女一区二区三区250p| 日产无人区一线二线三线乱码蘑菇 |