対応のないt検定または対応のあるt検定を使用して2群の平均値を比較します。結果はAPA第7版形式で表示されます。
A t-test is a statistical test used to compare the means of two groups and determine if they are significantly different from each other. Developed by William Sealy Gosset in 1908 under the pseudonym "Student," the t-test is one of the most commonly used statistical tests in social science, psychology, medicine, and education research. It answers a simple question: is the difference between two group means likely due to a real effect, or just random chance?
Use an independent samples t-test when comparing means from two different, unrelated groups. For example, comparing test scores between a treatment group and a control group, or comparing salary between male and female employees. This calculator uses Welch's t-test by default, which does not assume equal variances and is recommended by the American Psychological Association as the default approach.
Use a paired samples t-test when comparing means from the same group at two different times (pre-test vs post-test) or when participants are matched on key variables. The paired t-test accounts for the correlation between measurements, making it more powerful than an independent samples test when the design allows it. Common examples include before/after intervention studies and within-subjects experimental designs.
A researcher wants to test whether a new teaching method improves exam scores. 15 students use the new method (experimental group) and 15 use the traditional method (control group).
Experimental Group (n=15)
85, 90, 78, 92, 88, 95, 82, 91, 87, 93, 86, 89, 94, 80, 91
M = 88.07, SD = 4.94
Control Group (n=15)
78, 82, 75, 80, 77, 83, 79, 81, 76, 84, 73, 80, 82, 77, 79
M = 79.07, SD = 3.15
Results
t(23.47) = 5.87, p < .001, d = 2.15, 95% CI [5.82, 12.18]
The experimental group scored significantly higher than the control group, with a very large effect size (Cohen's d = 2.15).
| Situation | Recommended Test |
|---|---|
| Comparing 2 independent group means | Independent samples t-test |
| Comparing pre/post scores (same group) | Paired samples t-test |
| Comparing 3+ group means | One-way ANOVA |
| Non-normal data, 2 groups | Mann-Whitney U test |
| Non-normal paired data | Wilcoxon signed-rank test |
Before interpreting your results, verify these assumptions are met:
1. Scale of Measurement
The dependent variable must be continuous (interval or ratio scale). If your data are ordinal (e.g., Likert scales), consider a non-parametric alternative.
2. Random Sampling
Data should be collected from a representative, randomly selected portion of the population.
3. Normality
Each group's data should be approximately normally distributed. With sample sizes above 30 per group, the t-test is robust to violations of normality due to the Central Limit Theorem. For smaller samples, check normality using the Shapiro-Wilk test.
4. Homogeneity of Variance (for Student's t)
The two groups should have approximately equal variances. StatMate uses Welch's t-test by default, which does not require this assumption and is recommended for general use.
While p-values tell you whether a difference is statistically significant, Cohen's d tells you how large the difference is in practical terms. This is critical because with large sample sizes, even tiny, meaningless differences can be "significant."
| Cohen's d | Interpretation | Practical Meaning |
|---|---|---|
| 0.2 | Small | Difference noticeable only with careful measurement |
| 0.5 | Medium | Difference visible to the naked eye |
| 0.8 | Large | Substantial, obvious difference |
| 1.2+ | Very Large | Very strong effect, hard to miss |
According to APA 7th edition guidelines, t-test results should include the t-statistic, degrees of freedom, p-value, effect size, and confidence interval. Here are templates you can use:
Independent Samples
An independent-samples t-test revealed that the experimental group (M = 88.07, SD = 4.94) scored significantly higher than the control group (M = 79.07, SD = 3.15),t(23.47) = 5.87, p < .001, d = 2.15, 95% CI [5.82, 12.18].
Paired Samples
A paired-samples t-test indicated that post-test scores (M = 82.40, SD = 6.12) were significantly higher than pre-test scores (M = 75.60, SD = 7.35),t(24) = 4.32, p < .001, d = 0.86.
Note: Report t-values and degrees of freedom to two decimal places. Report p-values to three decimal places, except usep < .001 when the value is below .001. Always include an effect size measure.
StatMate's t-test calculations have been validated against R (t.test function) and SPSS output. We use the jstat library for probability distributions and implement Welch's t-test with Welch-Satterthwaite degrees of freedom approximation. All results match R output to at least 4 decimal places.
分散分析
3群以上の平均値を比較
カイ二乗検定
カテゴリ変数の関連を検定
相関分析
関係の強さを測定
記述統計
データを要約
サンプルサイズ
検出力分析・標本計画
1標本t検定
既知の値との比較
マン・ホイットニーU
ノンパラメトリック群間比較
ウィルコクソン検定
ノンパラメトリック対応検定
回帰分析
X-Yの関係をモデル化
重回帰分析
複数の予測変数
クロンバックのα
尺度の信頼性
ロジスティック回帰
二値アウトカムの予測
因子分析
潜在因子構造の探索
クラスカル・ウォリス
ノンパラメトリック3群以上比較
反復測定
被験者内分散分析
二元配置分散分析
要因計画の分析
フリードマン検定
ノンパラメトリック反復測定
フィッシャーの正確検定
2×2表の正確検定
マクネマー検定
対応のある名義データの検定
Excel/スプレッドシートから貼り付け、またはCSVファイルをドロップ
Excel/スプレッドシートから貼り付け、またはCSVファイルをドロップ
データを入力して「計算」をクリックしてください
または「サンプルデータを読み込む」をクリックしてお試しください