• If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

View

# Paired t-interval

View current version     Page history
Saved by
on December 1, 2014 at 1:56:32 pm

When comparing two independent sets of data, you subtract one from the other to find the difference for every group in the set. This doesn't make sense in general. It only works for certain kinds of data. These differences are the means of paired data. Makes no sense. After finding the mean differences, we want to know how confident we are of capturing the true difference. A paired t-interval constructs a confidence interval to estimate the mean difference between matched pairs of data. This confidence interval tells us how far away we are from the true mean. Incorrect. Before we can create a confidence interval, we need to check our conditions.

Rather than format it like this, just write a few paragraphs. Use full sentences that flow one to another. Your presentation is too choppy.

The paired data condition states that the data set must be paired. Along with this condition, if the data are paired, the groups themselves are not independent, rather the differences must be independent of each other. This is the independence assumption. The randomization condition states that all groups must be random in the sample in order to run the test. The nearly normal condition assumes that the population of differences follows a normal model. This condition can be checked with a histogram or QQ-plot.

In order to conduct the test, we need to follow the appropriate steps to find a solution and come to a conclusion.

First, plan. State what we want to know. Then, use model steps to check conditions, draw a picture, state which distribution model you will be using, and choose your method. Next, use mechanics to test the paired data. Estimate the standard error and calculate the margin of error. For your conclusion, interpret the confidence interval contextually.

If all of our conditions are met, we can calculate a confidence interval:

$\left(&space;\bar{d}&space;-&space;t_{*}&space;\left(\frac{s_{d}}{\sqrt{n_{d}}}&space;\right&space;),&space;\bar{d}&space;+&space;t_{*}&space;\left(\frac{s_{d}}{\sqrt{n_{d}}}&space;\right&space;)&space;\right)$

where $\bar{d}$ is the mean difference and $t_{*}$ is the critical value with $n_{d}&space;-&space;1$ degrees of freedom.

For example if we are given average high temperatures in January and in July from twelve different countries (Vienna, Copenhagen, Paris, Berlin, Athens, Rome, Amsterdam, Madrid, London, Edinburgh, Moscow, and Belgrade, respectively), we notice that one is a winter season and the other is a summer season.

January: 34, 36, 42, 35, 54, 54, 40, 47, 44, 43, 21, 37

July: 75, 72, 76, 74, 90, 88, 69, 87, 73, 65, 76, 84

Our plan is that we want to know the confidence interval for the mean temperature difference between the two seasons, since both are different extremes of each other. Next, we check our conditions. We see that these data are paired, average temperatures in January and average temperatures in July. The data from this random survey collected random temperatures throughout the month from twelve countries, so we know the temperatures should be independent of each other, meeting the independence assumption. We check this data set with a histogram to find it unimodal and symmetric.

Vienna: 34-75= -41

Copenhagen: 36-72= -36

Paris: 42-76= -34

Berlin: 35-74= -39

Athens: 54-90= -36

Rome: 54-88= -34

Amsterdam: 40-69= -29

London: 44-73= -29

Edinburgh: 43-65= -22

Moscow: 21-76= -55

This is suppose to be the same example you use in the video. Why isn't it?

I won't comment much about it because you'll have to change all of it. But like I said above, you need a much more expository style, organized into flowing paragraphs. Don't just copy the book. Put it in your own words and follow the rubric.

Looking at married couples, husbands tend to be slightly older than wives. How much older, on average, are husbands? We have data from a random sample of 200 British couples. Only 170 couples provided ages for both husband and wife. Form a confidence interval for the mean difference of husband's and wife's ages for these 170 couples.

Plan - I want to estimate the mean difference in age between husbands and wives. I have a random sample of 200 British couples, 170 of whom provided both ages.

Model - Paired Data Condition: the data are paired because they are testing on members of married couples.

- Independence Assumption: the data are from a randomized survey, so couples should be independent of each other.

- Nearly Normal Condition: a histogram is best.

The conditions are met, so we can use Student's t-model with (n-1)=169 degrees of freedom and find a paired t-interval.

Mechanics - n=170

- dbar=2.2

- sd=4.1 years

- standard error of dbar: SE(dbar) = sd /sqrt(n)

= 4.1/sqrt(170) = 0.31 years

degree of freedom(df): n-1=169

- the 95% critical value for t169 is 1.97

- the margin of error: ME = t*169 X SE(dbar)

= 1.97(0.31) = 0.61

So the 95% confidence interval is 2.2 + 0.6, or an interval of (1.6,2.8) years

Conclusion - I am 95% confident that British husbands are, on average, 1.6 to 2.8 years older than other wives.

## Using SPSS to find Confidence Intervals for Matched Pairs

• First open the file that you will be working with on SPSS We're assuming this has already been done. (By the way, each step is a sentence and, therefore, requires a period.)
• Select the Analyze drop down menu and choose Compare Means
• Under the Compare Means menu, select Paired-Sample T Test
• A menu will pop up, place the two different variables into the variable 1 and variable 2 boxes. Run-on sentence.
• SPSS has the confidence interval set to 95%, but if you would like to change the interval select the options button and replace the 95 with the percentage that you want
• Select the OK button and SPSS will produce the output for the Paired-Sample T Test
• Scrolling through the output, in the Paired Samples Test box there is a column for the interval that you specified that breaks it into the upper and lower ends of the interval

Again, you can assume the data set is open already. As you work through the steps, explain a little about the example. So introduce the data and make sure the viewer understands what they're looking at.

You can check one of the conditions in SPSS by calculating the differences in a new column and looking at a histogram and QQ-plot. It would be wise considering that you only have 12 data points.

You need to explain the choice you made to put July first and then January. The viewer will want to know how to make that decision and why one choice might be better than the other.

You need to help the viewer interpret the interval they see. What do those numbers mean in context?