Overview¶

In a Stroop task, participants are presented with a list of words, with each word displayed in a color of ink. The participant’s task is to say out loud the color of the ink in which the word is printed. The task has two conditions: a congruent words condition, and an incongruent words condition. In the congruent words condition, the words being displayed are color words whose names match the colors in which they are printed: for example RED, BLUE. In the incongruent words condition, the words displayed are color words whose names do not match the colors in which they are printed: for example PURPLE, ORANGE. In each case, we measure the time it takes to name the ink colors in equally-sized lists. Each participant will go through and record a time from each condition.

import pandas as pd
%matplotlib inline

df = pd.read_csv('stroopdata.csv')
df['diff']  = df['Incongruent'] - df['Congruent']
df

The experiment takes participants with two test, congruent task and incongruent task. Congruent task is word with agreeing text and font color, while incongruent is a different text and its font color. Both of the task require the participants to say it out loud the word that are being display, and press 'Finish' button to see which time do they take. The control group is the congruent task, while experiment group is ingconruent task.

The independent variables is which makes differ between congruent task and incongruent task. That is words that are being displayed. Participants are requested to say the font color of the words, which is the same for both control and experiment group. But while text displayed agree with color in congruent, incongruent is the other way around.

The dependent variables is time participants take to complete the task. The time is depend on whether the text agree with the font color being displayed. We can see that from the data, on average, the time participants took for incongruent task is different than when they solve congruent task. We will use statistical test to test whether the time is significantly different.

So what kind of paired data should we be asking? We know that in general Incongruent task take longer than Congruent task. So in Confidence Interval, we could be asking the interval in which Ingrouent takes more second than congruent, and in hypothesis we could be asking is whether the incongruent task results in significantly different than congruent task.

Our sample size is less than 30, and that would means that our sampling distribution won't be normal. We're faced with two conditions, using t-test or bootstrapping. In this case, We will be using t-test. And since this is an experiment (assumed random assignment), we can draw causation.

In the instructions, it doesn't stated anywhere how the participants are collected. There might be a convenience bias(only participants that know the experiment), location bias(city/country where the experiment performed ), or voluntarily bias. Assumed participants randomly sampled without any bias at all. The result of this experiment can be generalized to world population.

We design the hypothesis test as follows:

H0: $ \mu_\mathbf{congruent} = \mu_\mathbf{incongruent}$ The time took for population to solve both congruent task and incongruent task is the same, on average

HA:$\mu_\mathbf{congruent} \neq \mu_\mathbf{incongruent}$ The time took for population to solve both congruent task and incongruent task is different, on average

We're going to use two-sided t-statistics. This is an experiment where we have limited data and samples, and we want to test our hypothesis to the population parameters.

df.describe()

The measure of tendency that will be used in this situation is mean, and measure of variability is standard deviation.

df.plot.scatter(x='Congruent',y='Incongruent');

The plot shown a moderaly weak correlation between congruent variable and incongruent variable.

(df.Incongruent - df.Congruent).plot.hist();

We can see that is the difference is right skewed distribution. This makes sense, since congruent task is easier, there shouldn't be any participants that solve incongruent task shorter tha congruent task. And it should be the longer time it took for the participants at solving incongruent task, the less should be for the number of participants.

Hypothesis Testing¶

%%R

n = 24
mu = 7.964792
s = 4.864827
CL = 0.95
n = 24
# z = round(qnorm((1-CL)/2, lower.tail=F),digits=2)
SE = s/sqrt(n)
t = mu/SE
t_crit = round(qt((1-CL)/2,df=n-1),digits=3)
c(t,c(-t_crit,t_crit))

[1]  8.020707  2.069000 -2.069000

Since our t-statistics, 8.02 is higher than the t critical values, we can conclude that the data provides convincing evidence that the time participants took for incongruent task is significantly different than when they took congruent task.

Confidence Interval¶

%%R

ME = t*SE
c(mu+ME,mu-ME)

[1]  5.910216 10.019368

So we are 95% confident that participants on average, took incongruent task 5.91 to 10.02 seconds longer than congruent task. Since this is an experimental design, we can draw causation intead of correlation.

REFERENCES:

http://napitupulu-jon.appspot.com/posts/paired-data-coursera-statistics.html

http://napitupulu-jon.appspot.com/posts/inference-means-anova.html

https://www.udacity.com/course/viewer#!/c-ud134-nd/l-4578095863/e-147019342/m-147300501

	Congruent	Incongruent	diff
0	12.079	19.278	7.199
1	16.791	18.741	1.950
2	9.564	21.214	11.650
3	8.630	15.687	7.057
4	14.669	22.803	8.134
5	12.238	20.878	8.640
6	14.692	24.572	9.880
7	8.987	17.394	8.407
8	9.401	20.762	11.361
9	14.480	26.282	11.802
10	22.328	24.524	2.196
11	15.298	18.644	3.346
12	15.073	17.510	2.437
13	16.929	20.330	3.401
14	18.200	35.255	17.055
15	12.130	22.158	10.028
16	18.495	25.139	6.644
17	10.639	20.429	9.790
18	11.344	17.425	6.081
19	12.369	34.288	21.919
20	12.944	23.894	10.950
21	14.233	17.960	3.727
22	19.710	22.058	2.348
23	16.004	21.157	5.153

	Congruent	Incongruent	diff
count	24.000000	24.000000	24.000000
mean	14.051125	22.015917	7.964792
std	3.559358	4.797057	4.864827
min	8.630000	15.687000	1.950000
25%	11.895250	18.716750	3.645500
50%	14.356500	21.017500	7.666500
75%	16.200750	24.051500	10.258500
max	22.328000	35.255000	21.919000