Cosine wrote:
Just thought of another question. Would it be true that taking a
large sample size N makes those rare events more likely happen?
Consequently, this would make us more likely to reject H0 when
actually we should accept it?
After all, in doing a hypothesis test, we assign H0, Ha, and alpha.
Then we take samples and get a p-value. If this p-value is less than
alpha/2, we say that the result of this sample reflects the happening
of an event being rarer than the extremity. Therefore we reject the
H0. But isn't it true that with a large sample, more likely those
rare events would happen?
In strandard situations and "in theory", as the sample size increases,
the probability of rejecting the null hypothesis if the null hypothesis
is true is fixed, while the the probability of rejecting the null
hypothesis if the null hypothesis is false wll increase (for a fixed "alternative hypothesis"). This conclusion depends on a number of
assumptions:
(a) the evaluation of the null distribution to get the critical values
for different sample sizes is not badly affected by numerical
approximations or inaccuracies that may lead to the true "alpha" for a
given nominal alpha fluctuating as the sample-size changes;
(b) the specification of the test statistic for any given sample size
should behave sensibly... for example that you don't just ignore any
samples beyond some fixed sample-size, and that you don't choose a
radically different test statistic formula as the sample size changes'.
Since the general theory of hypothesis testing allows you to choose any
test statistic you like, it seems difficult to justify the conclusion
without coming up with a precise definition of "using essentially the
same test statistic as the sample-size changes" that can be applied in
general ... but for standard situations the meaning may seem clear.
Even for a fairly standard situation of using a likelihood ratio test
for a complicated non-normal model, there seems to be no formal
justification that the power of the test is strictly monotonic as the sample-size changes. (This last may be wrong.);
(c) that the sorts of poor experimental comditions as discussed for
your first question do no apply.
Some special consideration needs to be gien to your sentence: "But
isn't it true that with a large sample, more likely those rare events
would happen?" It is implicit in the theory that the probability
distributions involved fully take into account any "rare events".
Rejection of the null hypthesis happens if an event happens that would
be considered rare if the null model actually holds. What is considered
"rare" is determined by what you call "alpha": the probability that the
test statistic is greater than the critical vaue if the null hypohesis
is true. You are free to change "alpha" (as the sample size changes) if
you want to, as the theory doesn't specify that you should use any
particular value .... you need to choose something sensible for the
particular circumstances of the decision you are trying to make.
--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)