Suppose that the probability mass function of a discrete
k-dimensional random vector
is given by
where
is the
jth value of
and
denotes the vector of probabilities when
. Here
is a normalizing constant for which
We see that the model in (
1) prescribes a change of measure from the null to the alternative hypothesis. Let
be the
matrix of possible vector values of
. Then under the distribution specified by
,
where the expectations are with respect to the model (
1). This particular situation arises often when dealing with the nonparametric randomized block design. Define
and suppose that we would like to test
Letting
denote a multinomial random vector with parameters
, we see that the log likelihood as a function of
is, apart from a constant, proportional to
The score vector under the null hypothesis is then given by
Under the null hypothesis,
and the score statistic is given by
where
In the one-sample ranking problem whereby a group of judges are each asked to rank a set of
t objects in accordance with some criterion, let
be the space of all
t! permutations of the integers
and let the probability mass distribution defined on
be given by
where
. Conceptually, each judge selects a ranking
in accordance with the probability mass distribution
In order to test the null hypothesis that each of the rankings are selected with equal probability, that is,
where
define a
k-dimensional vector score function
on the space
and following (
1), let its smooth probability mass function be given as
where
is a
t-dimensional vector,
is a normalizing constant and
is a
t-dimensional score vector to be specified in (
8). Since
it can be seen that
and hence the hypotheses in (
5) are equivalent to testing
It follows that the log likelihood function is proportional to
where
and
represents the number of observed occurrences of the ranking
. The Rao score statistic evaluated at
is
whereas the information matrix is
The test then rejects the null hypothesis whenever
where
is the upper
critical value of a chi square distribution with
degrees of freedom. We note that the test just obtained is the locally most powerful test of
Specializing this test statistic to the Spearman score function of adjusted ranks
we can show that the Rao score statistic is the well-known Friedman test
[
5].
where
is the average of the ranks assigned to the
ith object.
2.1 The Two-Sample Ranking Problem
The approach just described can be used to deal with the two-sample ranking problem assuming again the Spearman score function. Let
be two independent random vectors whose distributions as in the one sample case are expressed for simplicity as
where
represents the vector of parameters for population
l. We are interested in testing
The probability distribution
represents an unspecified null situation. Define
where
represents the number of occurrences of the ranking
in sample
l.
Also, for
, set
,
and
where
Let
be the covariance matrix of
under the null hypothesis defined as
where
and
. The logarithm of the likelihood
L as a function of
is proportional to
In order to test
we calculate the Rao score test statistic which is given by
It can be shown to have asymptotically a
whenever
as
where
Here
is the Moore–Penrose inverse of
and
is a consistent estimator of
and
f is the rank of
, as required.
2.2 The Use of Penalized Likelihood
In the previous sections, it was possible to derive test statistics for the one and two-sample ranking problems by means of the change of measure paradigm. This paradigm may be exploited to obtain new results for the ranking problems. Specifically, we consider a negative penalized likelihood function defined to be the negative log likelihood function subject to a constraint on the parameters which is then minimized with respect to the parameter. This approach yields further insight into ranking problems.
For the one-sample ranking problem, let
represent the penalizing function for some prescribed values of the constant
c. We shall assume for simplicity that
. When
t is large (say
), the computation of the exact value of the normalizing constant
involves a summation of
t! terms.
[
6] noted the resemblance of (
6) to the continuous von Mises-Fisher density
where
is the norm of
and
is on the unit sphere and
is the modified Bessel function of the first kind given by
This seems to suggest the approximation of the constant
by
In
[
1], penalized likelihood was used in ranking situations to obtain further insight into the differences between groups of rankers.