When considering two discrete variables
and
at the same time,
it is possible to measure the degree of uncertainty or information
associated with them. It is called the
joint entropy
,
. If
and
may respectively take
and
possible values, the joint entropy is computed as in
equation 6.4:
where
represents the probability of being classified in both
category
of variable
and category
of variable
.
The joint entropy varies from a theoretical 0 (or empirically
) to
. The relation
between the individual entropies and their joint entropy is given by
equation 6.5:
It expresses the fact that the joint entropy is always smaller then the sum of the individual entropies. The equality holds only when the two variables are independent.