Communication theory has been formulated best for symbolic-valued signals. Claude Shannon published in 1948 The Mathematical Theory of Communication, which became the cornerstone of digital
communication. He showed the power of **probabilistic models** for symbolic-valued signals, which allowed him to quantify the information present in a signal. In the
simplest signal model, each symbol can occur at index n with a probability *Pr [a _{k}], k = {1,...,K}* . What this model says is that for each signal value a

*K*-sided coin is flipped (note that the coin need not be fair). For this model to make sense, the probabilities must be numbers between zero and one and must sum to one.

(6.48)

(6.49)

This coin-flipping model assumes that symbols occur without regard to what preceding or succeeding symbols were, a false assumption for typed text. Despite this probabilistic model's
over-simplicity, the ideas we develop here also work when more accurate, but still probabilistic, models are used. The key quantity that characterizes a symbolic-valued signal is the
**entropy** of its alphabet.

Because we use the base-2 logarithm, entropy has units of bits. For this Definition to make sense, we must take special note of symbols having probability zero of occurring. A zero-probability
symbol never occurs; thus, we define 0log_{2}0=0 so that such symbols do not affect the entropy. The maximum value attainable by an alphabet's entropy occurs when the symbols are
equally likely In this case, the entropy
equals log_{2} *K*. The minimum value occurs when only one symbol occurs; it has probability one of occurring and the rest have probability zero.

**Exercise 6.20.1**

Derive the maximum-entropy results, both the numeric aspect (entropy equals log_{2}*K*) and the theoretical one (equally likely symbols maximize entropy). Derive the value of the
minimum entropy alphabet.

**Example 6.1**

A four-symbol alphabet has the following probabilities.

Note that these probabilities sum to one as they should. As The entropy of this alphabet equals

(6.51)

- 771 reads