Channel Capacity

4.10 Channel Capacity

The channel capacity is the maximum rate of communication for which an arbitrarily small error probability can be achieved. It is an intrinsic property of a channel and extensively studied in the Information Theory literature.

Each channel has its capacity and this concept gets confused because sometimes the word capacity is also used to describe the bit rate achieved by a given communication system. For example, under this alternative interpretation of “capacity”, one can say that increasing the number $b$ of bits per symbol a PAM from $b = 2$ to 4 doubled the “capacity”. This terminology should be avoided and, instead, bit rate used in this case.

Because each channel has its capacity, a given expression for capacity must be used only for the specific channel that it was derived for.

4.10.1 Capacity of AWGN channels

In the case of the continuous-time AWGN channel of Figure 4.6, the capacity $C$ in bits per second (bps) is¹⁵

C_{[bps]} = BW {log}_{2} (1 + SNR),

(4.52)

where $BW$ is given in Hz and $SNR$ is the SNR at the receiver, in linear scale (not dB).

Recall from Section 4.2.5, that the transmit signal power of an AWGN channel is power-limited. This avoids the situation of having an infinite capacity by increasing the SNR in Eq. (4.52). In practice, $C$ is finite because the transmit signal power is always bounded and the noise power is not zero.

From Eq. (4.52), it can also be seen that doubling $BW$ leads to twice the capacity, but the dependence of $C$ on $SNR$ is only logarithmic.

The discrete-time AWGN channel does not have any time explicitly associated to the transmission of a vector $m$ of dimension $D$ and one cannot specify the symbol duration in seconds, for example. Hence, it is more sensible to account capacity as bits “per channel use” (bpcu) or “per transmission” in this case, where each “use” is the transmission of a vector with $D$ symbols. For example, $D = 1$ and 2 when using PAM and QAM, respectively. Assuming that the unit is “bits per channel use per dimension”, the normalized capacity of the discrete-time AWGN channel is

{\bar{C}}_{[bpcu]} = \frac{1}{2} {log}_{2} (1 + SNR)

(4.53)

and its total capacity $C = D \bar{C}$ bits per channel use.

Eq. (4.53) is also called the capacity of the discrete-time real AWGN, while the capacity of the complex version (that corresponds to using QAM instead of PAM) is

C_{[bpcu]} = {log}_{2} (1 + SNR).

(4.54)

Note that Eq. (4.52) is valid for both real and complex AWGN channels, and it can be obtained from Eq. (4.53) or Eq. (4.54) as follows. When the symbols are real-valued (e. g., when PAM is used in baseband channels), Table 4.2 indicates that the maximum number of symbols that can be sent over a channel with bandwidth BW is $R_{sym}^{max} = 2 BW$ . Therefore, using Eq. (4.53), the capacity is

C_{[bps]} = R_{sym}^{max} C_{[bpcu]} = 2 BW (\frac{1}{2} {log}_{2} (1 + SNR)) = BW {log}_{2} (1 + SNR),

as indicated by Eq. (4.52). Similarly, when the symbols are complex-valued (e. g., when QAM is used in bandpass channels), Table 4.3, indicates that the maximum number of symbols that can be sent over a channel with bandwidth BW is $R_{sym}^{max} = BW$ and from Eq. (4.54):

C_{[bps]} = R_{sym}^{max} C_{[bpcu]} = BW {log}_{2} (1 + SNR).

4.10.2 Water filling

This section discusses the water filling (or water pouring) algorithm that provides the optimum allocation of power in order to achieve the capacity of the frequency-selective channel depicted in Figure 4.1. A by-product of using water filling for power allocation is an estimate of the channel capacity itself. It is assumed that the transmit signal is constrained to have a maximum power value (otherwise the capacity would be infinite).

The strategy to achieve the capacity in this case is to segment the available BW into $K$ “subchannels” with small BW denoted as $Δ_{f}$ , such that the gain can be assumed constant within $Δ_{f}$ . With this strategy, the frequency-selective channel is transformed into a set of parallel “subchannels”, with each one modeled as AWGN with a frequency response magnitude that is flat over frequency, as depicted in Figure 4.43(a). More specifically, the $k$ -th “subchannel” has a center frequency $f_{k} = k Δ_{f}$ corresponding to a subcarrier or tone, and frequency response $H(f_{k})$ that is assumed flat, as depicted in Figure 4.2. For convenience, the squared magnitude $|H(f_{k}) |^{2}$ is denoted as $g_{k}$ .

Figure 4.43: Frequency-selective channel modeled as a set of parallel “subchannels” and corresponding water filling solution.

Assuming the WGN $ν(t)$ at the receiver has a unilateral PSD value of $N_{0}$ , the noise power on each “subchannel” is

P_{k} = N_{0} Δ_{f} .

(4.55)

To achieve capacity, the total transmit power $P$ needs to be distributed among the $K$ subcarriers such that $∑_{k=1}^{K} P_{k} = P$ . This power allocation problem does not appear in the AWGN of Figure 4.6. Figure 4.43(b) indicates the solution, which is known as “water filling”, and provides the optimum power value $P_{k}^{∗}$ as

P_{k}^{∗} = max (C − \frac{N_{0}}{g_{k}},0),

(4.56)

where $C$ is known as the “water level”. If the first argument of the max function in Eq. (4.56) is negative, $P_{k}^{∗} = 0$ and no power is allocated to transmission using the $k$ -th subcarrier.

The water filling solution will be further discussed in the sequel, but note that after the optimum power allocation is obtained, Eq. (4.53) can be used to calculate the total capacity in bits per channel use as

C_{[bpcu]} = ∑_{k=1}^{K} {log}_{2} (1 + {SNR}_{k})

(4.57)

where ${SNR}_{k}$ is the SNR at subcarrier $k$ . It is assumed in Eq. (4.57) that each subcarrier uses $D = 2$ dimensions.

Eq. (4.57) can be converted to bits per second, by taking in account that each subchannel has a bandwidth $Δ_{f} = BW ∕K$ . Using $R_{sym}^{max} = Δ_{f}$ and Eq. (4.57), the capacity of the parallel channels is

C_{[bps]} = R_{sym}^{max} C_{[bpcu]} = Δ_{f} ∑_{k=1}^{K} {log}_{2} (1 + {SNR}_{k}).

(4.58)

Assuming that $r_{k}$ is the PSD value of the received signal (after passing the channel), which can be assumed¹⁶ constant over $Δ_{f}$ , ${SNR}_{k}$ is the ratio between the received power $r_{k} Δ_{f}$ and the noise power $N_{0} Δ_{f}$ , which can be simplified to

{SNR}_{k} = \frac{r_{k} Δ_{f}}{N_{0} Δ_{f}} = \frac{r_{k}}{N_{0}} .

(4.59)

Eq. (4.59) indicates that, instead of using power values, ${SNR}_{k}$ can be obtained from the associated PSD values given that $Δ_{f}$ is canceled out.

The PSD at the transmitter can be taken in account via Eq. (F.40), which allows to write $r_{k} = s_{k} | H_{k} |^{2} = s_{k} g_{k}$ . Hence, the SNR in terms of the transmit PSD, channel response and noise at receiver is

{SNR}_{k} = \frac{r_{k}}{N_{0}} = \frac{s_{k} g_{k}}{N_{0}} .

(4.60)

Substituting Eq. (4.60) into Eq. (4.58) leads to

C_{[bps]} = Δ_{f} ∑_{k=1}^{K} {log}_{2} (1 + \frac{s_{k}^{∗} |H(f_{k}) |^{2}}{N_{0}}),

(4.61)

where $s_{k}^{∗}$ denotes the optimum transmit PSD.

Water filling for a finite set of parallel channels

As mentioned, the optimum solution to the power loading problem is the water filling. More specifically, the power loading problem is the following: maximize Eq. (4.57) subject to a maximum transmit power $Δ_{f} ∑_{k=1}^{K} s_{k} = P$ . The problem is convex and can be solved using a Lagrange multiplier. The optimality relies on the fact that Eq. (4.57) is a convex (in fact, concave) function of the PSD, which influences the SNR according to Eq. (4.60). Before delving in the the associated mathematics, a very simple example of water filling will be described.

Example 4.18. Power loading with only two tones. The problem here is to allocate power considering that there are only $K = 2$ parallel channels. At the tones $k = 0$ and 1, the channel has squared magnitude $g_{0} = |H(f_{0}) |^{2} = 0.1$ and $g_{1} = |H(f_{1}) |^{2} = 1 0^{−10}$ , respectively. The number of channel uses per second is $R_{sym} = 4$ kbauds and $Δ_{f} = 4.3125$ kHz is adopted. The background noise is modeled as having a white (unilateral) PSD with $N_{0} = −140$ dBm/Hz, which corresponds to $1 0^{−17}$ W/Hz and a noise power of $N_{0} Δ_{f} = 4.3125 × 1 0^{−14}$ W per tone. It is assumed that both tones can use $D = 2$ and transmit QAMs. The maximum transmit power is 1 mW, such that $P_{0} + P_{1} = 1$ mW. Figure 4.44 shows the rate for each pair $(P_{0}, P_{1})$ . Figure 4.44 indicates that the rate obtained with Eq. (4.57) is a concave function of the PSD.

Figure 4.44: Total rate for different allocations of power among $K = 2$ tones with channel squared magnitudes $g_{0} = 0.1$ and $g_{1} = 1 0^{−10}$ .

When all the power is allocated to the worst tone $k = 1$ , the total rate is only $R ≈ 7$ kbps. When the total power is allocated to $k = 0$ , the rate is 124.7 kbps. The best allocation is $P_{0} = 0.7072$ mW and $P_{1} = 0.2928$ mW, which achieves $R = 125.8$ kbps, as indicated in Figure 4.44 by the marker. The main point here is to observe that convexity can be assumed and this enables easier procedures to find the optimum solution. $□$

Watefilling solution via Lagrange multiplier

Given that Eq. (4.57) is convex, it is possible to maximize it using a Lagrange multiplier to constrain the total power. Hence, the Lagrangian can be formulated as

L = ∑_{k=1}^{K} {log}_{2} (1 + \frac{g_{k} s_{k}}{N_{0}}) + λ(∑_{k=1}^{K} s_{k} Δ_{f} − P).

For a specific subchannel $j$ , calculating $d L∕d s_{j} = 0$ decouples $j$ from the tones $k≠j$ as follows:

\frac{d L}{d s_{j}} = \frac{d}{d s_{j}} [{log}_{2} (1 + \frac{g_{j} s_{j}}{N_{0}}) + λ(s_{j} Δ_{f} − P)] = \frac{g_{j} ∕ N_{0}}{(1 + (g_{j} s_{j})∕ N_{0}) ln 2} + λ Δ_{f} = 0,

which can be simplified to

s_{j} + \frac{N_{0}}{g_{j}} = \frac{− 1}{λ Δ_{f} ln 2} = C, the water level constant .

(4.62)

Hence, the solution, which was anticipated in Eq. (4.56) is to pour power according to the transmit PSD $s_{k}$ (“water”) on a curve specified by $N_{0} ∕ g_{k}$ , which can be seen as an “inverse SNR” such that the summation of these two parcels corresponds to the water level $C$ , which is the same for all tones and needs to be found by the algorithm. Listing 4.15 lists the first lines of a simple water filling algorithm.¹⁷ The fourth parameter (gap) of ak_simplewaterfill will be discussed in Section 5.8 and its default value can be assumed here.

Listing 4.15: MatlabOctaveFunctions/ak_simplewaterfill.m

1function [powerPerTone,powerWaterlevel,SNR] = ... 
2    ak_simplewaterfill(g,noisePowerPerTone,totalPower,gap) 
3% Inputs: 
4%  g - squared magnitude of the channel frequency response 
5%  noisePowerPerTone - noise power per tone (scalar array 
6%                     with the same dimension of g) 
7%  totalPower  total power (or energy) 
8%  gap - gap (not in dB, but linear scale). Default value is 1 
9% Outputs: 
10%  powerPerTone - optimum power allocation per tone 
11%  waterlevel - power (not PSD) level of the water filling 
12%  SNR - signal to noise ratio in linear scale (not dB) 
13% It assumes all the channels are of the same type (e.g. 
14% PAM or QAM) and have the same dimension (1 or 2)

With the help of ak_simplewaterfill.m (Listing 4.15), the solution that maximizes rate in Figure 4.44 can be easily obtained with Listing 4.16.

Listing 4.16: MatlabOctaveCodeSnippets/snip_channel_waterfilling.m

1g=[0.1 1e-10]; %channel squared magnitudes 
2deltaf = 4.13125e3; %subchannel BW (borrowed from ADSL standard) 
3N0 = 1e-17; %noise PSD in Watts/Hz, corresponding to -140 dBm/Hz 
4noisePowerPerTone = N0 * deltaf; %noise power at each tone 
5totalPower=1e-3; %total power (Watts) to be shared between tones 
6[powerPerTone,waterlevel,SNR]=ak_simplewaterfill(g,... 
7  noisePowerPerTone,totalPower) %solution provided by waterfilling 
8bitsPerTone = log2(1 + (powerPerTone.*g)/noisePowerPerTone) %bits

Note that water filling in Listing 4.16 provided 0.706562 and 0.29344 mW as the optimum power allocation, which is slightly different (and better) than the values in Example 4.18. The reason is that Example 4.18 used a discrete grid to search for the optimum and the grid resolution did not allow to find it exactly.

Example 4.19. Another example of water filling. For another water filling example, consider the conversion of a channel into a set of $K = 7$ parallel channels with squared magnitudes $[1,0.1,1.1,0.04,0.05,1.2,0.1]$ . The total power is $P = 5 × 1 0^{−13}$ W. Assuming the unilateral white noise PSD has value $N_{0} = 1 0^{−17}$ W/Hz, and the subchannel BW is $Δ_{f} = 1$ kHz, the noise power at each tone is $N_{0} Δ_{f} = 1 0^{−14}$ W. The script MatlabOctaveBookExamples/ex_channel_waterfill.m was used with these values to obtain Figure 4.45.

Figure 4.45: Watterfilling example with $K = 7$ tones, in which there was no power allocated to the fourth and fifth tones.

As indicated in Figure 4.45, the water level in this case is $C = 1.4548 × 1 0^{−16}$ . The optimum PSD is S $= 1 0^{−15} × [0.1355,0.0455,0.1364,0,0,0.1372,0.0455]$ .

Power was not allocated to tones 4 and 5 due to their $N_{0} ∕ g_{k}$ being higher than $C$ . Most of the power is allocated to tones 1, 3 and 6, due to their relatively high gains. In this case, the SNR per tone is $[13.5485,0.4548,15.0033,0,0,16.4582,0.4548]$ in linear scale and, assuming $D = 2$ dimensions can be used, the capacity is 13.07 bits per channel use. $□$

After a channel is partitioned in parallel channels and the available power is distributed among these channels, Eq. (4.53) or Eq. (4.54) can be used to determine their capacities. The capacity can then be used to estimate the (maximum) bit rate.

The power loading problem is related to the bit loading problem, as will be discussed in Chapter 6. Often the number of bits (constellation cardinality) must be integer valued.¹⁸ In this case, simply using a continuous water filling algorithm for power loading, and then truncating (or rounding) the number of bits provided by capacity equations leads a simple but suboptimal solution.¹⁹

Water filling for continuous frequency

The capacity of a frequency selective LTI Gaussian channel with frequency response $H(f)$ and bandwidth BW is also achieved with a water filling solution, but now in a continuous frequency $f$ . This water filling can be obtained by letting the number $K$ of subchannels go to infinite, which makes the bandwidth $Δ_{f} = BW ∕K$ of each subchannel go to zero. Eq. (4.56) then converges to

S^{∗} (f) = max (C − \frac{N_{0}}{|H(f) |^{2}},0),

(4.63)

where the water level $C$ is chosen such that

∫_{0}^{BW} S^{∗} (f) d f = P

(4.64)

and $S^{∗} (f)$ is the optimum transmit PSD.

4.10.3 Capacity of the frequency-selective LTI Gaussian channel

Using the optimum PSD provided by Eq. (4.63), the capacity of the frequency-selective LTI Gaussian channel in bits per second is then

C_{[bps]} = ∫_{0}^{BW} {log}_{2} (1 + \frac{S^{∗} (f)|H(f) |^{2}}{N_{0}}) d f.

(4.65)

Eq. (4.65) can be obtained from the capacity expression from a set of $K$ parallel channels and making $K →∞$ . More specifically, substituting $Δ_{f} = BW ∕K$ into Eq. (4.61), leads to

C_{[bps]} = \frac{BW}{K} ∑_{k=1}^{K} {log}_{2} (1 + \frac{s_{k}^{∗} |H(f_{k}) |^{2}}{N_{0}}),

(4.66)

which converges to Eq. (4.65) when $Δ_{f} → 0$ and the discrete frequencies $f_{k}$ become $f$ to describe a continuous frequency response $H(f)$ .

4.10.4 Revisiting the capacity of flat fading continuous-time AWGN

Note that the channels in Figure 4.6, with a flat frequency response, can be seen as special cases of the frequency-selective LTI Gaussian channel. Applying water filling as in Eq. (4.63) to the channels in Figure 4.6 would lead to flat transmit PSDs $S_{x} (f) = P∕ BW$ with BW, where $P$ is the maximum transmit power (unilateral PSDs are adopted). Using Eq. (F.40), the PSD at the receiver is $S_{y} (f) = κ^{2} S_{x} (f)$ , where $κ = |H(f)|$ is the frequency response magnitude over the channel passband. Hence, the SNR at the receiver is

SNR = \frac{S_{y} (f) BW}{N_{0} BW} = \frac{κ^{2} S_{x} (f)}{N_{0}} = \frac{κ^{2} P}{N_{0} BW} .

(4.67)

Substituting Eq. (4.67) into Eq. (4.52) leads to

C_{[bps]} = BW {log}_{2} (1 + SNR) = BW {log}_{2} (1 + \frac{κ^{2} P}{N_{0} BW})

(4.68)

Hence, only when the channel is frequency-selective the water filling solution is required for obtaining the optimum transmit PSD and, from that, the channel capacity. Alternatively, Eq. (4.68) suffices to calculate the capacity for channels with flat frequency responses.

Example 4.20. Example of capacity calculation for a continuous-time AWGN channel. Assume that the bandpass channel depicted in Figure 4.6(b) has $BW = 20$ Hz and magnitude $κ = |H(f)| = 1∕2$ over the passband. The WGN has a unilateral PSD level $N_{0} = 1$ mW/Hz and the maximum transmit power is $P = 240$ mW.

Using Eq. (4.68), the capacity of this channel is

C_{[bps]} = BW {log}_{2} (1 + \frac{κ^{2} P}{N_{0} BW}) = 20 {log}_{2} (1 + \frac{(1∕4)240}{20}) = 40.

For practicing the interpretation of Eq. (4.61), consider that the current channel is modeled as a set of $K = 4$ subchannels, each with $Δ_{f} = BW ∕K = 5$ Hz. Because the channel is flat, the water filling solution would equally split $P$ among the four subchannels, with $P_{k} = P∕K = 60$ mW being allocated to each, corresponding to a PSD level of $s_{k}^{∗} = P_{k} ∕ Δ_{f} = 12$ mW/Hz, $∀k$ . In this case, the capacity per channel use of a subchannel is

C_{[bpcu]} = {log}_{2} (1 + \frac{s_{k}^{∗} |H(f_{k}) |^{2}}{N_{0}}) = {log}_{2} (1 + \frac{12(1∕4)}{1}) = 2,

which leads to

C_{[bps]} = Δ_{f} ∑_{k=1}^{K} C_{[bpcu]} = 5 ∑_{k=1}^{4} 2 = 40,

which coincides with the result from Eq. (4.68). $□$

¹⁵ The proof for Eq. (4.52) and other capacity results can be found in [TV05], which is available online at [ url7tse].

¹⁶ The value of $Δ_{f}$ is supposed to be small enough for this assumption to be valid.

¹⁷ For better understanding the code, the reader can use Chapter 4 of [Cio10].

¹⁸ A method to obtain a non-integer number of bits per symbol are the multidimensional constellations (lattice codes) [FJW89].

¹⁹ Algorithms such as the Levin-Campello’s provide optimal solutions to the power and bit loading problems when the number of bits per parallel channel must be an integer.