2.10  Applications

This section will briefly discuss some applications of transforms.

Application 2.1. Example of Gram-Schmidt transform. As an example of the Gram-Schmidt procedure, assume that Listing A.4 is invoked with the commands

x=[0,-1,-1,0;  0,2,2,0;  0,1,0,1;  1,1,1,1;  -2,2,2,1] %row vectors
[Ah,A]=ak_gram_schmidt(x) %perform the orthonormalization procedure
X=Ah*transpose(x(1,:)) %coefficients corresponding to first vector
x1=A*X; %reconstruction of first vector x(1,:) (as a column vector)

where M = 5 vectors and D = 4 is the space dimension. In this case, the number of orthonormal basis functions is N = 4, such that both matrices Ah and A have dimension 4 × 4. The basis functions correspond to the columns of matrix A. The first element of X, corresponding to the first basis function, is the only non-zero element and has value √2, which is the coefficient that must multiply the first basis function [0, −√2/2, −√2/2, 0] to reconstruct the first vector [0, −1, −1, 0]. The reason is that ak_gram_schmidt chooses the first basis as a normalized (unit-norm) version of the first vector in x.
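The claim about the first coefficient can be verified with a few standalone lines (a minimal check that does not require ak_gram_schmidt):

x1row = [0,-1,-1,0];             %first input vector
a1 = x1row/norm(x1row)           %first basis: [0 -0.7071 -0.7071 0]
coefficient = a1*x1row'          %equals norm(x1row) = sqrt(2) = 1.4142
reconstruction = coefficient*a1  %recovers [0 -1 -1 0]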

Debugging the execution of the code ak_gram_schmidt step-by-step allows one to observe more details, as listed in Listing 2.4.

Listing 2.4: MatlabOctaveCodeSnippets/snip_transforms_granschmidt_debug.m. [ Python version]
tol = 1.1102e-015  %calculated (default) tolerance
%first basis in y is [0  -0.7071  -0.7071  0], numBasis=1
k = 2, m = 1
projectionOverBasis = [0 2.0 2.0 0] %2nd input vector
errorVector = 1.0e-015 * [0    0.4441    0.4441  0]
magErrorVector = 6.2804e-016 %do NOT add error vector to basis set
%2nd vector is already represented, go to next iteration
k = 3, m = 1
projectionOverBasis = [0 0.5 0.5 0] %3rd input vector
%errorVector below is orthogonal to [0 -0.7071 -0.7071 0]
errorVector = [0 0.5 -0.5 1]
magErrorVector = 1.2247 %add normalized error vector to basis set
%second basis is [0 0.4082 -0.4082 0.8165], numBasis=2
k = 4, m = 1
projectionOverBasis = [0 1.0 1.0  0] %4th input vector
errorVector = [1.0 0.0 0.0 1] %using 1st basis
k = 4, m = 2
projectionOverBasis = [0.0  0.3333   -0.3333    0.6667]
errorVector = [1.0 -0.3333 0.3333 0.3333] %using 2 basis vectors
magErrorVector = 1.1547 %add normalized error vector to basis set
%3rd basis is [0.8660 -0.2887 0.2887 0.2887], numBasis=3
k = 5, m = 1
projectionOverBasis = [0 2.0 2.0 0] %5th input vector
errorVector = [-2.0 0.0 0.0 1.0] %using only 1st basis
k = 5, m = 2
projectionOverBasis = [0 0.3333 -0.3333 0.6667]
errorVector = [-2.0 -0.3333 0.3333 0.3333] % using 2 basis vectors
k = 5, m = 3
projectionOverBasis = [-1.250 0.4167 -0.4167 -0.4167]
errorVector = [-0.75 -0.75 0.75 0.75] %using 3 basis vectors
magErrorVector = 1.5000 %add normalized error vector to basis set
%4th basis is [-0.5 -0.5 0.5 0.5], numBasis=4
%abort because (numBasis >= N)
  

The reader can perform a similar analysis for another set of vectors and compare the results with a calculation done by hand.

Application 2.2. Time-localization property of Haar coefficients.


Figure 2.33: Signal x[n] = δ[n − 11] analyzed by 32-point DCT and Haar transforms. The right column shows the transform coefficients (the abscissa is k). The left column shows time-domain plots. Below the signal itself, one can visualize the “best” basis function of each transform (the one corresponding to the coefficient with the largest absolute value).

A very simple experiment helps illustrate the time-localization property of the Haar functions. Consider that the signal to be analyzed in the transform domain is a single impulse x[n] = δ[n − n0], n0 = 0,…,31, which is represented by a vector x with all elements equal to zero except the one at position n0. The script MatlabOctaveBookExamples/ex_transforms_dcthaar_example.m can be used to observe the behavior of the transforms. Figure 2.33 shows the output of the script when the signal is x[n] = δ[n − 11]. In this case, many Haar coefficients are zero and the one with the largest magnitude (k = 21) has a corresponding basis function (bottom of the left column) that helps to localize the occurrence of the impulse in x[n]. In contrast, most DCT coefficients have relatively large values and the basis functions do not help in time-localization. The mentioned script allows investigating this aspect more deeply. Compare the results of the DCT and Haar transforms when the signal is a cosine added to an impulse.
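For readers without the book scripts at hand, the following sketch reproduces the gist of the experiment: it builds a 32 × 32 orthonormal DCT-II matrix directly from its definition and a standard normalized Haar matrix via the usual recursive construction (both constructions are assumptions of this sketch, not the book's ak_* code), and then counts the non-zero coefficients of δ[n − 11] in each domain.

N = 32; n = 0:N-1; k = (0:N-1)';
Adct = sqrt(2/N)*cos(pi*(k*(2*n+1))/(2*N)); %DCT-II rows
Adct(1,:) = Adct(1,:)/sqrt(2);              %scale first row to unit norm
H = 1;                                      %recursive (non-normalized) Haar matrix
while size(H,1) < N
    H = [kron(H,[1 1]); kron(eye(size(H,1)),[1 -1])];
end
Ahaar = diag(1./sqrt(sum(H.^2,2)))*H;       %normalize each row to unit norm
x = zeros(N,1); x(12) = 1;                  %impulse at n0=11 (Matlab index 12)
numNonzeroDCT  = sum(abs(Adct*x)  > 1e-12)  %all 32 DCT coefficients are non-zero
numNonzeroHaar = sum(abs(Ahaar*x) > 1e-12)  %only log2(N)+1 = 6 Haar coefficients

The impulse excites essentially all DCT coefficients but only a handful of Haar coefficients, which is the time-localization property discussed above.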

Application 2.3. Inverse Laplace transforms with Matlab’s Symbolic Math Toolbox. Keeping in mind that Matlab adopts the unilateral Laplace transform, its Symbolic Math Toolbox can be used to calculate Laplace transforms and their inverses. For example, the unilateral Laplace transform X(s) = 1/s of u(t) can be obtained with the commands syms t; laplace(t / t). Similarly, the commands syms w0 t; laplace(sin(w0*t)) indicate that X(s) = ω0/(s² + ω0²) for x(t) = sin(ω0t)u(t). The following commands illustrate the inverse transform of X(s) given by Eq. (2.54):

Listing 2.5: MatlabOnly/snip_transforms_ilaplace.m. [ Python version]
syms s %defines s as a symbolic variable
a=1; b=-2; c=-1+j*2; %choose poles and zeros
X=(s-a)/((s-b)*(s-c)*(s-conj(c))); %define X(s)
ilaplace(X) %Matlab's inverse unilateral Laplace transform
  

The command pretty (e. g., pretty(ilaplace(X))) can be used to beautify the output, which in this case is - 3/5 exp(-2 t) + 1/5 exp(-t) (3 cos(2 t) + sin(2 t)). Find the inverse Laplace transform for other signals, such as x(t) = cos (ω0t)u(t).    
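Assuming the Symbolic Math Toolbox is available, the suggested exercise can be checked with a similar sketch, which should return X(s) = s/(s² + ω0²) for x(t) = cos(ω0t)u(t):

syms t w0               %symbolic variables
X = laplace(cos(w0*t))  %should return s/(s^2 + w0^2)
xt = ilaplace(X)        %should return cos(w0*t), i.e., cos(w0*t)u(t)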

Application 2.4. ECG transform coding. Because a block transform is completely specified by an invertible matrix A, going from one domain to another (x to X or vice-versa) is lossless, which means there is no loss of information. In many applications, such as image coding, where the goal is to minimize the number of bits to represent an image, it is useful to compress the signal using a lossy algorithm. Transform coding becomes a lossy algorithm when it discards or quantizes coefficients in the transform domain. This is effective in many applications because, while all samples in x have the same “importance”, in the transform domain X the coefficients can be organized according to some hierarchy or rank. In transform coding, the most “important” coefficients are quantized more carefully than the unimportant ones.

Figure 2.34 depicts a segment (the first 1,500 samples) of an ECG signal. The ECG data used here is from the MIT-BIH Arrhythmia Database,18 and Figure 2.34 corresponds to the first channel of file 12531_04.dat. The function ak_rddata.m indicates that the DC offset of the ADC chip was calibrated to be zero. It also indicates that the ECG signals were digitized with a sampling frequency Fs = 250 Hz, 12 bits per sample, and a quantizer step ΔAD = 1/400 mV. The (empirical) signal power is 3.07 mW and, following the conventional quantization model, Eq. (1.20) suggests the quantization noise power was 5.2 × 10⁻¹⁰ mW. Hence, the SNR corresponding to the quantization stage alone is approximately 97.7 dB.
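These values can be verified with the conventional Δ²/12 noise model (a quick check, with powers expressed in watts over 1 Ω):

delta = (1/400)*1e-3;           %quantizer step of 1/400 mV, in volts
noisePower = delta^2/12         %approximately 5.2e-13 W = 5.2e-10 mW
signalPower = 3.07e-3;          %empirical signal power of 3.07 mW, in watts
SNRdB = 10*log10(signalPower/noisePower) %approximately 97.7 dB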


Figure 2.34: A segment of one channel of the original ECG data.

A very simple example of a transform coding system is provided as Matlab/Octave code in the companion software (in directory Applications/ECGTransformCoding). The idea is to simply discard the higher-order coefficients in the transform domain. For example, assume that N is the dimension of x and X, which are related by an N × N transform matrix A. In the encoding stage, a discrete-time input signal x[n] is segmented into blocks of dimension N, composing a set of M input vectors {x1,…,xM}, where N × M is the available number of samples of x[n]. The coding scheme converts each xi into Xi = A^H xi and keeps in a new vector X^i only the first K ≤ N elements of Xi (the remaining N − K are discarded). The M vectors X^i can be concatenated to create an encoded output signal X^[n] with M × K samples.

In the decoding stage, in order to reconstruct a signal x′[n] from X^[n] (if there were no losses, x′[n] = x[n]), an inverse procedure is adopted. The signal X^[n] is blocked into vectors of dimension K and N − K zero elements are inserted to create vectors X′i of dimension N (this operation is called zero-padding). Each of these N-dimensional vectors is converted to a time-domain vector x′i = AX′i, and these vectors are then concatenated to form x′[n].
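A minimal sketch of this encoding/decoding chain is given below. It builds the DCT matrix from its definition and uses a random signal as a stand-in for the ECG samples; the companion code in Applications/ECGTransformCoding is the reference implementation.

N = 32; K = 6;                              %block size and number of kept coefficients
n = 0:N-1; k = (0:N-1)';
Ah = sqrt(2/N)*cos(pi*(k*(2*n+1))/(2*N));   %A^H: orthonormal DCT-II matrix
Ah(1,:) = Ah(1,:)/sqrt(2);
A = Ah';                                    %inverse transform (A is orthogonal)
x = randn(1, 100*N);                        %stand-in for M*N samples of x[n]
xBlocks = reshape(x, N, []);                %each column is one vector x_i
XBlocks = Ah*xBlocks;                       %X_i = A^H x_i
Xhat = XBlocks(1:K,:);                      %encoder keeps only the first K coefficients
Xpadded = [Xhat; zeros(N-K, size(Xhat,2))]; %decoder: zero-padding back to dimension N
xRec = reshape(A*Xpadded, 1, []);           %x'_i = A X'_i, concatenated into x'[n]
MSE = mean((x - xRec).^2)                   %power of the error signal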

Figure 2.35 was obtained using a DCT matrix A of N = 32 points and discarding 26 (high-frequency) coefficients. It can be seen that keeping only K = 6 out of 32 coefficients is enough to provide a rough representation of x[n] but the error is significant in high-frequency regions. Note that the DCT operation is performed in blocks of N = 32 samples and Figure 2.35 is the result of processing many of these blocks.


Figure 2.35: Original and reconstructed ECG signals with DCT of N = 32 points and discarding 26 (high-frequency) coefficients. Therefore, the error is predominant in high-frequency regions as indicated in the plot.

In practice, a quantization scheme should be adopted to represent X^[n] with a small number of bits per sample. The compression ratio is the number of bits to represent x[n], n = 0,…,M × N − 1, divided by the number of bits to represent X^[n], n = 0,…,M × K − 1. For simplicity, the proposed example does not involve quantization and the compression ratio is evaluated as N/K. This corresponds to assuming that each sample of both x[n] and X^[n] is represented with the same number of bits.

Figure 2.36 shows the percentage of kept coefficients K/N in the abscissa and 10 log₁₀(MSE) in the ordinate, where

MSE = (1/(M × N)) ∑_{n=0}^{M×N−1} (x[n] − x′[n])²

is the mean-squared error, which is equivalent to the power of the error signal. If K = N (100% in the abscissa), the system is lossless and MSE = 0 (−∞ in log scale), so the graphs do not show these points. The larger N, the better the coding performance, but the higher the computational cost.

Assuming N = 128 and K = 10 as indicated in Figure 2.36, the abscissa is K/N = 10/128 ≈ 7.81% and the corresponding error is MSE = 1.4824 × 10⁻⁸ W (≈ −78.29 dBW) or, equivalently, −48.29 dBm. Given that the signal power was estimated as 3.07 mW (4.87 dBm), the SNR is approximately 4.87 + 48.29 ≈ 53.16 dB when considering only the distortion incurred by this specific encoding procedure.
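These dB values follow directly from the definitions, as a three-line check confirms:

MSE_dBW = 10*log10(1.4824e-8)  %approximately -78.29 dBW
MSE_dBm = MSE_dBW + 30         %approximately -48.29 dBm
SNRdB = 4.87 - MSE_dBm         %approximately 53.16 dB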


Figure 2.36: Performance of five DCT-based ECG coding schemes. The number of points is varied as N ∈ {4,8,32,64,128} and K = 1,2,…,N − 1. The larger N, the better the coding performance, but the higher the computational cost.

It should be noted that the dataset used here is formed by abnormal ECG signals and, possibly, better results could be obtained with another dataset.

Write an encoder that uses DCT coding to generate files of relatively small size and a decoder to “decompress” such files and recover an ECG signal (an approximation of the original one). To do that, it is necessary to quantize the DCT coefficients and pack them in a way that the resulting file has a small size. Consider using scalar or vector quantization. The advantage of vector quantization is that you need to store only the index of the codebook entry (codeword), i.e., of the vector that will represent the DCT coefficients. If you use a codebook with 2^8 = 256 codewords, each index can be conveniently written as an 8-bit unsigned char. To design the codebook, besides MathWorks’ code in the DSP System Toolbox, there are many algorithms, such as K-means, with implementations available on the Web.

Another interesting exercise is to apply KLT and other transforms to this problem, trying to get better results than with the DCT.    

Application 2.5. DCT coding of images.

The usefulness of the DCT in coding is illustrated by the JPEG image coding standard.19 Before describing an example, two-dimensional transforms are briefly discussed.

A linearly-separable two-dimensional (2-D) transform of a block x (e. g., a matrix with pixel values) is obtained by first applying the transform along the rows (or columns) and then transforming this result along the columns (or rows). Alternatively, one can use matrix notation: X = A^H x A. Because A is real for a DCT, one has

X = A^T x A.

This is equivalent to first calculating T = A^H x: the 1-D DCT of each column of x is placed in its corresponding column of a temporary matrix T. Then, each row of T is transformed by another 1-D DCT, which can be accomplished by A^H T^T. This result should be transposed to generate X. These operations are equivalent to:

X = (A^H (A^H x)^T)^T = A^H x A.
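The equivalence between the two-step (row/column) procedure and the matrix form can be checked numerically. In the sketch below the DCT matrix is built from its definition; octave_dctmtx(8), used in Listing 2.6, should produce the same A^H.

N = 8; n = 0:N-1; k = (0:N-1)';
Ah = sqrt(2/N)*cos(pi*(k*(2*n+1))/(2*N)); %A^H: orthonormal DCT-II matrix
Ah(1,:) = Ah(1,:)/sqrt(2);
A = Ah';                      %A is real, so A^H = A^T
x = magic(N);                 %any 8 x 8 block of pixel values
T = Ah*x;                     %1-D DCT of each column
X1 = (Ah*T')';                %1-D DCT of each row of T, then transpose
X2 = Ah*x*A;                  %direct matrix form X = A^H x A
maxDifference = max(abs(X1(:)-X2(:))) %zero up to numerical precision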


Figure 2.37: A zoom of the eye region of the Lenna image.

Listing 2.6 illustrates the adoption of a 2-D DCT for coding an image block (represented by a matrix).

Listing 2.6: MatlabBookFigures/figs_transforms_dctimagecoding
%load black & white 256 x 256 pixels of Lenna:
fullPath='lenna_bw.gif' %file location
[lenaImage,map]=imread(fullPath); %read file
colormap(map); %use the proper color map
x1=128; x2=135; y1=160; y2=167; %to take a 8x8 block close to her eye
x=lenaImage(x1:x2,y1:y2); %get the region
x=double(x); %cast x to be double instead of integer
%draw a rectangle to indicate the block:
lenaImage(x1,y1:y2)=1; lenaImage(x2,y1:y2)=1;
lenaImage(x1:x2,y1)=1; lenaImage(x1:x2,y2)=1;
imagesc(lenaImage) %show the image
axis equal %correct the aspect ratio
%Some calculations to practice:
X=octave_dct2(x); %calculate forward DCT
x2=octave_idct2(X); %calculate inverse DCT to check
numericError = max(x(:)-x2(:)) %any numeric error?
Ah=octave_dctmtx(8); %now calculate in an alternative way
X2 = Ah*x*transpose(Ah);
%X2 = transpose(Ah)*x*Ah; %Note: this would be wrong!
numericError2 = max(X(:)-X2(:)) %any numeric error?
  

Similar to the 1-D ECG coding example in Application 2.4, a few DCT coefficients can be enough to represent an image. Study the code MatlabOctaveFunctions/ak_dctdemo.m, which uses the DCT to represent different images,20 including ones with text. Even better results can be obtained with the wavelet transform, which is used in the JPEG 2000 image coding standard. Write an encoder and a decoder that use a transform to compress and decompress an image file.

Application 2.6. Orthogonality of continuous-time sinusoids over a given interval. In applications such as frequency modulation, it is important to know the conditions under which two cosines with frequencies f1 and f2 and phase θ are orthogonal over a given interval Tsym of interest. Mathematically, this requirement can be expressed as

∫⟨Tsym⟩ cos(2πf1t + θ) cos(2πf2t + θ) dt = 0

and, because the phase 𝜃 is the same for both cosines, this requirement is called “coherent orthogonality”. It can be shown21 that this condition is satisfied with integers m and n such that

f1 = (2n − m)/(4Tsym),   f2 = (2n + m)/(4Tsym)   and   f2 − f1 = m/(2Tsym).

For example, assume that Tsym = 0.01 seconds, m = 4 and n = 35. In this case, f1 and f2 are 1650 and 1850 Hz, respectively. Note that the minimum value of f2 − f1 is 1/(2Tsym) = 50 Hz in this case, which corresponds to choosing m = 1.
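This orthogonality can be checked numerically by approximating the integral with a fine Riemann sum over one interval Tsym (a sketch; the common phase value is arbitrary):

Tsym = 0.01; f1 = 1650; f2 = 1850; theta = pi/5; %any common phase works
dt = 1e-6; t = 0:dt:Tsym-dt;                     %fine grid over one interval
innerProduct = sum(cos(2*pi*f1*t+theta).*cos(2*pi*f2*t+theta))*dt %approximately 0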

As a side note, for “noncoherent orthogonality”, the minimum frequency separation22 is 1/Tsym.

Application 2.7. Orthogonality of finite-duration discrete-time sinusoids. It is also important to evaluate under what conditions a pair of discrete-time sinusoids is orthogonal over a finite-duration interval.

Two discrete-time exponentials e^{jΩ0n} and e^{jΩ1n}, Ω0 ≠ Ω1, are always orthogonal when summed over the range −∞ to ∞ (see Example 2.1), but this is not the case when the summation interval is finite. One way of inspecting what happens when the duration is finite is to reason as follows: 1) keep in mind that the sum of the samples of a single sinusoid over a multiple of its fundamental period is zero; 2) interpreting Example 2.3 in discrete time allows decomposing the inner product that tests orthogonality into a sum of two sinusoids; 3) the inner product is thus mapped to summations of sinusoids with frequencies Ω0 + Ω1 and Ω0 − Ω1, and the goal is to find an integer N that is a multiple of both fundamental periods.

One application of this orthogonality principle is the design of a binary transmission scheme in which the two signals are distinguished by their frequencies f0 and f1. This scheme is called frequency shift keying (FSK). We are interested in determining the minimum symbol period that guarantees orthogonality between the two sinusoids. In the context of telecommunications, this minimum period corresponds to the highest symbol rate. The script MatlabOctaveBookExamples/ex_transforms_check_orthogonality.m can be used to study this question and is reproduced below. The sampling frequency is Fs and one should check different pairs of f0 and f1.

Listing 2.7: MatlabOctaveBookExamples/ex_transforms_check_orthogonality.m
if 1 %choose between two options
    f0 = 980;   %frequency of first sinusoid (in telecom, of bit 0)
    f1 = 1180;  %frequency of 2nd sinusoid (in telecom, of bit 1)
else %another example
    f0 = 1650; %1st sinusoid
    f1 = 1850; %2nd sinusoid
end
Fs=9600; %assumed sampling frequency
sumSinusoidFrequency = f0+f1;
subractSinusoidFrequency = f1-f0;
%reduce numbers to rational forms
[m,n]=rat(sumSinusoidFrequency/Fs)
[p,q]=rat(subractSinusoidFrequency/Fs)
Nmin = lcm(n,q) %find the least common multiple
disp(['Minimum common period: ' num2str(Nmin)])
T_duration = Nmin / Fs; %minimum duration in seconds
%disp(['Maximum symbol rate: ' num2str(1/T_duration) ' bauds'])
%% Check if the product of the two sinusoids is really orthogonal
Tsum = 1 / sumSinusoidFrequency;
Tsub = 1 / subractSinusoidFrequency;
disp('Numbers below should be integers:')
T_duration/Tsum
T_duration/Tsub
Ts=1/Fs; %sampling interval
t = 0:Ts:T_duration-Ts; %generate (discrete) time
%use sin or cos with arbitrary phase
b0 = sin(2*pi*f0*t+4)';  %column vector
b1 = sin(2*pi*f1*t)';  %column vector
normalizedInnerProduct = sum(b0.*b1) / length(b0) %inner product
if (abs(normalizedInnerProduct) > 1e-13)
    error('not orthogonal! should not happen!');
end
  

Modify Listing 2.7 to use discrete-time sinusoids in radians, not Hz. Assume N = 8 and find a set of M sinusoids that are mutually orthogonal over an interval of eight consecutive samples. What was the maximum value of M that you could find? Now, consider the set of basis functions for a N-point DFT and evaluate their orthogonality. Going back to the question about M, can it be larger than N? Why?   

Application 2.8. Better visualizing the FFT spectrum: using fftshift to reorganize the spectrum and masking numerical errors. Two improvements over the procedure adopted to obtain Figure 2.14 are typically used. The first one concerns the range of k. Figure 2.14 reminds us that the definition of an N-point DFT assumes k = 0,…,N − 1. Sometimes it is more convenient to work with ranges that use “negative” frequencies. Typical ranges are k = −N/2, −N/2 + 1,…,0,1,…,N/2 − 1 when N is even and k = −(N − 1)/2, −(N − 1)/2 + 1,…,0,1,…,(N − 1)/2 when N is odd. In these cases, to represent negative frequencies (or angles, in discrete-time processing), the command fftshift can be used. For example, fftshift([0 1 2 3]) returns [2 3 0 1], while fftshift([0 1 2 3 4]) returns [3 4 0 1 2].


Figure 2.38: Alternative representation of the DTFS / DFT of x[n] = 10cos(πn/6 + π/3) using fftshift. Compare with Figure 2.14 and note that the current plots clearly indicate that, when x[n] is real, the magnitude and phase are even and odd functions, respectively.

The second one aims at eliminating spurious values. Note that, due to numerical errors, the elements of X that should be zero have small values such as -1.4e-15 - j3.3e-16, which leads to random angles for k ≠ 1, 11 in Figure 2.14. It is good practice to eliminate these random angles based on a threshold for the magnitude. This post-processing step can be done with a command such as X(abs(X) < 1e-12)=0 after invoking the Matlab/Octave fft routine.

Figure 2.38 benefits from these two improvements and was obtained using Listing 2.8.

Listing 2.8: MatlabOctaveCodeSnippets/snip_transforms_fftshift.m
k=-N/2:N/2-1; %range with negative k (assume N is even)
X(abs(X)<1e-12)=0;%discard small values (numerical errors)
X=fftshift(X); %rearrange to represent negative freqs.
subplot(211);stem(k,abs(X));subplot(212);stem(k,angle(X));
  

Note that fftshift does not calculate the FFT but simply reorders the elements in a vector.   

Application 2.9. DTFS analysis of single cosines. Figure 2.39 depicts five cosines xi[n] = 10cos(Ωin) with angular frequencies Ωi = i(2π/32) rad, where i = 0, 1, 2, 16, 31, and their respective DTFS spectra calculated with a DFT of N = 32 points and normalized by N.


Figure 2.39: Five cosine signals xi[n] = 10cos(Ωin) with frequencies Ωi = 0, 2π/32, 4π/32, π, 31π/16 for i = 0, 1, 2, 16, 31, and the real part of their DTFS using N = 32 points.

The first signal, x0[n] = 10, is a DC signal (Ω0 = 0); the second is x1[n] = 10cos((2π/32)n), where Ω1 = 2π/32 is the fundamental frequency in this DTFS analysis (given that N = 32); and the other three signals are cosines with the following harmonic frequencies: Ω2 = 2Ω1, Ω16 = 16Ω1 and Ω31 = 31Ω1 rad. Because the cosines are even functions, the imaginary part of their DTFS is zero.

Note in Figure 2.39 that the frequency increases up to the (N/2)-th harmonic Ω16 = 16Ω1 = π. When N is even, the coefficient of k = N/2 corresponds to Ω = (2π/N)k evaluated at k = N/2, which is π. It is important to note that the highest “frequency” of a discrete-time signal is always Ω = π rad.

Another aspect of Figure 2.39 is that for x0[n] and x16[n], corresponding to the angles Ω = 0 and Ω = π, the spectra have only one non-zero coefficient X[k], while the other three signals are represented by two coefficients. Figure 2.7, for N = 4 and N = 6, can be invoked to help interpret why the coefficients for angles other than Ω = 0 and Ω = π occur in pairs with Hermitian symmetry (i. e., X[−k] = X*[k] when x[n] is real). The basis functions for Ω = 0 and Ω = π are real signals, while the other angles lead to complex-valued basis functions. In the analysis of a real signal x[n], using the k-th complex basis requires also using the (−k)-th (equivalently, the (N − k)-th) basis, such that their imaginary parts cancel in the synthesis of the real x[n].
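A short check of this Hermitian symmetry (a sketch with an arbitrary phase):

N = 32; n = 0:N-1;
X = fft(10*cos(2*pi*2/32*n + pi/7))/N; %DTFS of a real signal
X(abs(X) < 1e-12) = 0;                 %mask numerical errors
[X(2+1) X(N-2+1)]                      %X[2] and X[N-2]=X[-2] are complex conjugates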

Application 2.10. DTFS analysis of a sum of sines and cosines. This example aims at illustrating how cosines and sines are mapped into the DTFS, especially when two of them have the same frequency. Figure 2.40 shows the spectrum of a more complex example obtained with Listing 2.9.

Listing 2.9: MatlabOctaveCodeSnippets/snip_transforms_DTFS_sinusoid.m. [ Python version]
N=32; %number of DFT-points
n=0:N-1; %abscissa to generate signal below
x=2+3*cos(2*pi*6/32*n)+8*sin(2*pi*12/32*n)+...
    4*cos(2*pi*7/32*n)+6*sin(2*pi*7/32*n);
X=fft(x)/N; %calculate DTFS spectrum via DFT
X(abs(X)<1e-12)=0; %mask numerical errors
  


Figure 2.40: Analysis with a 32-point DFT of x[n], composed of three sinusoids and a DC level. From top to bottom: x[n] and the real part, imaginary part, magnitude and phase of X[k].

Note in Figure 2.40 that the coefficient X[0] = 2 is due to the DC level; X[6] = X[−6] = 1.5 are due to the cosine 3cos(6(2π/32)n); X[12] = −4j and X[−12] = 4j are due to 8sin(12(2π/32)n); and X[7] = 2 − 3j and X[−7] = 2 + 3j are due to the two terms of frequency 7(2π/32), with 4cos(7(2π/32)n) being represented by the real part (2) and 6sin(7(2π/32)n) by the imaginary part (−3j).
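Assuming the vector X produced by Listing 2.9 is still in the workspace, these coefficients can be inspected directly; in the unshifted output of fft, X[k] is stored in X(k+1) and X[−k] in X(N−k+1):

[X(0+1) X(6+1) X(32-6+1)]   %X[0]=2 and X[6]=X[-6]=1.5
[X(12+1) X(32-12+1)]        %X[12]=-4j and X[-12]=4j
[X(7+1) X(32-7+1)]          %X[7]=2-3j and X[-7]=2+3j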

Application 2.11. Spurious frequency components: when the number N of DFT points is not a multiple of the signal period. Figure 2.41 illustrates the DTFS spectrum of a signal x[n] = 4cos((2π/6)n) with a period of 6 samples, obtained via a 16-point DFT. Note that, because 16 is not a multiple of 6, the spectrum has many non-zero spurious components. The coefficients X[3] = 0.99 − 1.48j and X[−3] = 0.99 + 1.48j, both with magnitude 1.78, are the closest to representing the angle 2π/6 = π/3 ≈ 1.05 rad. In fact, when using N = 16 points to divide 2π (see Figure 2.7), the angle increment is

ΔΩ = 2π/N,    (2.64)

which in this case is ΔΩ = π/8 ≈ 0.39 rad, and the DFT / DTFS can deal only with the angles [0, 0.39, 0.78, 1.18, 1.57,…, 5.89] rad. This explains why X[3] and X[−3] concentrate most of the signal power P = 4²/2 = 8. The other coefficients absorb the remaining power that “leaks” due to the imperfect match between the cosine frequency π/3 and the discrete grid of 16 points imposed by the DFT / DTFS.
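The numbers quoted above can be reproduced with a few lines (a sketch):

N = 16; n = 0:N-1;
X = fft(4*cos((2*pi/6)*n))/N;  %DTFS of the 16-sample segment
X(3+1)                         %X[3], approximately 0.99 - 1.48j
powerInPair = 2*abs(X(3+1))^2  %power in X[3] and X[-3], about 6.3
blockPower = sum(abs(X).^2)    %Parseval: 8.5, slightly above 4^2/2 = 8 because
                               %the block does not span an integer number of periods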


Figure 2.41: Spectrum of the signal x[n] = 4cos((2π/6)n) with a period of 6 samples, obtained with a 16-point DFT, which creates spurious components.

The creation of spurious components is associated with leakage and the picket-fence effect, which are discussed in Section 4.3. This phenomenon is very common when using the FFT to analyze signals that are non-periodic, have infinite duration, etc. In the current case, the spurious components could be avoided by choosing an appropriate number of points for the DFT / DTFS (a multiple of 6). In practice, spurious components due to FFT usage occur most of the time. Even when the original signal is periodic, say x(t) with period T, it is typically not guaranteed that NTs is a multiple of T, where N is the number of DFT points and Ts the sampling period.


Figure 2.42: Explicitly repeating the block of N = 16 cosine samples from Figure 2.41 to indicate that the spurious components are a manifestation of the lack of a perfect cosine in the time domain.

Figure 2.42 shows part of the signal obtained by repeating the segment of 16 samples depicted in Figure 2.41. This represents the signal “assumed” by the DFT / DTFS. Spurious components appear because, in spite of x[n] having a period of 6 samples, an abrupt transition (from −4 to 4) occurs at the boundary between segments of 16 samples. In general, the periodic extension (that is assumed by the FFT when approximating a DTFS) of a segment whose duration is not commensurate23 with the period of x[n] leads to discontinuities at the boundaries of the replicas.

Application 2.12. DTFS for periodic discrete-time pulses. Assume a periodic train of pulses x[n] with period N, with one period described by x[n] = 1 from n = 0 to N1 − 1 and x[n] = 0 from N1 to N − 1. Its DTFS coefficients are given by

X[k] = (1/N) ∑_{n=0}^{N−1} x[n] e^{−j(2π/N)nk} = (1/N) ∑_{n=0}^{N1−1} e^{−j(2π/N)nk} = (1/N) ∑_{n=0}^{N1−1} (e^{−j(2π/N)k})^n,

which is the sum of N1 terms of a geometric series with ratio r = e^{−j(2π/N)k}. Applying Eq. (A.20) leads to

X[k] = (1/N) (1 − e^{−j(2π/N)N1k}) / (1 − e^{−j(2π/N)k}).

This expression can be simplified by applying Eq. (A.14) to both numerator and denominator:

X[k] = (1/N) [sin(kN1π/N) / sin(kπ/N)] e^{−j(kπ/N)(N1−1)},    (2.65)

for k = 0,…,N − 1, and then periodically repeated in k. For k = 0 the expression is indeterminate (a 0/0 fraction) and L’Hospital’s rule leads to X[0] = N1/N.

Listing 2.10 obtains the DTFS of the periodic pulses in two distinct ways.

Listing 2.10: MatlabOctaveCodeSnippets/snip_transforms_DTFS_pulses.m. [ Python version]
N=10; N1=5; k=0:N-1; %define durations and k
Xk=(1/N)*(sin(k*N1*pi/N)./sin(k*pi/N)) .* ...
    exp(-j*k*pi/N*(N1-1)); %obtain DTFS directly
Xk(1)=N1/N; %eliminate the NaN (not a number) in Xk(1)
%second alternative, via DFT. Generate x[n]:
xn=[ones(1,N1) zeros(1,N-N1)] %single period of x[n]
Xk2=fft(xn)/N %DTFS via DFT, Xk2 is equal to Xk
  


Figure 2.43: Three periods of each signal: pulse train x[n] with N = 10 and N1 = 5 and amplitude assumed to be in volts (a), the magnitude (b) and phase (c) of its DTFS.

Figure 2.43 illustrates the result using N = 20 and N1 = 10. Note that |X[k]| has the behavior of a sinc function, as expected. The DTFS phase in Figure 2.43 c) is the combined effect of the linear phase e^{−j(kπ/N)(N1−1)} and the sign of sin(kN1π/N)/sin(kπ/N) in Eq. (2.65). This sign is responsible for Figure 2.43 c) not being a linear function of frequency.
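This decomposition of the phase into a linear term plus sign flips of ±π can be verified numerically (a sketch using the parameters of Listing 2.10):

N = 10; N1 = 5; k = 1:N-1;                 %skip k=0 (the 0/0 case)
linearPhase = -k*pi/N*(N1-1);              %phase of the exponential factor
ratio = sin(k*N1*pi/N)./sin(k*pi/N);       %real factor, which can be negative
combinedPhase = angle(sign(ratio).*exp(1j*linearPhase)); %sign flips add +-pi
Xk = (1/N)*ratio.*exp(-1j*k*pi/N*(N1-1));  %Eq. (2.65)
max(abs(combinedPhase - angle(Xk)))        %should be zero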

(a) N1 = 4    (b) N1 = 3    (c) N1 = 2    (d) N1 = 1
Figure 2.44: Behavior when N1 of Figure 2.43 is decreased from N1 = 4 to 1.

Figure 2.44 shows the behavior of the DTFS of x[n] when N1 of Figure 2.43 is decreased from 4 to 1. Note the duality between time and frequency: as the pulse duty cycle gets smaller (smaller N1), the spectrum gets wider, with smaller maximum values. When N1 = 1, |X[k]| = N1/N = 0.05 for all k. Also, N1 dictates the number of nulls in the spectrum, while N controls how the unit circle e^{jΩ} is sampled by the DFT.

Application 2.13. Calculating the DTFT of a pulse via the DFT. Following steps similar to the ones that led to Eq. (2.65), the DTFT pair of a pulse is given by

x[n] = 1 for 0 ≤ n ≤ N1 − 1 and 0 otherwise  ⟷  X(e^{jΩ}) = [sin(ΩN1/2) / sin(Ω/2)] e^{−jΩ(N1−1)/2}    (2.66)

Listing 2.11 illustrates how to obtain samples of the DTFT for a pulse with N1 = 5 non-zero samples using a DFT of N = 20 points.

Listing 2.11: MatlabOctaveCodeSnippets/snip_transforms_DTFT_pulse.m. [ Python version]
N=20; %DFT size
N1=5; %num. non-zero samples in the (aperiodic) pulse
x=[ones(1,N1) zeros(1,N-N1)]; %signal x[n]
N2=512; %num. samples of the DTFT theoretical expression
k=0:N2-1; %indices of freq. components
wk = k*2*pi/N2; %angular frequencies
Xk_fft = fft(x); %use N-point DFT to sample DTFT
Xk_theory=(sin(wk*N1/2)./sin(wk/2)).*exp(-j*wk*(N1-1)/2);
Xk_theory(1) = N1; %take care of 0/0 (NaN) at k=0
subplot(211); stem(2*pi*(0:N-1)/N,abs(Xk_fft));
hold on, plot(wk,abs(Xk_theory),'r');
subplot(212); stem(2*pi*(0:N-1)/N,angle(Xk_fft));
hold on, plot(wk,angle(Xk_theory),'r');
  

The code plots the DTFT superimposed on the DFT result, as illustrated in Figure 2.45.


Figure 2.45: DTFT of an aperiodic pulse with N1 = 5 non-zero samples and DTFT estimates obtained via a DFT of N = 20 points.

As stated by Eq. (2.35), Figure 2.45 pictorially illustrates that the DFT values at the discrete frequencies Ωk = k(2π/N) coincide with the DTFT of the aperiodic pulse x[n] calculated at these frequencies using Eq. (2.66).


Figure 2.46: A version of Figure 2.45 using a DFT of N = 256 points.

The DTFT can be calculated on a grid of frequencies with arbitrary resolution by increasing the number N of DFT points. For example, using N = 256 leads to the spectrum in Figure 2.46. Note that if x has fewer than N samples, the command fft(x,N) conveniently extends its duration with zero-padding.
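A quick check that fft(x,N) indeed zero-pads (a sketch reusing the 20-sample pulse of Listing 2.11):

x = [ones(1,5) zeros(1,15)];  %the 20-sample pulse of Listing 2.11
maxDiff = max(abs(fft(x,256) - fft([x zeros(1,256-length(x))]))) %should be 0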

Application 2.14. Calculating the DTFT using Matlab/Octave’s freqz. When N is relatively large, it is more convenient to use plot instead of stem (as in Figure 1.19), in spite of X[k] being discrete. This is also convenient when X[k] is used to represent values of X(e^{jΩ}), as in this example. Matlab/Octave has the command freqz, which uses the DFT to calculate the DTFT at a grid of discrete frequencies. Assuming the same vector x of the previous code snippet, the command freqz(x) generates Figure 2.47.


Figure 2.47: A version of Figure 2.45 using freqz with 512 points representing only the positive part of the spectrum.


Figure 2.48: Reproducing the graphs generated by freqz in Figure 2.47.

To better understand how freqz works, Listing 2.12 mimics freqz and was used to obtain Figure 2.48.

Listing 2.12: MatlabOctaveCodeSnippets/snip_transforms_freqz.m. [ Python version]
N1=5; %num. non-zero samples in the (aperiodic) pulse
x=[ones(1,N1) zeros(1,2)]; %signal x[n]
halfN=512; N=2*halfN; %half of the DFT dimension and N
k=0:N-1; %indices of freq. components
wk = k*2*pi/N; %angular frequencies
Xk_fft=fft(x,N); %use N-point DFT to get halfN positive freqs.
Xk_fft=Xk_fft(1:halfN); %discard negative freqs.
wk=(0:halfN-1)*(2*pi/N); %positive freq. grid
subplot(211); plot(wk,20*log10(abs(Xk_fft))); %in dB
subplot(212); plot(wk,angle(Xk_fft)*180/pi); %in degrees
  

It can be seen that Figure 2.48 is very similar to Figure 2.47. The only distinction is that, by default, Matlab/Octave labels the abscissa with Ω normalized by π, such that the maximum frequency is not π but 1. It is interesting to note that Matlab/Octave and GNU Radio, for example, indicate Ω in rad/sample, while this and other texts assume Ω is given in radians (n is dimensionless).