Digital Tracking Loops for a Programmable Digital Modem

John J. Poklemba
COMSAT Laboratories
Clarksburg, Maryland

Prepared for
Lewis Research Center
Under Contract NAS3-25715
Digital Tracking Loops for a Programmable Digital Modem

John J. Poklemba
COMSAT Laboratories
Clarksburg, MD 20871-9475

ABSTRACT

In this paper, an analysis and hardware emulation of the tracking loops for a very flexible programmable digital modem (PDM) will be presented. The modem is capable of being programmed for 2, 4, 8, 16-PSK, 16-QAM, MSK, and Offset-QPSK modulation schemes over a range of data rates from 2.34 to 300 Mbps with programmable spectral occupancy from 1.2 to 1.8 times the symbol rate; and these operational parameters are executable in burst or continuous mode. All of the critical processing in both the modulator and demodulator is done at baseband with very high-speed digital hardware and memory. Quadrature analog front-ends are used for translation between baseband and the IF center frequency.

The modulator is based on a table lookup approach, where precomputed samples are stored in memory and clocked out according to the incoming data pattern. The sample values are predistorted to counteract the effects of the other filtering functions in the link as well as any transmission impairments.

The demodulator architecture has been adapted from a joint estimator-detector (JED) mathematical analysis. Its structure is applicable to most signalling formats that can be represented in a two-dimensional space. The JED realization uses interdependent, mutually aiding tracking loops with post-detection data feedback. To expedite and provide for more reliable synchronization, initial estimates for these loops are computed in a parallel acquisition processor. The cornerstone of the demodulator realization is the pre-averager receive data filter which allows operation over a broad range of data rates without any hardware changes, and greatly simplifies the implementation complexity.
The emulation results confirmed tracking loop operation over the entire range of operational parameters listed above, as well as the capability of achieving and maintaining synchronization at BERs in excess of $10^{-1}$. The emulation results also showed very close agreement with the tracking loop analysis, and validated the resolution apportionment of the various hardware elements in the tracking loops.

INTRODUCTION

In 1977, a generic demodulator structure for digital data transmission was derived from multidimensional detection and estimation theory, and successfully implemented with primarily analog hardware. [1] Six years later a patent, "Concurrent Carrier and Clock Synchronization (CCCS) for Data Transmission Systems" was granted on its implementation; and the following year, 1984, a paper, "A Joint Estimator-Detector (JED) for QPSK Data Transmission" was published on its theory. [2,3] The JED structure is based on a mathematical description of amplitude modulated carriers in phase-quadrature. As a result, it is applicable to many widely used, filtered signal constellations that can be represented in a two-dimensional space; for example: M-ary PSK, M-ary QAM, MSK, Offset-QPSK, etc. [4]

The detection and estimation operations are interdependent, such that all of the post-detected values and estimates developed coherently aid one another. The upshots of this coherency are: 1) negligible degradation in the bit error rate (BER) for nominal tracking loop bandwidths, 2) the post-detected data feedback used in the estimator loop phase detectors results in less output jitter and cycle skipping at a given SNR than other well known techniques, and 3) the loops can achieve and maintain synchronization at BERs in excess of $10^{-1}$.

Over the years, numerous realizations of the JED approach, both analog and digital, have been built and tested. They have primarily been QPSK implementations at data rates ranging from 64 Kbps to 20 Mbps. The subject version has been designed for 2, 4, 8, 16-PSK, 16-QAM, MSK, and Offset-QPSK schemes over a range of symbol rates from 2.34 to 75 Msps. The transmission rate for the 16-ary modulation then can be as high as 300 Mbps.
An extensive hardware emulation program has been written to evaluate the hardware design. All of the performance results which will be presented have been taken from the emulations. The hardware is currently being fabricated, so no measurements are yet available.

MODEM IMPLEMENTATION

The modulator and demodulator conceptual architectures have been described in detail in reference 5, so they will only be outlined in this paper to provide continuity in analyzing the tracking loops. Both architectures were chosen to provide the maximum practical flexibility consistent with the modulation techniques, data rates, operational modes, and spectral occupancies required.

MODULATOR

To achieve the greatest degree of programmability, the critical processing elements in the modulator architecture are digital. The output of the digital processing are in-phase and quadrature (I and Q) baseband sample sequences. These are converted into the analog domain at baseband and mixed with quadrature carriers to an IF. A block diagram of the modulator is shown in Figure 1 and a brief description of its functionality follows.

Many of the most widely used modulation techniques can be represented in a two-dimensional in-phase and quadrature (I and Q) format, for example: M-ary PSK, M-ary QAM, MSK, etc. Therefore, a quadrature data mapping (QDM) between "ABCD" and I, Q is necessary prior to spectral shaping.

Nyquist bandlimiting of the I, Q digital data is most easily realized in the modulator hardware by means of a table lookup. [6] This is because the signal input for bandlimiting has only a few deterministic levels, whereas a true filter would have to operate on signals and noise which exist over a range of values.
Figure 1. Modulator Block Diagram
Multirate Operation

To provide maximum flexibility, it's desirable to use a single analog replication-removal (RR) filter for the broadest practical range of data rates. This implies that the first sample-rate-replicated spectral lobe of the data always be above a given cutoff frequency. This is accommodated by keeping the sample rate sufficiently high such that the first replica remains above the RR filter cutoff. In other words, as the data symbol rate is lowered, the number of samples per symbol must be increased proportionally. However, this technique is more easily implemented in discrete sample-per-symbol (s/s) steps rather than continuously, and henceforth will be referred to as "gear shifting". A convenient and practical size for each step is an octave in data rate. The ranges of data rates for this design are listed in Table 1. The table only goes up to 32 s/s because of transmit spectral shaping size limitations.

<table>
<thead>
<tr>
<th>Symbol Rate Range (Mps)</th>
<th>Samples/Symbol</th>
</tr>
</thead>
<tbody>
<tr>
<td>37.5 - 75</td>
<td>2</td>
</tr>
<tr>
<td>18.75 - 37.5</td>
<td>4</td>
</tr>
<tr>
<td>9.375 - 18.75</td>
<td>8</td>
</tr>
<tr>
<td>4.6875 - 9.375</td>
<td>16</td>
</tr>
<tr>
<td>2.34375 - 4.6875</td>
<td>32</td>
</tr>
</tbody>
</table>

For all the modulation techniques listed, 8-bits are sufficient to yield better than 40 dB of spectral quantization noise. To remove the replicated spectra due to sampling, an elliptic lowpass filter is used. It has a 0.2 dB equiripple passband extending from DC to 48 Mhz (64% of the maximum symbol rate) with a stopband beginning at 60 Mhz of minimum attenuation greater than 30 dB. The sample-and-hold effect of the D/A provides additional filtering to suppress the sample clock replications below 40 dB. To avoid implementing additional analog hardware, group delay dispersion in the replication-removal filter will be compensated in the modulator table lookup. In addition, the transmit spectral shaping will be predistorted to account for IF and anti-aliasing filters, as well as transmission link impairments.
DEMODULATOR

Like the modulator, to achieve the greatest degree of programmability the critical processing elements in the demodulator architecture will also be digital; and quadrature down-conversion as well as baseband sampling will be used in the demodulator front-end.

The anti-aliasing (AA) filter requirements for limiting the incoming bandwidth prior to A/D conversion are very comparable to those for the replication-removal (RR) filter in the modulator. Hence, both filters will be designed with identical passband, transition-band, and stopband parameters.

A general block diagram of the demodulator is shown in Figure 2. Fundamentally, it consists of in-phase and quadrature (I & Q) detection paths and three estimation or tracking loops, namely, AGC, carrier phase, and symbol timing. As indicated in the figure, the detection and estimation operations are interdependent, such that all of the estimates developed coherently aid one another. Pre-averaging receive data filters in both the I and Q channels are critical up-front digital processing elements.

For burst mode operation, a parallel acquisition estimate processor is employed to develop initial tracking loop startup values to expedite synchronization and minimize the probability of PLL hangup. [7, 8, 9] Its estimates are injected directly into the digital tracking loop accumulators.

The analog front-end converts the incoming modulated RF signal down to baseband with a fixed local oscillator, such that a carrier offset beat frequency is superimposed on the baseband I and Q channels. Therefore, after A/D conversion the I and Q paths are passed through a carrier phase rotator to remove the beat, thereby simplifying the subsequent processing. The A/D sample clock is always an integer-multiple of the recovered symbol clock. Hence, its phasing is properly aligned with the detection and transition sample points.
Figure 2. Demodulator Block Diagram
Detection Path Processing

There are three major processing blocks in the I, Q data detection paths: the pre-averager (PA) receive data filter, the carrier phase rotator (CPR), and the quadrature channel detection mapper (QCDM). The detection path is aided by the tracking loop estimates in the following ways. The arithmetic is fixed point, so to avoid overflow the amplitude level estimate is used to control the gain of the incoming signal. Automatic gain-control (AGC) is also important when the modulation technique has an amplitude level dependency, such as QAM. The carrier phase estimate simply rotates out the carrier beat. The symbol timing phase estimate is embedded in the sample clock at the A/D converter. Hence, the detection path samples are phase-coherent at their inception.

Pre-Averager Receive Data Filter

The PA performs three very important functions in the demodulator:

1) it provides for a very broad range of data rate operation without any hardware changes (typically a factor of 32),

2) it functions as a receive data filter and limits the single-sided incoming bandwidth to $R_s/2$,

3) and it decimates the sample rate to 1 s/s for subsequent processing. [10]

One patent has been granted on its basic conceptual implementation, and another is pending on its extension to provide more selective filtering. [11] For the pre-averager realization used, the effective impulse response aperture length was two symbols wide, and the output sample rate was 1 s/s in the data detection and transition sample paths.

Notation

The notation that will be used throughout this paper is listed in Table 2. The $2k$ and $2k-1$ (even and odd) subscripts denote sample values at the data detection and transition timing instants, respectively; whereas the $k$ subscripts refer to all sample points. Hats denote estimated or detected values.
Table 2. Notation

<table>
<thead>
<tr>
<th>Symbol</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>$I'_k$</td>
<td>Incoming baseband in-phase samples</td>
</tr>
<tr>
<td>$Q'_k$</td>
<td>Incoming baseband quadrature samples</td>
</tr>
<tr>
<td>$A_k$</td>
<td>Unknown incoming amplitude level</td>
</tr>
<tr>
<td>$\theta_k$</td>
<td>Random incoming carrier phase</td>
</tr>
<tr>
<td>$\tau_k$</td>
<td>Random incoming symbol time</td>
</tr>
<tr>
<td>$I_{2k}$</td>
<td>Post carrier phase rotated in-phase &quot;detection&quot; samples</td>
</tr>
<tr>
<td>$Q_{2k}$</td>
<td>Post carrier phase rotated quadrature &quot;detection&quot; samples</td>
</tr>
<tr>
<td>$I_{2k-1}$</td>
<td>Post carrier phase rotated in-phase &quot;transition&quot; samples</td>
</tr>
<tr>
<td>$Q_{2k-1}$</td>
<td>Post carrier phase rotated quadrature &quot;transition&quot; samples</td>
</tr>
<tr>
<td>$i_{2k}$</td>
<td>In-phase post-detected data feedback samples</td>
</tr>
<tr>
<td>$\hat{Q}_{2k}$</td>
<td>Quadrature post-detected data feedback samples</td>
</tr>
<tr>
<td>$\Delta i_{2k-1}$</td>
<td>In-phase post-detected data feedback difference samples</td>
</tr>
<tr>
<td>$\Delta \hat{Q}_{2k-1}$</td>
<td>Quadrature post-detected data feedback difference samples</td>
</tr>
<tr>
<td>$\Delta A_0$</td>
<td>Initial amplitude level offset estimate</td>
</tr>
<tr>
<td>$\hat{\theta}_0$</td>
<td>Initial carrier phase estimate</td>
</tr>
<tr>
<td>$\Delta \hat{\theta}_0$</td>
<td>Initial carrier Frequency estimate</td>
</tr>
<tr>
<td>$\hat{\tau}_0$</td>
<td>Initial symbol timing estimate</td>
</tr>
<tr>
<td>$\hat{A}$</td>
<td>Amplitude reference level</td>
</tr>
<tr>
<td>$\Delta A_{2k}$</td>
<td>Amplitude level error</td>
</tr>
<tr>
<td>$\Delta \theta_{2k}$</td>
<td>Carrier phase error</td>
</tr>
<tr>
<td>$\Delta \tau_{2k-1}$</td>
<td>Symbol timing error</td>
</tr>
<tr>
<td>$K_d$</td>
<td>Carrier and Clock phase detector gain constant</td>
</tr>
<tr>
<td>$K_A$</td>
<td>AGC loop gain constant</td>
</tr>
<tr>
<td>$K_\theta$</td>
<td>Carrier phase gain constant</td>
</tr>
<tr>
<td>$K_{\Delta \theta}$</td>
<td>Carrier frequency gain constant</td>
</tr>
<tr>
<td>$K_\tau$</td>
<td>Symbol timing loop gain constant</td>
</tr>
<tr>
<td>$\Delta \hat{A}_{2k}$</td>
<td>Amplitude level offset output estimate</td>
</tr>
<tr>
<td>$\hat{\theta}_{2k}$</td>
<td>Carrier phase output estimate</td>
</tr>
<tr>
<td>$\Delta \hat{\theta}_{2k}$</td>
<td>Carrier Frequency output estimate</td>
</tr>
<tr>
<td>$\hat{\tau}_{2k-1}$</td>
<td>Symbol timing output estimate</td>
</tr>
</tbody>
</table>
Carrier Phase Rotator

The carrier phase rotator (CPR) removes the frequency beat in the I and Q channels resulting from the use of a fixed carrier LO, frequency translation. Prior to the CPR the I and Q channels may be described by

\[ I'_k = A_k [i_k \cos(\theta_k) + q_k \sin(\theta_k)] \]  
(1a)
\[ Q'_k = A_k [q_k \cos(\theta_k) - i_k \sin(\theta_k)] \]  
(1b)

If the carrier phase tracking is sufficiently accurate, the frequency beat is taken out by performing the two 1x2 matrix multiplications of equations (2a, b).

\[ I_k = \begin{bmatrix} \cos(\hat{\theta}_k) & -\sin(\hat{\theta}_k) \end{bmatrix} \begin{bmatrix} I'_k \end{bmatrix} \]
\[ = A_k [i_k \cos(\theta_k-\hat{\theta}_k) + q_k \sin(\theta_k-\hat{\theta}_k)] = A_k [i_k] \]  
(2a)
\[ Q_k = \begin{bmatrix} \sin(\hat{\theta}_k) & \cos(\hat{\theta}_k) \end{bmatrix} \begin{bmatrix} Q'_k \end{bmatrix} \]
\[ = A_k [q_k \cos(\theta_k-\hat{\theta}_k) - i_k \sin(\theta_k-\hat{\theta}_k)] = A_k [q_k] \]  
(2b)

Quadrature Channel Detection Mapper

The quadrature channel detection mapper (QCDM) performs the inverse operation of its counterpart in the modulator, but at a much higher resolution because of the inclusion of noise in the demodulator. In general, the QCDM maps the post-CPR baseband I and Q channels into as many as four parallel data streams (ABCD) for 16-ary signalling. For PSK modulation techniques, the decision boundaries are radial lines that bisect the distance between adjacent signals in the constellation (assuming equally likely a-priori signal probabilities); whereas for QAM, the boundaries form an amplitude-dependent grid pattern. The different mappings are most easily implemented as a table lookup.
Tracking Loop Analysis

To facilitate coherent detection of the digital data, three tracking loops are typically necessary: amplitude level or automatic gain control (AGC), carrier phase, and symbol timing. In each loop there is a nominal operating setting, for example, an amplitude reference level or a numerically-controlled oscillator (NCO) center frequency. An error signal is added to the nominal settings in each of the three loops to make corrections for tracking. The error signals were derived in reference 3 for QPSK modulation, but are readily extendable to most two-dimensional signalling formats. These include such popular modulation techniques as 2-, 4-, 8-, 16-PSK, MSK, Offset-QPSK, and 16-QAM. The three digitally implemented, interdependent tracking loops will now be analyzed.

AGC Tracking Loop

For signal constellations that can be placed on a common power circle such as the PSK and MSK formats listed above, AGC isn't explicitly necessary for the detection process because the decision boundaries are radial lines which aren't amplitude dependent. However, the error signal detector gain in all three tracking loops scales with the incoming signal level; so the incoming amplitude level affects their open loop gains and closed loop bandwidths. In addition, with a fixed-point arithmetic digital implementation, the maximum sample value excursions must be limited to prevent overflow.

Incoming gain or level fluctuations are generally extremely slow relative to the data rate. Hence, for most applications a first order tracking loop is sufficient; whereas for operation with relatively short bursts and a sufficiently accurate acquisition estimate, the estimate can be held for the duration of the burst and no AGC tracking is necessary. [7, 12]

A block diagram for a gain parameter model of the AGC tracking loop is depicted in Figure 3. It consists of a gain-controlled amplifier, amplitude level error detector, and a multiplier-accumulator (MAC) to provide filtering. The input data and error detector multiplier input resolutions were 8-bits. The accumulator resolution was 24-bits of which the 8 most significant were used to control the gain.
Figure 3. AGC Tracking Loop Model
AGC Error Detector

The AGC tracking error signal is defined in equation (3);

\[ \Delta A_{2k} = (i_{2k} \hat{i}_{2k} + q_{2k} \hat{q}_{2k}) - \hat{A} \]

\[ = A_{2k} (i_{2k} \hat{i}_{2k} + q_{2k} \hat{q}_{2k}) - \hat{A} \]  \hspace{1cm} (3)

with \( \hat{A} = (i_{2k} \hat{i}_{2k} + q_{2k} \hat{q}_{2k}) \). When the signals all reside on the same power circle as with the PSK modulation family, \( \hat{A} \) is a constant; but for constellations such as 16-QAM, \( \hat{A} \) takes on three values as a function of the post-detection data feedback.

AGC Closed Loop Transfer Function

Dropping the subscripts for clarity, the amplifier gain may be expressed in terms of the amplitude level error output estimate as

\[ G(\Delta \hat{A}) = G_{\text{nom}} \hat{A} / (\hat{A} + \Delta \hat{A}) \], such that \( \Delta \hat{A} > -\hat{A} \)  \hspace{1cm} (4)

Note when \( \Delta A = 0 \), the amplifier is set to its nominal gain.

The MAC transfer function may be written in terms of its z-transform as

\[ \Delta \hat{A} = K_A \Delta A / (z-1) \]  \hspace{1cm} (5)

where \( \Delta A = G(\Delta \hat{A}) A - \hat{A} \). Substituting for \( \Delta A \) into equation (5) gives

\[ \Delta \hat{A} = K_A [G(\Delta \hat{A}) A - \hat{A}] / (z-1) \]

\[ = K_A \hat{A} [G_{\text{nom}} A - (\hat{A} + \Delta \hat{A})] / [(z-1) (\hat{A} + \Delta \hat{A})] \]  \hspace{1cm} (6)

Equation (6) can be rewritten in standard second order polynomial format as

\[ \Delta \hat{A}^2 (z-1) + \Delta \hat{A} \{ A[ (z-1) + K_A ] + K_A \hat{A} (\hat{A} - G_{\text{nom}} A) \} = 0 \]

\[ \Delta \hat{A}^2 (a) + \Delta \hat{A} (b) + (c) = 0 \]  \hspace{1cm} (7)

The roots of equation (7) may be found from

\[ \Delta \hat{A} = -b (1 \pm \sqrt{1 - 4ac / b^2}) / 2a \]

\[ = -b \{ 1 \pm [1 - 2(ac / b^2) + 2(ac / b^2) - \ldots ] \} / 2a \]

\[ = -c / b \]  \hspace{1cm} (8)
where only the first term in the expansion has been used since $b^2 >> ac$. The resultant transfer function is then

$$\Delta \hat{A} = K_A \left( G(\Delta \hat{A}) A - \hat{A} \right) / \left[ z + (K_A-1) \right]$$

(9)

A great deal is known about setting the closed loop parameters to maintain stability and the desired transient response for analog tracking loops. [13] Hence, the impulse invariance technique will be used to find the corresponding digital loop roots. The conversion is related as

$$z_{root} = \exp(-\omega AT_s) = -(K_A-1)$$

(10)

The loop gain constant is then determined by

$$K_A = 1 - \exp(-\omega AT_s)$$

$$= 1 - (1 - \omega AT_s)$$

$$= \omega AT_s$$

$$= 4(B_A/R_s)$$

(11)

where $\omega_A = 4B_A$ and $T_s = 1/R_s$. The AGC closed loop time constant in symbol time units is then

$$(\tau_A/T_s) = (1/4) (R_s/B_A)$$

(12)

with $\tau_A = 1/\omega_A$.

Symbol Timing Tracking Loop

Like the AGC loop, the symbol timing loop is first order and requires only a single phase accumulator. Since a numerically-controlled oscillator (NCO) is used to reconstruct the clock, its inherent phase accumulator serves as the loop filter. A block diagram of the loop is given in Figure 4. Note that the loop also contains a multiplicative gain, $M$ before the NCO, as well as a divide-by-$N$ gain after the NCO. The multiplicative gain comes about because the NCO input sample clock is higher than the incoming sample rate. Thus, incoming samples are typically accumulated more than one-for-one. Also, since the NCO output is at the sample rate frequency, $NR_s$, there is a divide-by-$N$ in the loop. The input data and error detector multiplier input resolutions were 8-bits. The timing phase accumulator resolution was 24-bits of which the 10 most significant were mapped into sinusoidal sample values. After
Symbol Timing Phase Error Detector

The symbol timing phase error signal is defined in equation (13);

\[ \Delta \tau_{2k-1} = I_{2k-1} \Delta I_{2k-1} + Q_{2k-1} \Delta Q_{2k-1} \]

\[ = A_{2k-1} ( i_{2k-1} \Delta I_{2k-1} + q_{2k-1} \Delta Q_{2k-1} ) \]  (13)

where \( \Delta I_{2k-1} = ( I_{2k} - I_{2k-2} ) \). If no data transitions occur, the symbol timing phase detector output is zero. The loop gain and bandwidth are then effectively zero, and the loop time constant is extremely long. Moreover, the best the loop can do in the absence of data transition information is to hold its last accumulated error value and coast. For multilevel modulation techniques, the error signal is weighted in proportion to the differential magnitude of the data transition.

Symbol Timing Closed Loop Transfer Function

The NCO transfer function may be written in terms of its z-transform as

\[ \hat{\tau} = (M/N) K_\tau \Delta \tau / (z-1); \]  (14)

where \( \Delta \tau = K_d (\tau - \hat{\tau}) \). To avoid using an extra multiplier, the loop gain parameter, \( K_\tau \) is incremented in powers-of-two by shifting the bits coming in to the NCO up or down. The phase detector gain, \( K_d \) is proportional to the incoming signal amplitude, \( A_{2k-1} \), and the number as well as the magnitude of the transitions in the data pattern. For example, for QPSK with "10..." data patterns in both the I and Q channels, \( K_d \) is maximized; whereas with PN data, \( K_d \) would be half of that maximum value.

Substituting for \( \Delta \tau \) in equation (14) gives

\[ \hat{\tau} = (M/N) K_\tau K_d (\tau - \hat{\tau}) / (z-1) \]  (15)

Equation (15) may be written in standard form as

\[ \hat{\tau} = (M/N) K_\tau K_d \tau / [z + ((M/N) K_\tau K_d - 1)] \]  (16)

Again, the impulse invariance conversion is related as

\[ \exp(-\omega_c T_s) = -((M/N) K_\tau K_d - 1); \]  (17)
and this yields

\[ K_T = \frac{4}{K_d} \frac{(N/M)}{(B_\tau/R_s)} \]

(18)

with the symbol timing closed loop time constant in symbol time units as

\[ (\tau_c/\tau_s) = \frac{(K_d/4)}{(M/N)} \frac{(R_s/B_\tau)}{\tau_s} \]

(19)

**Carrier Phase Tracking Loop**

The carrier phase loop is second order so it can track frequency as well as phase variations. A block diagram of the loop is depicted in Figure 5. It consists of a phase detector, two MACs (one for phase and one for frequency), a quadrature component generator, and a carrier phase rotator. As before, the input and multiplicative processing uses 8-bit resolution, whereas the phase and frequency accumulators employ 24-bits. The quadrature component generator table lookup uses the most significant 10-bits from the output phase estimate accumulator.

**Carrier Phase Error Detector**

The carrier phase detector response is described by

\[ \Delta \theta_{2k} = I_{2k} \hat{Q}_{2k} - Q_{2k} \hat{I}_{2k} \]

\[ = A_{2k} \left( i_{2k} \hat{Q}_{2k} - q_{2k} \hat{I}_{2k} \right) \]

\[ = K_d \sin(\theta_{2k} - \hat{\theta}_{2k}) \]

(20)

When the phase difference is sufficiently small, linear tracking results

\[ \Delta \theta_{2k} = K_d (\theta_{2k} - \hat{\theta}_{2k}) \]

(21)

**Carrier Phase Closed Loop Transfer Function**

The z-transform for the two accumulators is

\[ \hat{\theta} = [K_{\theta} \Delta \theta/(z-1) + K_{\Delta \theta} \Delta \theta/(z-1)^2] \]

(22)

Substituting in equation (21) gives

\[ \hat{\theta} = [K_{\theta} K_d (\theta - \hat{\theta})/(z-1) + K_{\Delta \theta} K_d (\theta - \hat{\theta})/(z-1)^2] \]

(23)
D/A conversion and lowpass filtering, the sinusoidal samples were passed through a limiter to produce the properly phased clock waveform.
Figure 5. Carrier Phase Tracking Loop Model
Grouping terms and rewriting equation (23) into a transfer function format yields

\[ \frac{\dot{\theta}/\theta = [K_0 K_d (z-1) + K_{\Delta \theta} K_d]}{[(z-1)^2 + K_0 K_d (z-1) + K_{\Delta \theta} K_d]} \]

\[ = K_d [K_0 (z-1) + K_{\Delta \theta} ] / [z^2 + z (K_0 K_d - 2) + K_d (K_{\Delta \theta} - K_0) + 1] \] (24)

Because the analysis of analog tracking loops is well known, the discrete-time response of equation (24) will be related to the analog transient response as follows. In standard second order notation, the complex roots in the analog s-domain are

\[ s = -\omega_n (\zeta \pm j \sqrt{1-\zeta^2}) \] (25)

To preserve the closed loop phase stability predicted by the s-plane analysis, the impulse invariance transform of equation (10) is combined with equation (25) to give

\[ z_{\text{roots}} = -\exp [-\omega_n T_s (\zeta \pm j \sqrt{1-\zeta^2})] \] (26)

Expanding equation (26) and equating it with the denominator of equation (24) yields two simultaneous equations for setting the carrier tracking loop parameters

\[ K_d (K_{\Delta \theta} - K_0) + 1 = \exp (-2\omega_0 T_s) \] (27a)
\[ K_0 K_d - 2 = -2 \exp (\omega_0 T_s) \cos (\omega_0 T_s) \cos (\omega_0 T_s (1 - \zeta^2)) \] (27b)

These equations may be expanded in series form as

\[ K_d (K_{\Delta \theta} - K_0) + 1 = 1 - (2\omega_0 T_s) + (2\omega_0 T_s)^2/2 - \ldots \] (28a)
\[ K_0 K_d - 2 = -2 [1 - (\omega_0 T_s) + \omega_0 T_s (1 - \zeta^2)^2/2 - \ldots] [1 - (\omega_0 T_s)^2 (1 - \zeta^2)/2 + \ldots] \] (28b)

Recall that the sampling rate of these loops is one sample/symbol, and the closed loop bandwidths typically average sample values over hundreds of symbol times. Hence, \(2\omega_0 T_s \ll 1\), so equations (28) may be approximated with lower order terms as

\[ K_d K_{\Delta \theta} = (\omega_0 T_s)^2 \] (29a)
\[ K_d K_0 = 2\omega_0 T_s \] (29b)
For a second order tracking loop, the carrier noise equivalent bandwidth, $B_0$ is equated with the natural frequency, $\omega_0$ by

$$B_0 = \left[\zeta + 1/(4\zeta)\right] \omega_0 / 2$$

$$= 0.5303 \omega_0$$  \hspace{1cm} (30)

when the damping ratio, $\zeta = 1/\sqrt{2}$. Then, equations (29) can be related in terms of the normalized ratio of symbol rate to bandwidth as

$$K_{\Delta \theta} = (3.556/K_d) (B_0/R_s)^2$$ \hspace{1cm} (31a)

$$K_{\phi} = (2.667/K_d) (B_0/R_s)$$ \hspace{1cm} (31b)

EMULATION RESULTS

The transient responses of the three tracking loops was emulated as a means for validating the analysis. The tracking loop output estimate frequency domain quantization noise was also emulated to verify the hardware bit resolutions selected. Finally, the acquisition and tracking behavior of the loops with additive white Gaussian noise (AWGN) was evaluated at very low SNR. Although the emulations were run for all of the two-dimensional modulation techniques described in the introductory section, space limitations dictate that only the QPSK results will be presented.

TRANSIENT RESPONSE

To evaluate the transient response of the tracking loops, the acquisition estimate injection was disabled. The first order AGC tracking loop has a normalized time constant of $R_s/4B_A$ symbol times, whereas the symbol timing loop is $R_s/2B_T$. Their respective transient responses are shown in Figures 6a, b for normalized design time constants of 75 and 150, respectively. The marks on the horizontal axes indicate the measured emulation time constants of approximately 80 and 130 symbol times. The agreement is actually very close considering that only 7-bits are available to quantize the gain coefficients over a relatively broad range of $R_s/B$. In addition, the AGC loop analysis was non-linear, and the symbol timing loop transient is pattern dependent; such that it is faster or slower depending on whether the probability for the localized groups of data transitions is greater or less than 0.5, respectively.
Figure 6a. AGC Tracking Loop Transient with Acquisition Estimate Disabled

Figure 6b. Symbol Timing Tracking Loop Transient with Acquisition Estimate Disabled
The second order carrier tracking loop transient is depicted in Figure 7. With the damping ratio, \( \zeta = 1/\sqrt{2} \) the overshoot should be about 22% or 10\(^\circ\), and the first zero crossing of the transient should occur at 0.583 \((R_s/B_0) = 175\) symbol times. The emulated values in the figure agree quite precisely with these predictions which are based on the analysis.

![Graph of Carrier Phase Detector Output and Carrier Tracking Loop Output](image)

**Figure 7. Carrier Phase Tracking Loop Transient with Acquisition Estimate Disabled**

**OUTPUT ESTIMATE QUANTIZATION NOISE**

The worst case output estimate quantization noise is illustrated in Figures 8a, b, and c, for operation at the minimum symbol rate, 2.34375 Msps with no noise added. All three outputs are exceptionally pure and exhibit negligible quantization noise levels that are well below 40 dB.
Figure 8a. AGC Tracking Loop Output Phase Noise

Figure 8b. Carrier Phase Tracking Loop Output Phase Noise

Figure 8c. Symbol Timing Tracking Loop Output Phase Noise
MINIMUM SNR REQUIRED FOR ACQUISITION AND TRACKING

Tracking loop operation at low SNRs was evaluated by including AWGN in the emulation channel. The minimum SNR, rounded up to the nearest half dB, at which the demodulator could consistently acquire and track in burst mode is listed in Table 3. Because of the complexity of the emulation and the relatively long period of time it took to run a single data burst, these measurements were made on a burst-by-burst basis and as such cannot be directly related to a burst detection reliability.

Observe that the BERs are all on the order of $10^{-1}$. This is due to the effect of data feedback, where the tracking loop SNR drops by approximately 3 dB when the BER = $1.5 \times 10^{-1}$. That is, 85% of the feedback is correct, and 15% drives the loop in the wrong direction.

Table 3. Minimum SNR for Acquisition and Tracking

<table>
<thead>
<tr>
<th>Mod. Technique</th>
<th>$E_b/N_0$</th>
<th>BER</th>
<th>Preamble Length</th>
</tr>
</thead>
<tbody>
<tr>
<td>BPSK</td>
<td>0.0</td>
<td>1.0 $\times 10^{-1}$</td>
<td>32</td>
</tr>
<tr>
<td>QPSK</td>
<td>0.0</td>
<td>1.0 $\times 10^{-1}$</td>
<td>64</td>
</tr>
<tr>
<td>Offset-QPSK</td>
<td>1.5</td>
<td>1.2 $\times 10^{-1}$</td>
<td>64</td>
</tr>
<tr>
<td>8-PSK</td>
<td>4.0</td>
<td>1.3 $\times 10^{-1}$</td>
<td>96</td>
</tr>
<tr>
<td>16-QAM</td>
<td>2.5</td>
<td>1.6 $\times 10^{-1}$</td>
<td>112</td>
</tr>
<tr>
<td>16-PSK</td>
<td>5.0</td>
<td>2.2 $\times 10^{-1}$</td>
<td>144</td>
</tr>
</tbody>
</table>

The three time-domain tracking loop outputs are shown in Figures 9a, b, and c, for $E_b/N_0 = 0$ dB. Observe that there is little discernable noise on their outputs with the exception of the symbol timing, which has been amplified by a factor of 1024/96 relative to the actual design output level, because of limitations in the emulation run time.
Figure 9a. AGC Tracking Loop Output Estimate

Figure 9b. Carrier Phase Tracking Loop Output Estimate

Figure 9c. Symbol Timing Tracking Loop Output Estimate
CONCLUSIONS

The tracking loop architecture based on the JED approach does indeed perform very well for a broad class of modulation techniques. The analysis is quite accurate in setting the loop parameters as well as predicting their transient behavior. Both acquisition and tracking performance abruptly degrade at BERs on the order of $10^{-1}$ as a result of an effective reduction in loop SNR through errors in the data feedback. It was not possible to run emulations over a large number of bursts because a single burst required on the order of 10 minutes to process. Although the measurements presented in this paper were confined to QPSK signalling, the measured results were analogous for all of the two-dimensional modulation techniques mentioned in the introduction.

ACKNOWLEDGMENTS

The author would like to acknowledge J. Thomas who built the original JED-based demodulator in 1977 and made major contributions to the design and development of its counterpart, the digitally implemented modem (DIM), in 1982-84. The author would also like to recognize F. Faris and P. Chang who both made significant contributions to the emulation program.

REFERENCES


**Digital Tracking Loops for a Programmable Digital Model**

**John J. Poklemba**

**COMSAT Laboratories**
22300 Comsat Drive
Clarksburg, Maryland 20871

**National Aeronautics and Space Administration**
Lewis Research Center
Cleveland, Ohio 44135-3191

Prepared for the Milcom 1992 sponsored by the Institute of Electrical and Electronics Engineers, San Diego, California, October 11-14, 1992. Project Manager, James M. Budinger, Space Electronics Division, (216) 433-3496.

In this paper, an analysis and hardware emulation of the tracking loops for a very flexible programmable digital modem (PDM) will be presented. The modem is capable of being programmed for 2, 4, 8, 16-PSK, 16-QAM, MSK, and Offset-QPSK modulation schemes over a range of data rates from 2.34 to 300 Mbps with programmable spectral occupancy from 1.2 to 1.8 times the symbol rate; and these operational parameters are executable in burst or continuous mode. All of the critical processing in both the modulator and demodulator is done at baseband with very high-speed digital hardware and memory. Quadrature analog front-ends are used for translation between baseband and the IF center frequency. The modulator is based on a table lookup approach, where precomputed samples are stored in memory and docked out according to the incoming data pattern. The sample values are predistorted to counteract the effects of the other filtering functions in the link as well as any transmission impairments. The demodulator architecture has been adapted from a joint estimator-detector (JED) mathematical analysis. Its structure is applicable to most signalling formats that can be represented in a two-dimensional space. The JED realization uses interdependent, mutually aiding tracking loops with post-detection data feedback. To expedite and provide for more reliable synchronization, initial estimates for these loops are computed in a parallel acquisition processor. The cornerstone of the demodulator realization is the pre-averager received data filter which allows operation over a broad range of data rates without any hardware changes, and greatly simplifies the implementation complexity. The emulation results confirmed tracking loop operation over the entire range of operational parameters listed above, as well as the capability of achieving and maintaining synchronization at BERs in excess of $10^{-1}$. The emulation results also showed very close agreement with the tracking loop analysis, and validated the resolution apportionment of the various hardware elements in the tracking loops.

**ABSTRACT (Maximum 200 words)**

**SUBJECT TERMS**

**DISTRIBUTION/AVAILABILITY STATEMENT**
Unclassified - Unlimited
Subject Category 59

**DISTRIBUTION CODE**
A03

**NUMBER OF PAGES**
29

**PRICE CODE**
A03

**SECURITY CLASSIFICATION OF REPORT**
Unclassified

**SECURITY CLASSIFICATION OF THIS PAGE**
Unclassified

**SECURITY CLASSIFICATION OF ABSTRACT**
Unclassified

**LIMITATION OF ABSTRACT**
Standard Form 298 (Rev. 2-89)
Prescribed by ANSI Std. Z39-18
298-102