Low Power, Compact Charge Coupled Device Signal Processing System

P. W. Bosshart, D. D. Buss, W. L. Eversole,
C. R. Hewes, and D. J. Mayer

CONTRACT NAS1-14290
FEBRUARY 1980
Low Power, Compact Charge Coupled Device Signal Processing System

Texas Instruments Incorporated
Dallas, Texas

Prepared for
Langley Research Center
under Contract NAS1-14290

NASA Contractor Report 3215

National Aeronautics
and Space Administration
Scientific and Technical
Information Office
1980
# TABLE OF CONTENTS

<table>
<thead>
<tr>
<th>SECTION</th>
<th>INTRODUCTION</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>I</td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>A.</td>
<td>Frequency Domain Correlation</td>
<td>3</td>
</tr>
<tr>
<td>B.</td>
<td>Time Domain Correlation</td>
<td>8</td>
</tr>
<tr>
<td>II</td>
<td>32-POINT CCD CZT IC</td>
<td>14</td>
</tr>
<tr>
<td>A.</td>
<td>Chip Overview</td>
<td>14</td>
</tr>
<tr>
<td>B.</td>
<td>CCD Filter</td>
<td>22</td>
</tr>
<tr>
<td>C.</td>
<td>Operational Amplifiers</td>
<td>27</td>
</tr>
<tr>
<td>1.</td>
<td>Design</td>
<td>27</td>
</tr>
<tr>
<td>2.</td>
<td>Experimental Results</td>
<td>52</td>
</tr>
<tr>
<td>D.</td>
<td>Multiplying Digital-to-Analog Converters (MDACs)</td>
<td>59</td>
</tr>
<tr>
<td>1.</td>
<td>Design</td>
<td>59</td>
</tr>
<tr>
<td>2.</td>
<td>Experimental Results</td>
<td>71</td>
</tr>
<tr>
<td>E.</td>
<td>Read-Only Memory (ROM) and ROM Address</td>
<td>71</td>
</tr>
<tr>
<td>1.</td>
<td>Read-Only Memory</td>
<td>71</td>
</tr>
<tr>
<td>2.</td>
<td>Up/Down Ring Counter (ROM Address)</td>
<td>82</td>
</tr>
<tr>
<td>3.</td>
<td>Synchronization</td>
<td>87</td>
</tr>
<tr>
<td>F.</td>
<td>Timing Circuitry</td>
<td>98</td>
</tr>
<tr>
<td>1.</td>
<td>Clock Generators</td>
<td>98</td>
</tr>
<tr>
<td>2.</td>
<td>Experimental Results</td>
<td>109</td>
</tr>
<tr>
<td>G.</td>
<td>Experimental Results</td>
<td>114</td>
</tr>
<tr>
<td>H.</td>
<td>Applications</td>
<td>121</td>
</tr>
<tr>
<td>III</td>
<td>64-POINT CCD ANALOG/ANALOG CORRELATOR</td>
<td>125</td>
</tr>
<tr>
<td>A.</td>
<td>Design</td>
<td>125</td>
</tr>
<tr>
<td>B.</td>
<td>Performance Limitations in Design</td>
<td>129</td>
</tr>
<tr>
<td>1.</td>
<td>Multiplier Nonlinearity</td>
<td>129</td>
</tr>
<tr>
<td>2.</td>
<td>Offsets</td>
<td>132</td>
</tr>
<tr>
<td>3.</td>
<td>Input Stage Offsets and Nonlinearities</td>
<td>133</td>
</tr>
<tr>
<td>C.</td>
<td>Measured Performance</td>
<td>133</td>
</tr>
<tr>
<td>1.</td>
<td>CCD Signal Capacity</td>
<td>133</td>
</tr>
<tr>
<td>2.</td>
<td>Feedback Linearized Input Stage</td>
<td>134</td>
</tr>
</tbody>
</table>
### TABLE OF CONTENTS
(Continued)

<table>
<thead>
<tr>
<th>SECTION</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>3. Multiplier Linearity.</td>
<td>136</td>
</tr>
<tr>
<td>4. Offsets</td>
<td>140</td>
</tr>
<tr>
<td>5. Multiplier Scale Factor Matching.</td>
<td>148</td>
</tr>
<tr>
<td>6. Evaluation.</td>
<td>148</td>
</tr>
<tr>
<td>D. Directions for Future Designs.</td>
<td>150</td>
</tr>
<tr>
<td>1. System Organization</td>
<td>150</td>
</tr>
<tr>
<td>2. CCD Design.</td>
<td>152</td>
</tr>
<tr>
<td>3. Input Stage Design.</td>
<td>152</td>
</tr>
<tr>
<td>4. Buffer Design</td>
<td>152</td>
</tr>
<tr>
<td>5. Multipliers</td>
<td>153</td>
</tr>
<tr>
<td>E. Applications and Summary</td>
<td>153</td>
</tr>
<tr>
<td>1. Applications.</td>
<td>153</td>
</tr>
<tr>
<td>2. Summary</td>
<td>153</td>
</tr>
<tr>
<td>IV</td>
<td>CCD BINARY/ANALOG CORRELATOR.</td>
</tr>
<tr>
<td>A. Introduction</td>
<td>155</td>
</tr>
<tr>
<td>B. Binary/Analog Programmable CCD Correlator Technology</td>
<td>159</td>
</tr>
<tr>
<td>C. Application of Binary/Analog Correlator to Pattern Classification</td>
<td>171</td>
</tr>
<tr>
<td>D. Preliminary Design of Matrix Multiplier IC</td>
<td>174</td>
</tr>
<tr>
<td>E. Four-Sensor Demonstration Unit</td>
<td>177</td>
</tr>
<tr>
<td>V</td>
<td>CONCLUSIONS</td>
</tr>
<tr>
<td>REFERENCES</td>
<td>184</td>
</tr>
</tbody>
</table>

### LIST OF ILLUSTRATIONS

<table>
<thead>
<tr>
<th>FIGURE</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>Block Diagram of a Transversal Filter Showing Delay Stages D and Weighting Coefficients h_n, n = 0, N - 1.</td>
</tr>
<tr>
<td>2</td>
<td>Schematic of the CZT Algorithm.</td>
</tr>
<tr>
<td>3</td>
<td>Interpretation of the CZT in Terms of Chirp Input Waveforms in Chirp Filters.</td>
</tr>
<tr>
<td>4</td>
<td>Block Diagram of a CCD Programmable Correlator.</td>
</tr>
<tr>
<td>5</td>
<td>Block Diagram of the Analog/Analog Time Domain Correlator</td>
</tr>
<tr>
<td>6</td>
<td>Circuit Schematic of the Four-Quadrant Multiplier Used in the Analog/Analog Correlator of Figure 5.</td>
</tr>
<tr>
<td>FIGURE</td>
<td>LIST OF ILLUSTRATIONS</td>
</tr>
<tr>
<td>--------</td>
<td>-----------------------</td>
</tr>
<tr>
<td>7</td>
<td>Block Diagram of the Binary/Analog Correlator</td>
</tr>
<tr>
<td>8</td>
<td>Block Diagram of 32-Point CZT IC.</td>
</tr>
<tr>
<td>9</td>
<td>Photomicrograph of 32-Point CCD CZT IC.</td>
</tr>
<tr>
<td>10</td>
<td>Silicon Gate CCD Coplanar Electrode Structure</td>
</tr>
<tr>
<td>11</td>
<td>Block Diagram of Internal ROM, Up/Down Counter, and Dual MDACs.</td>
</tr>
<tr>
<td>12</td>
<td>Block Diagram of Second Dual MDAC</td>
</tr>
<tr>
<td>13</td>
<td>Block Diagram of CCD Input Amplifiers, CCDs, DCI, and Output Buffers.</td>
</tr>
<tr>
<td>14</td>
<td>Photomicrograph of CCD Transversal Filters.</td>
</tr>
<tr>
<td>15</td>
<td>Schematic of CCD Filters.</td>
</tr>
<tr>
<td>16</td>
<td>CCD Timing.</td>
</tr>
<tr>
<td>17</td>
<td>CCD Capacitances.</td>
</tr>
<tr>
<td>18</td>
<td>A Possible Switching Diagram for Figure 17.</td>
</tr>
<tr>
<td>19</td>
<td>Symmetrized and Compensated Two-Phase CCD Layout.</td>
</tr>
<tr>
<td>20</td>
<td>DCI Amplifier Compensated for Gain of 30.</td>
</tr>
<tr>
<td>21</td>
<td>Photomicrograph of Operation Amplifier Used on CZT IC.</td>
</tr>
<tr>
<td>22</td>
<td>Block Diagram of Single-Loop Amplifier.</td>
</tr>
<tr>
<td>23</td>
<td>Differential Amplifier.</td>
</tr>
<tr>
<td>24</td>
<td>Differential Amplifier Theoretical Open Loop Characteristics</td>
</tr>
<tr>
<td>25</td>
<td>Input Amplifier Closed Loop Frequency Response.</td>
</tr>
<tr>
<td>26</td>
<td>DCI Amplifier</td>
</tr>
<tr>
<td>27</td>
<td>DCI Timing.</td>
</tr>
<tr>
<td>28</td>
<td>CCD Clocks.</td>
</tr>
<tr>
<td>29</td>
<td>Model for DCI</td>
</tr>
<tr>
<td>30</td>
<td>Capacitance Model for DCI Input</td>
</tr>
<tr>
<td>31</td>
<td>DCI Diff-Amp Open Loop Performance.</td>
</tr>
<tr>
<td>32</td>
<td>Closed Loop Frequency Response of DCI Amplifier</td>
</tr>
<tr>
<td>33</td>
<td>Differential Amplifiers Test.</td>
</tr>
<tr>
<td>34</td>
<td>Single-Ended Transfer Characteristics of Input Differential Amplifier.</td>
</tr>
</tbody>
</table>
## LIST OF ILLUSTRATIONS

(Continued)

<table>
<thead>
<tr>
<th>FIGURE</th>
<th>Description</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>35</td>
<td>Response of Input Differential Amplifier to Common Mode Input.</td>
<td>57</td>
</tr>
<tr>
<td>36</td>
<td>Differential Mode Characteristics of Input Amplifier.</td>
<td>58</td>
</tr>
<tr>
<td>37</td>
<td>Single-Ended Response of Differential Amplifier to Ramp Input.</td>
<td>60</td>
</tr>
<tr>
<td>38</td>
<td>Photograph Showing the Gain and Linearity of Input Amplifiers.</td>
<td>61</td>
</tr>
<tr>
<td>39</td>
<td>DCI Amplifier Open Loop Characteristics.</td>
<td>62</td>
</tr>
<tr>
<td>40</td>
<td>Photograph of Dual MDACs Used on CZT IC.</td>
<td>63</td>
</tr>
<tr>
<td>41</td>
<td>MDAC.</td>
<td>64</td>
</tr>
<tr>
<td>42</td>
<td>Buffered Input Sample-and-Hold.</td>
<td>67</td>
</tr>
<tr>
<td>43</td>
<td>ROM Sense and MDAC Drive Circuitry.</td>
<td>69</td>
</tr>
<tr>
<td>44</td>
<td>Typical MDAC Operation (Multiplication by 0.5).</td>
<td>70</td>
</tr>
<tr>
<td>45</td>
<td>Transfer Curve for Input Sample-and-Hold Circuit of MDAC.</td>
<td>72</td>
</tr>
<tr>
<td>46</td>
<td>Block Diagram of MDAC Test.</td>
<td>73</td>
</tr>
<tr>
<td>47</td>
<td>Response of MDAC to dc Input and Sine Chirp of Internal ROM ($V^+ - V^-$) &lt; 0</td>
<td>74</td>
</tr>
<tr>
<td>48</td>
<td>Relative Accuracy of MDACs.</td>
<td>76</td>
</tr>
<tr>
<td>49</td>
<td>8-Bit by 17-Word ROM (Sine Chirp Coded).</td>
<td>78</td>
</tr>
<tr>
<td>50</td>
<td>Photograph of ROM.</td>
<td>80</td>
</tr>
<tr>
<td>51</td>
<td>Analog Representation of Sine Chirp Coded into ROM.</td>
<td>81</td>
</tr>
<tr>
<td>52</td>
<td>Three-Stage Up/Down Ring Counter.</td>
<td>83</td>
</tr>
<tr>
<td>53</td>
<td>Typical Shift Register Operation (SRO).</td>
<td>84</td>
</tr>
<tr>
<td>54</td>
<td>Up/Down Counter Waveforms - Normal Operation.</td>
<td>88</td>
</tr>
<tr>
<td>55</td>
<td>Photograph of Up/Down Counter.</td>
<td>89</td>
</tr>
<tr>
<td>56</td>
<td>Synchronization of Two CCD CZT ICs.</td>
<td>91</td>
</tr>
<tr>
<td>57</td>
<td>External Sync Pulse Generator.</td>
<td>92</td>
</tr>
<tr>
<td>58</td>
<td>External Sync Pulse Generator Timing - Normal Operation.</td>
<td>93</td>
</tr>
<tr>
<td>59</td>
<td>GEN 30 (Internal Sync Pulse).</td>
<td>95</td>
</tr>
<tr>
<td>60</td>
<td>GEN 30 Timing.</td>
<td>96</td>
</tr>
<tr>
<td>61</td>
<td>Synchronization Outputs of Two CZT ICs Operating Together</td>
<td>97</td>
</tr>
<tr>
<td>62</td>
<td>Block Diagram of Clock Generators.</td>
<td>99</td>
</tr>
</tbody>
</table>
# List of Illustrations (Continued)

<table>
<thead>
<tr>
<th>FIGURE</th>
<th>Description</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>63</td>
<td>G1 Clock Pulse Generator.</td>
<td>100</td>
</tr>
<tr>
<td>64</td>
<td>CCD $\phi_2^+$ Clock Pulse Generator.</td>
<td>102</td>
</tr>
<tr>
<td>65</td>
<td>G3 Inverter</td>
<td>103</td>
</tr>
<tr>
<td>66</td>
<td>G4 Inverter</td>
<td>104</td>
</tr>
<tr>
<td>67</td>
<td>G5 Generator</td>
<td>105</td>
</tr>
<tr>
<td>68</td>
<td>G6 Generator</td>
<td>106</td>
</tr>
<tr>
<td>69</td>
<td>Connections of the Clock Generators Showing Capacitive Load of Each Driver</td>
<td>107</td>
</tr>
<tr>
<td>70</td>
<td>System Timing</td>
<td>110</td>
</tr>
<tr>
<td>71</td>
<td>Clock Circuit Performance</td>
<td>111</td>
</tr>
<tr>
<td>72</td>
<td>Relationship of $\phi_B$ and $\phi_C$</td>
<td>112</td>
</tr>
<tr>
<td>73</td>
<td>Photograph of CCD $\phi_2$ Clock Driver</td>
<td>113</td>
</tr>
<tr>
<td>74</td>
<td>A System for Obtaining Power Density Spectrum of a Real Input Using One CZT IC (50% Input/Output Duty Cycle)</td>
<td>115</td>
</tr>
<tr>
<td>75</td>
<td>Operation of CCD CZT IC with 10 kHz Clocks.</td>
<td>116</td>
</tr>
<tr>
<td>76</td>
<td>Operation of CCD CZT IC with 1 MHz Clocks.</td>
<td>117</td>
</tr>
<tr>
<td>77</td>
<td>Power Density Spectra for Three Sine Waves Obtained Using the 32-Point CCD CZT at a 10 kHz Sample Rate.</td>
<td>118</td>
</tr>
<tr>
<td>78</td>
<td>Power Density Spectra for Three Sine Waves Obtained Using the 32-Point CCD CZT at 1 MHz Sample Rate</td>
<td>119</td>
</tr>
<tr>
<td>79</td>
<td>Response of CCD CZT IC to Square-Wave Inputs (10 kHz Clock Frequency)</td>
<td>120</td>
</tr>
<tr>
<td>80</td>
<td>Realization of Complex I/O CZT with One Chip (50% Duty Cycle)</td>
<td>122</td>
</tr>
<tr>
<td>81</td>
<td>Realization of Complex I/O CZT Using Two CZT ICs (100% Duty Cycle)</td>
<td>123</td>
</tr>
<tr>
<td>82</td>
<td>Realization of Correlator Using Two CZT ICs (Real 25% Duty Cycle Input, Real 50% Duty Cycle Output)</td>
<td>124</td>
</tr>
<tr>
<td>83</td>
<td>Analog Correlator Block Diagram</td>
<td>126</td>
</tr>
<tr>
<td>84</td>
<td>Floating Gate Reset Technique</td>
<td>127</td>
</tr>
<tr>
<td>85</td>
<td>Analog Multiplier</td>
<td>128</td>
</tr>
<tr>
<td>86</td>
<td>CCD Input Stage</td>
<td>130</td>
</tr>
<tr>
<td>87(a)</td>
<td>CCD Input Stage Feedback-Linearized Output</td>
<td>135</td>
</tr>
<tr>
<td>87(b)</td>
<td>CCD Serial Output vs Input</td>
<td>135</td>
</tr>
<tr>
<td>FIGURE</td>
<td>DESCRIPTION</td>
<td>PAGE</td>
</tr>
<tr>
<td>--------</td>
<td>---------------------------------------------------------------------------------------------</td>
<td>--------</td>
</tr>
<tr>
<td>88(a)</td>
<td>CCD1 Serial Output Linearity $0.4 \ V_{pp}$ Signal</td>
<td>137</td>
</tr>
<tr>
<td>88(b)</td>
<td>CCD1 Serial Output Linearity $0.5 \ V_{pp}$ Signal</td>
<td>137</td>
</tr>
<tr>
<td>89(a)</td>
<td>CCD2 Serial Output Linearity $0.4 \ V_{pp}$ Signal</td>
<td>138</td>
</tr>
<tr>
<td>89(b)</td>
<td>CCD2 Serial Output Linearity $0.55 \ V_{pp}$ Signal</td>
<td>138</td>
</tr>
<tr>
<td>90</td>
<td>Multiplier Linearity Test Circuit</td>
<td>139</td>
</tr>
<tr>
<td>91(a)</td>
<td>Multiplier Output $V_{in} = \pm 0.3 \ V_{pk}$</td>
<td>141</td>
</tr>
<tr>
<td>91(b)</td>
<td>Multiplier Output $V_{in} = \pm 1 \ V_{pk}$</td>
<td>141</td>
</tr>
<tr>
<td>92(a)</td>
<td>Convolution of Two 300 mV Impulses (CCD2 Offsets)</td>
<td>142</td>
</tr>
<tr>
<td>92(b)</td>
<td>Convolution of Impulse with CCD Offsets (CCD1 Offsets)</td>
<td>142</td>
</tr>
<tr>
<td>93(a)</td>
<td>CCD1 Frequency Domain Offset Measurements (Low Pass Output to 2.5 kHz)</td>
<td>145</td>
</tr>
<tr>
<td>93(b)</td>
<td>CCD1 Low Pass and Offset Error Outputs to 10 kHz (Offset Error Output)</td>
<td>145</td>
</tr>
<tr>
<td>94(a)</td>
<td>CCD1 Low Pass and Offset Error Outputs to 2.5 kHz</td>
<td>146</td>
</tr>
<tr>
<td>94(b)</td>
<td>CCD2 Low Pass and Offset Error Outputs to 2.5 kHz</td>
<td>146</td>
</tr>
<tr>
<td>95(a)</td>
<td>CCD2 Low Pass Output to 25 kHz.</td>
<td>147</td>
</tr>
<tr>
<td>95(b)</td>
<td>CCD2 Offset Error Output to 25 kHz</td>
<td>147</td>
</tr>
<tr>
<td>96(a)</td>
<td>Multiplier Scale Factor Accuracy; Convolution of CCD1 (Impulse of Four Samples) with CCD2 (Full Scale DC)</td>
<td>149</td>
</tr>
<tr>
<td>96(b)</td>
<td>Multiplier Scale Factor Accuracy; Multiplier Scale Factor Error.</td>
<td>149</td>
</tr>
<tr>
<td>97(a)</td>
<td>64-Point Convolution of Two Low-Frequency Triangle Waves</td>
<td>151</td>
</tr>
<tr>
<td>97(b)</td>
<td>64-Point Convolution of Two Low-Frequency Square Waves</td>
<td>151</td>
</tr>
<tr>
<td>98</td>
<td>Block Diagram of the Binary/Analog Correlator</td>
<td>160</td>
</tr>
<tr>
<td>99</td>
<td>Schematic of the Operation of the Programmable CCD Correlator</td>
<td>162</td>
</tr>
<tr>
<td>100</td>
<td>Schematic of a Programmable CCD Correlator with M-Bit Accuracy on the Weighting Coefficients.</td>
<td>165</td>
</tr>
<tr>
<td>101</td>
<td>Shift Register and Clock Generator for CCD Correlator</td>
<td>166</td>
</tr>
</tbody>
</table>
LIST OF ILLUSTRATIONS
(Continued)

<table>
<thead>
<tr>
<th>FIGURE</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>102</td>
<td>Photograph of a 32-Stage Binary/Analog Correlator with Four-Bit Weighting Coefficients</td>
</tr>
<tr>
<td>103</td>
<td>Experimental Impulse Response of 32-Stage Binary/Analog Convolver</td>
</tr>
<tr>
<td>104</td>
<td>Response of the Test Device Operated as a Bandpass Filter</td>
</tr>
<tr>
<td>105</td>
<td>Frequency Response of the Binary/Analog Correlator Programmed to Give a Bandpass Filter with Passband at 125 kHz</td>
</tr>
<tr>
<td>106</td>
<td>Matched Filter Operation of the Test Device Using a 13-Bit Barker Code</td>
</tr>
<tr>
<td>107</td>
<td>Block Diagram for Computing $Y^TY$</td>
</tr>
<tr>
<td>108</td>
<td>Block Diagram of Binary/Analog Correlator IC</td>
</tr>
<tr>
<td>109</td>
<td>Preliminary Design of Four-Column by 16-Stage by Eight-Bit CCD Matrix Multiplier IC</td>
</tr>
<tr>
<td>110</td>
<td>System Block Diagram for Pattern Classifier</td>
</tr>
<tr>
<td>111</td>
<td>Block Diagram of Pattern Classifier Using 32 x 2-Bit Binary/Analog Correlator</td>
</tr>
<tr>
<td>112</td>
<td>Illustration of 8-Bit Accuracy of the Binary/Analog Correlator</td>
</tr>
</tbody>
</table>

LIST OF TABLES

<table>
<thead>
<tr>
<th>TABLE</th>
<th>PAGE</th>
</tr>
</thead>
<tbody>
<tr>
<td>I</td>
<td>FET Geometries and Supplies</td>
</tr>
<tr>
<td>II</td>
<td>MDAC Error</td>
</tr>
<tr>
<td>III</td>
<td>MDAC Characteristics</td>
</tr>
</tbody>
</table>
SECTION I
INTRODUCTION

The mathematical operation of convolution is a very important function in many of the pattern recognition or pattern classification algorithms that are used to synthesize useful information about the earth from the electrical outputs of satellite-borne environmental sensors. Since charge coupled devices (CCDs) are so well suited to performing convolution, it is not surprising that substantial interest has arisen within NASA in developing CCDs for preprocessing environmental sensor data preparatory to its transmittal to the ground.

The one complication that limits the immediate usefulness of CCDs is the fact that the satellite preprocessors are learning machines and must adapt to the terrain they are observing. In terms of CCD hardware, this means that the convolutions must be programmable; i.e., the impulse responses of the filters must be electronically variable.

The canonical CCD transversal filter, illustrated in Figure 1, has fixed weighting coefficients \( h_n \) \( n = 0, N - 1 \) that are coded into the photomask with which the device is made. The impulse response is determined by the \( h_n \) and is not electrically alterable. Various approaches have been tried to construct filters with electronically alterable \( h_n \),\(^2,3\) but all such attempts have failed to maintain the electronic simplicity of the fixed weighting coefficient transversal filter.

This program addresses two different techniques for performing programmable correlation:

- Frequency domain correlation using the CCD chirp z-transform (CZT).
- Time domain correlation using direct analog/analog correlation and binary/analog correlation.

Two separate ICs have been developed under the program:

- 32-point CCD CZT IC.
Figure 1.- Block Diagram of a Transversal Filter Showing Delay Stages $D$ and Weighting Coefficients $h_n$, $n=0, N-1$. 
32-point CCD analog/analog correlator.

These two ICs are discussed in Sections II and III, respectively. In addition, a prototype IC capable of correlating a digital reference word with an analog input signal was evaluated under this program. The evaluation of the binary/analog correlator is discussed in Section IV.

A. Frequency Domain Correlation

This approach to programmable correlation is based on the fact that correlation in the time domain is equivalent to multiplication in the frequency domain. Consequently, the frequency domain correlator is based on taking the discrete Fourier transform (DFT) of the input signal.

The discrete Fourier transform operation can be performed using the chirp z-transform algorithm. The CZT gets its name from the fact that it can be implemented in an analog manner by (1) premultiplying the time signal with a chirp (linear FM) waveform, (2) filtering in a chirp convolution filter, and (3) postmultiplying with a chirp waveform.

When implemented digitally, the CZT has no advantages over the conventional fast Fourier transform algorithm. However, the algorithm lends itself naturally to implementation with CTD transfersal filters.

Starting with the definition of the DFT

$$F_k = \sum_{n=0}^{N-1} f_n e^{-i2\pi nk/N} \quad (1)$$

and using the substitution

$$2nk = n^2 + k^2 - (n - k)^2 \quad , \quad (2)$$

the following equation results.

$$F_k = e^{-i\pi k^2/N} \left[ \sum_{n=0}^{N-1} \left( f_n e^{-i\pi n^2/N} \right) e^{i\pi (k-n)^2/N} \right] \quad . \quad (3)$$
This equation has been factored to emphasize the three operations that make up the CZT algorithm. It is illustrated in Figure 2.

To implement the conventional $N$-point CZT, the CCD filters are chirp filters of length $2N-1$ that chirp from $-f_c$ to $+f_c$, and the premultiply waveform has a time duration $N/f_c$ and chirps from zero to $-f_c$. A physical interpretation in terms of correlation of the input chirp with the filter is given in Figure 3. When the input signal has zero frequency, the product with the premultiply chirp results in an input waveform to the filter that chirps from 0 to $-f_c$. The samples corresponding to frequencies near $f = 0$ are clocked into the filter first, and those near $f = -f_c$ are clocked in last. This sequence of samples results in a correlation peak at $t = t_0$, when the product waveform has been clocked into the first half of the filter. When the input frequency is $f_1 \neq 0$, the product with the premultiply chirp results in an input to the filter that chirps from $f_1$ to $-f_c + f_1$. The input waveform $(v \times \text{chirp})$ in Figure 3 corresponds to an input signal at a frequency $f_1$ at time $t = t_0$. This waveform is shifted to the right as $t$ increases, resulting in a correlation peak at $t_1$. The shift in time relative to the dc correlation peak is

$$t_1 - t_0 = \frac{N}{f_c} \frac{f_1}{2}.$$  \hspace{1cm} (4)

In this way, the time axis of the output is calibrated in frequency.

The CZT is very convenient for performing programmable correlation. Figure 4(a) shows that correlation of $x(t)$ with $h(t)$ can be accomplished by (1) performing the discrete Fourier transform (DFT) of $x(t)$ to obtain $X(f)$; (2) multiplying $X(f)$ by $H(f)$, where $H(f)$ is the DFT of $h(t)$; and (3) performing the inverse DFT on $Y(f) = X(f) \cdot H(f)$.
Figure 2.- Schematic of the CZT Algorithm.
Figure 3.—Interpretation of the CZT in Terms of Chirp Input Waveforms in Chirp Filters.
Figure 4.—Block Diagram of a CCD Programmable Correlator.
Figure 4(b) is obtained from 4(a) by replacing the DFT and inverse DFT blocks with their CCD CZT equivalents, and 4(c) is obtained by simplifying 4(b). Figure 4(c), therefore, represents an economical and useful way to perform programmable correlation in pattern classification machines, and this program had as one of its goals the development of CCD CZT ICs to perform spectral analysis for a broad range of NASA requirements including programmable correlation.

The 32-point CCD CZT IC developed under this program is discussed in Section II.

B. Time Domain Correlation

An alternate approach to achieving programmable correlation is the direct time domain correlator. The analog/analog correlator is illustrated in Figure 5. It is similar in operation to the transversal filter illustrated in Figure 1, except that the weighting coefficients are electronically programmable by clocking an analog reference signal into the device. It operates by first loading CCD1 with a reference signal and then clocking a second signal through CCD2. Since the charge is stored dynamically in CCD1, it decays due to thermal leakage and must be refreshed after 10 ms to 100 ms.

Each stage of the CCD is tapped with a diffusion whose potential takes on a value approximately proportional to the signal charge, and four-quadrant multiplication is accomplished using the circuit of Figure 6. Transistors Q₃ and Q₄ act as voltage-controlled resistors, and the balanced configuration overcomes the inherent nonlinearity of MOS transistors used as resistors. The output $V_2$ from CCD2 is applied to the gate of $Q_4$, and the gate of $Q_3$ is biased to $V_{\text{ref}}$, which is equal to the value of $V_2$ when CCD₂ has the zero level of charge (half-full well). The output from CCD1 is buffered by a source-follower whose output $V_a$ is applied to the drain of $Q_3$ and $Q_4$. The sources of $Q_3$ and $Q_4$ are connected to virtual ground current buses $\Sigma^-$ and $\Sigma^+$ whose currents are differenced in an off-chip differential amplifier.
Figure 5. - Block Diagram of the Analog/Analog Time Domain Correlator.
Figure 6.- Circuit Schematic of the Four-Quadrant Multiplier Used in the Analog/Analog Correlator of Figure 5.
The currents flowing in $Q_3$ and $Q_4$ are

$$I_3 = B (V_{\text{ref}} - V_t - \frac{1}{2}V_a) V_a,$$  \hspace{1cm} (5)

and

$$I_4 = B (V_2 - V_t - \frac{1}{2}V_a) V_a.$$  \hspace{1cm} (6)

The difference current

$$I_4 - I_3 = B (V_2 - V_{\text{ref}}) V_a$$  \hspace{1cm} (7)

contains no nonlinear terms.

The 64-point analog/analog time domain correlator designed, fabricated, and tested under this contract is discussed in detail in Section III.

An alternate approach to the analog/analog correlator is the binary/analog correlator. The concept of the binary/analog correlator is illustrated in Figure 7. The weighting coefficients ($h_0, h_1, \ldots, h_{N-1}$ of Figure 1) are made electronically programmable by decomposing each one into a binary representation. $h_n$ is represented with $M$-bit accuracy by

$$h_n = \sum_{k=0}^{M-1} h_n^k 2^{-k}.$$  \hspace{1cm} (8)

For $M$-bit accuracy, $M$ parallel CCD binary/analog correlators are put on a chip as shown in Figure 7. The most significant bit $h_n^0$ of each coefficient is loaded into the static shift register (coefficient store) shown as elongated rectangles in the filter at the top of the figure. The second most significant bit $h_n^1$ of each coefficient is loaded into the second coefficient store, and the least significant bit $h_n^{M-1}$ is loaded into the coefficient store shown at the bottom of the figure. The analog input signal to be filtered is applied without
Figure 7.- Block Diagram of the Binary/Analog Correlator.
attenuation to the top filter (most significant bit). The input is attenuated by a factor of two at the input of the second filter (second most significant bit), and it is attenuated by a factor of $2^{M-1}$ at the input to the bottom filter (least significant bit). This attenuation is performed using capacitive ratio techniques similar to those employed in MOS MDAC technology. The coefficients stored in the static shift registers then control the weighting of the transversal filters, and when the outputs of each filter are summed together as shown, the result is

$$H(z) = V_{out}(z) = \frac{V_{out}(z)}{V_{in}(z)} = \sum_{n=0}^{N-1} h_n^0 z^{-n} + 2^{-1} \sum_{n=0}^{N-1} h_n^1 z^{-n} + \ldots + 2^{-(M-1)} \sum_{n=0}^{N-1} h_n^{M-1} z^{-n} \quad (9)$$

$$= \sum_{n=0}^{N-1} \left[ \sum_{k=0}^{M-1} \left( h_n^k 2^{-k} \right) \right] z^{-n} \quad (10)$$

$$= \sum_{n=0}^{N-1} h_n^k z^{-n} \quad . \quad (11)$$

The binary/analog correlator is discussed in detail in Section IV.
This section describes in detail the design and operation of the frequency domain correlator. The design of each component of the CZT IC is discussed, and applications and experimental results are described.

A. Chip Overview

The frequency domain correlator contains all the hardware needed to implement a 32-point CZT, as discussed in the introduction and shown schematically in Figure 4. The IC, shown in block diagram form in Figure 8, is composed of the following:

- ROM
- Up/down counter
- Four MDACs
- Four operational amplifiers
- Eight output buffers
- Clock generators
- Four 64-stage CCD filters.

A photomicrograph of the IC is shown in Figure 9.

The IC measures 6.04 x 5.69 mm² (238 x 224 mils²) and was fabricated using TI's two-phase silicon gate coplanar electrode structure, shown in Figure 10. To provide storage and directionality, n-type ion implants (wells) are diffused under the two clock electrodes. As seen in Figure 10, when both the ϕ₁ and ϕ₂ clocks are off, charge (signal) resides in the wells (under the ϕ₁ electrode for this example). When ϕ₂ is turned on, a potential energy gradient is formed, forcing the charge to move under the ϕ₂ electrode. Next, both clocks are turned off, and the charge remains in the ϕ₂ well. When ϕ₁ is turned on, the charge moves under the ϕ₁ electrode and has moved one bit down the CCD structure.
Figure 8.—Block Diagram of 32-Point CZT IC.
Figure 9: Photomicrograph of 32-Point CCD CZT IC.
Figure 10.-Silicon Gate CCD Coplanar Electrode Structure.
Because of the complexity of the analog circuits surrounding the CCD filters, much circuit modeling and MOSFET modeling was needed. To obtain accurate MOSFET models for use in the circuit modeling, data were taken on MOSFETs from previously processed ICs. These data were tabulated and fitted to the Shichman-Hodges MOS equations using the computer, and parameter values were extracted for circuit modeling. This procedure allowed the design of the peripheral circuitry to be accomplished with a high degree of confidence.

In an effort to provide the greatest possible flexibility, the IC was partitioned into three main signal processing sections. The first two sections are both dual MDACs. The MDACs and their associated circuitry are shown in Figures 11 and 12. The ROM is common to both sets of MDACs and has 16 output lines, 8 for the sin chirp words and 8 for the cosine chirp words. The sin chirp is available to one MDAC in each figure, and the cos chirp is available to the other two MDACs. The eight-bit external word is available to all four MDACs and may be individually selected for each set of dual MDACs. Each MDAC consists of an input sample-and-hold buffer, ROM sense and MDAC drive circuitry, the MDAC capacitor array, and an output buffer. The third section of the partition, shown in Figure 13, consists of two CCD input amplifiers (A₁ and A₂), CCD input blanking circuitry, four 64-stage CCD transversal filters, two differential current integrators (CCD output amplifiers), and four output buffers. The CCD input amplifiers make the inverse of the signal available to the CCDs so that both CCD outputs to the DCI may be summed the same way. The CCD input blanking circuitry uses the T₁ and T₂ pulses from the up/down counter to load "zeros" into the CCD for appropriate periods. The T₂ pulse is always used. The T₁ pulse is used only to implement the noncircular correlation function and must be stitch-bonded on-chip if it is required.

Each of the three partitioned sections utilizes output buffering circuits so that the sections can be connected in several configurations using one or more ICs. Also, each section has the circuitry required to process real and
Figure 11.- Block Diagram of Internal ROM, Up/Down Counter, and Dual MDACs.
Figure 12.-Block Diagram of Second Dual MDAC.
Figure 13.- Block Diagram of CCD Input Amplifiers, CCDs, DCI, and Output Buffers.
imaginary signal components. In addition to the three partitioned sections described above, the chip contains clock-generating circuitry. The clock generators derive all the internal timing pulses from the two-phase clocks input to the IC.

B. CCD Filter

The four filters needed to implement the CZT algorithm are chirp convolution filters with 2N-1 stages. A photograph of the four CCD filters is shown in Figure 14. The weighting coefficients for the sine and cosine chirp filters are

\[
h_k^{\cos} = \cos \frac{\pi k^2}{32} \quad k = 0, 31
\]

\[
h_k^{\sin} = \sin \frac{\pi k^2}{32} \quad k = 0, 31
\]

The coefficients were realized using the split-electrode technique, and because the CZT IC was designed to perform the true CZT, the filter tapweights were repeated for the remaining N-1 stages of the filters.

The CCD filter structure is illustrated in Figure 15. Each filter is surface channel and uses two-phase clocking. Each stage is 25.4 µm (1 mil) long and consists of a 12.7 µm (0.5 mil) long first poly electrode and a second poly electrode that is effectively 12.7 µm (0.5 mil) long. The barriers are 5.1 µm (0.2 mil) each, and the wells are 7.6 µm (0.3 mil) each. The channel width is 155 µm (6.1 mils). The split electrodes are the first poly electrodes. The split electrode gaps are 5 µm (0.2 mil), and the channel stop beneath each gap is 12.7 µm (0.5 mil) wide.

The channel stop beneath each gap serves two purposes. The first is to prevent the electrons of the signal charge packets from being trapped in these
Figure 15.—Schematic of CCD Filters.
areas and causing serious degradation of the charge transfer efficiency. The second reason for the channel stop is that the position of the weighting coefficients is defined by the same photomask that defines the CCD channel. Thus, the weighting coefficient accuracy is determined by the accuracy with which the photomask is made and is not dependent on alignment of different photomask levels in the device fabrication process. With this approach approximately 0.16% accuracy in the weighting coefficients is possible with a 155 μm (6.1 mil) wide channel in the CCD. This gives an area for the first well area of $7.6 \times (1.55 - 12.7) = 1084 \, \mu m^2 \Rightarrow [0.3 \times (6.1 - 0.5) = 1.68 \, mil^2]$.

The CCD clocking sequence is illustrated in Figure 16 relative to the two-phase input clocks $\phi_1$ and $\phi_2$. The analog input to the CCD is capacitively coupled to the input diode. The flat-zero level on the diode is set to $V_{FZ}$ when $\phi_1'$ comes up. After $\phi_1'$ falls, the CCD input signal is allowed to change and the IP/G signal pulls up, dumping signal charge into the receiving well, which is controlled by a dc bias. The total receiving well area is $710 \, \mu m^2$ (1.1 mil$^2$), making the capacity of this well 65% as large as the capacity of the following wells. This prevents saturation of the filter. The signal is sampled when IP/G falls. Next, the CCD first $\phi_1$ clock rises and falls, dumping the sampled signal charge under the first split electrode. The CCD first $\phi_1$ electrode rises to a level less than that of the CCD $\phi_2^\pm$ clock lines to make sure that all charge is transferred. At the same time, the CCD $\phi_1$ electrodes fall, transferring the charge packets under the CCD $\phi_1$ electrodes to the remaining split electrodes. These split electrodes must be floating when this occurs in order to sense the charge packets.

The capacitive coupling of the CCD $\phi_1$ line to the CCD $\phi_2^\pm$ lines is compensated for by another clock, $\overline{\text{CCD}}\phi_1$, which is capacitively coupled to the CCD $\phi_2^\pm$ lines by approximately the same amount. The $\overline{\text{CCD}}\phi_1$ clock comes up when CCD $\phi_1$ clock falls. The output signal is sampled before the CCD $\phi_1$ line rises
Figure 16.—CCD Timing.
again. When the CCD $\phi_2^\pm$ electrodes are pulled down, the charge packets are transferred to the following CCD $\phi_1$ electrodes. When each charge packet reaches the end of the CCD, it is dumped onto the output diode, which is tied to $V_{DD}$. An effort was made in laying out the four CCDs and their interconnects to match the capacitance on all four output lines. Consideration was given both to capacitance to the substrate and to capacitance to the other clock lines.

C. Operational Amplifiers

1. Design

In the realization of the 32-point CZT, four operational amplifiers are needed to perform two functions. The first function is to provide gain and the inverse of the signal to the inputs of the CCDs. These amplifiers are called the input diffamps and are designated as A1 and A2 in Figure 13. The need for the inverse of the signal can be understood best by considering the function of the differential current integrator (DCI).

The general DCI scheme and the capacitances associated with the CCD are shown in Figure 17. The following capacitances are indicated in the figure:

- $C_S =$ stray to substrate, made equal on $V^\pm$ nodes.
- $C_{OF} =$ fixed overlaps from phase 1 ($\phi_1$) to $V^\pm$ nodes, also made equal.
- $C_{OL} = (1 \pm h_i)/2 =$ overlap to $\phi_1$ for each stage, depends on $h_i$.
- $C_{OF} + N/2 C_{OL} =$ a coupling to $\phi_1$, equal for each node $V^\pm$.
- $C_{OX} = (1 \pm h_i)/2 =$ $V^\pm$ electrode to channel capacitance for each stage.
- $C_D = (1 \pm h_i)/2 =$ depletion layer capacitance for each stage.

A possible timing scheme is shown in Figure 18. The operation is as follows. The split electrode nodes $V^\pm$ are clocked to the voltage $V_2$ via the series switches $S_2$, while the phase 1 ($\phi_1$) clock is high. The switches $S_2$ are then turned off, and nodes $V^\pm$ are left floating. The next point in the timing
Figure 17—CCD Capacitances.
Figure 18.- A Possible Switching Diagram for Figure 17. A high voltage on a switch turns it on.
diagram is the turn-off of phase 1. At this point the charge transfers under the split electrodes. The nodes $V'_{-}$ are isolated, and feedthrough from $\Phi_1$ via the overlap capacitance could reduce the voltage $V'_{+} \approx V_2$ to a value low enough so that incomplete transfer could take place. To prevent this, a capacitive coupling to $\Phi_1$ is provided, which is approximately equal to the coupling to $\Phi_1$.

As the signal charge $Q_S$ flows under the split electrodes, the node voltages $V'_{\pm}$ change in proportion to the sum of the tap weights $(1 \pm h_i)$. The node voltages then are

$$V'_{\pm} = V_2 - \Phi_1 \left[ C_{OF} + \sum_{i=1}^{N} (1 \pm h_i) \frac{C_{OL}}{2C'_{\pm}} \right]$$

$$+ \Phi_1 \left( C_{OF} + \frac{N}{2} \frac{C_{OL}}{2C'_{\pm}} + \sum_{i=1}^{N} \frac{C_{OX}}{2C'_{\pm}} (1 \pm h_i) (V_i + V_z)/2C'_{\pm} \right), \quad (12)$$

where

$$C'_{\pm} = C_S + 2C_{OF} + \frac{N}{2} (2 \pm \bar{h}) + \frac{C_E}{2} (1 \pm \bar{h}), \quad (13)$$

and $C_E = C_{OS}, C_D$ in series, and

$$\bar{h} = \frac{1}{N} \sum_{i=1}^{N} h_i,$$

$V_z =$ bias voltage for zero signal (fat zero),

$V_i =$ signal voltage sampler.

The node voltages $V'_{\pm}$ are to be sensed by the differential feedback amplifier $A$ when the switches $SA$ are turned on at the next stage in the timing diagram. The negative feedback of the amplifier causes its output to change until the input voltages $V'_{+}$ and $V'_{-}$ are equal.
The output voltage \( V_{\text{out}} \) is proportional to the difference:

\[
V^+ - V^- = -V_0 \sum_{i=1}^{N} h_i V_i / \ell \phi + V_z \sum_{i=1}^{N} h_i V_i / \ell \phi + C_{\text{OL}} / \ell \phi + C_{\text{OX}} / \ell \phi + (C_{\text{OL}} + C_{\text{E}}) \nu \ell \phi \text{ to node } V^-.
\]

where \( C^+ \) and \( C^- \) have been made equal by the addition of \( (C_{\text{OL}} + C_{\text{E}}) \nu \ell \phi \) to node \( V^- \). In practice, the addition may have to be made to \( V^+ \) if \( h \) is negative.

Only the third term in Equation (14) is the desired output of the convolution filter. The first two terms need to be eliminated. This can be accomplished in two ways. The first involves an additional channel in parallel with the signal channel. This new channel is of width \( |h| \cdot W_S \) if \( W_S \) is the signal channel width, and is located on the side of the \( V^- \) electrode if \( h > 0 \). The new channel only carries a signal equal to \( V_z \). Again, in practice, two such channels may be required, one on each side of the main signal channel. Their widths are then adjusted so that they differ by \( |h| W_S \). A possible layout of such a structure is shown in Figure 19. The addition not only cancels the two unwanted parts in the output [Equation (14)], but also adds just the required capacitances to equalize \( C^+ \) and \( C^- \).

A second way the two additional terms in Equation (14) can be cancelled is appropriate to the chirp z-transform. Here the output of a cos and a sin chirp are added. However, the values of \( \bar{h} \) for the sin and cos chirp are equal, at least for an even number of stages. Even when apodization is added to the CCD chirp filter, as may be done for a sliding transform, the values of \( \bar{h} \) differ by only a small amount. For example, for even \( N \), \( \bar{h}(\text{cos}) = \bar{h}(\text{sin}) = \sqrt{2N} \). For a Hamming weighted filter we calculate
Figure 19. - Symmetrized and Compensated Two-Phase CCD Layout.
The difference is small for the larger transforms.

The required equalization of capacitances and cancellation of unwanted signals can then be obtained by subtracting the output of a cos and sin chirp filter. In practice, this subtraction is achieved by cross-connecting the $V^+$ (cos) and $V^-$ (sin) nodes. No additional channel is needed. However, one of the input signals has to be available in both normal and inverted form. Since the length of our transform was small and gain was needed at the inputs to the CCD filters, we chose to use the second approach. Two operational amplifiers are used at the input to the filters and two amplifiers are used as DCIs at the output of the filters. The designs of the two diffamps are identical except for the values of the compensation capacitors and the feedback capacitors.

The basic amplifier is shown in Figure 20 with the design parameters listed in Table I. A photomicrograph of the amplifier is shown in Figure 21.

The amplifier has two stages of differential gain. Each stage is cascaded and uses depletion loads. Between the first and second stages are buffers (M14 and M15) which also serve as level translators. After the second gain stage, the outputs are double-buffered by M30 and M31, and then by M33 and M34. The current source biases of all buffers are cascaded to provide a
Figure 20. DCI Amplifier Compensated for Gain of 30.
Figure 21.- Photomicrograph of Operation Amplifier Used on CZT IC.
Table I
FET Geometries and Supplies

<table>
<thead>
<tr>
<th>FETS</th>
<th>W/L (mils)</th>
<th>W/L (μm)</th>
</tr>
</thead>
<tbody>
<tr>
<td>M1, M7, M23</td>
<td>0.35/0.3</td>
<td>8.9/7.6</td>
</tr>
<tr>
<td>M2</td>
<td>1.25/0.3</td>
<td>32/7.6</td>
</tr>
<tr>
<td>M3, M4</td>
<td>18/0.3</td>
<td>457/7.6</td>
</tr>
<tr>
<td>M5, M6</td>
<td>1.30/0.3</td>
<td>33/7.6</td>
</tr>
<tr>
<td>M8, M9</td>
<td>0.35/0.7</td>
<td>8.9/17.8</td>
</tr>
<tr>
<td>M10, M11</td>
<td>0.7/0.3</td>
<td>17.8/7.6</td>
</tr>
<tr>
<td>M12, M13</td>
<td>0.75/0.5</td>
<td>19/12.7</td>
</tr>
<tr>
<td>M14, M15</td>
<td>1.5/0.3</td>
<td>38/7.6</td>
</tr>
<tr>
<td>M16, M17, M21, M22</td>
<td>0.45/0.5</td>
<td>11.4/12.7</td>
</tr>
<tr>
<td>M18</td>
<td>0.6/0.3</td>
<td>15.2/7.6</td>
</tr>
<tr>
<td>M19, M20</td>
<td>2.45/0.3</td>
<td>62/7.6</td>
</tr>
<tr>
<td>M24, M25</td>
<td>0.35/0.5</td>
<td>8.9/12.7</td>
</tr>
<tr>
<td>M26, M27</td>
<td>0.5/0.3</td>
<td>12.7/7.6</td>
</tr>
<tr>
<td>M28, M29</td>
<td>0.55/0.5</td>
<td>14/12.7</td>
</tr>
<tr>
<td>M30, M31</td>
<td>0.95/0.3</td>
<td>24/7.6</td>
</tr>
<tr>
<td>M32</td>
<td>0.7/0.5</td>
<td>17.8/12.7</td>
</tr>
<tr>
<td>M33, M41, M42</td>
<td>0.65/0.5</td>
<td>16.5/12.7</td>
</tr>
<tr>
<td>M34, M37</td>
<td>0.35/0.65</td>
<td>8.9/16.5</td>
</tr>
<tr>
<td>M35</td>
<td>0.75/1.0</td>
<td>19/25.4</td>
</tr>
<tr>
<td>M36</td>
<td>0.65/1.7</td>
<td>16.5/43</td>
</tr>
<tr>
<td>M38</td>
<td>0.8/0.4</td>
<td>20/10</td>
</tr>
<tr>
<td>M39, M40</td>
<td>0.65/0.3</td>
<td>16.5/7.6</td>
</tr>
<tr>
<td>M43, M44</td>
<td>5.1/0.3</td>
<td>130/7.6</td>
</tr>
<tr>
<td>M45</td>
<td>0.75/1.7</td>
<td>19/43</td>
</tr>
<tr>
<td>M53</td>
<td>0.45/0.3</td>
<td>11.4/7.6</td>
</tr>
<tr>
<td>M54, M55</td>
<td>0.35/0.85</td>
<td>8.9/22</td>
</tr>
<tr>
<td>M56</td>
<td>0.4/0.6</td>
<td>10/15.2</td>
</tr>
</tbody>
</table>

\[ V_{DD} = +20 \text{ at } 3.5 \text{ mA} \]
\[ V_{SS} = 0 \]

\[ V_{BB} = -5 \text{ at } 1.5 \text{ mA} \]
\[ C1 = C2 = 0.075 \text{ pF} \]
higher output impedance. Dc operating point and common mode rejection are obtained by two overlapping feedback loops connected to the current source of each differential pair.

From the outputs of the second stage to the outputs of the first stage are two compensation capacitors. These capacitors make the second stage look like an integrator, creating a dominant pole in the amplifier's frequency response. This effect is identical to the effect of the compensation capacitor in μA741-type op-amps. The open loop gain of the amplifier is then the gm of the input differential transistors M3 and M4, times the impedance of the compensation capacitors. There is naturally some attenuation through the two output buffers.

\[ A = \frac{g_m}{\alpha_{BUF} C_{COMP}} \]

where

\[ \alpha_{BUF} = \text{attenuation of buffer}, \]
\[ g_m = \text{forward transfer conductance ratio of MOSFET M3, and} \]
\[ C_{COMP} = \text{value of compensation capacitor}. \]

This formula will hold true for frequencies below the crossover frequency of the inner loop formed by the buffers M14 and M15, the second gain stage, and the feedback path through the compensation capacitors back to the buffers' input. Therefore, the crossover frequency of the op-amp with its integration capacitors must be well below the unity gain point of the inner loop.

The common mode feedback loop dynamics are discussed by stages. In the first stage the gain is the gm of M1 (the first stage current source) times...
the impedance at nodes 7 and 8 (the first stage outputs) in parallel. The dominant pole is created by the capacitance at those nodes, including the compensation capacitors. The other ends of the compensation capacitors (nodes 18 and 19) tend to be held at virtual grounds by the action of the second stage common node feedback loop.

\[ A = \alpha_{\text{BUF}} \times g_{m_1} \times \frac{1}{C_2} , \]

where

- \( \alpha_{\text{BUF}} \) = attenuation of buffer,
- \( g_{m_1} \) = transconductance of MOSFET M1, and
- \( C_2 \) = compensation capacitor.

The action of the second stage common mode loop is similar to that of the first stage. Complications arise due to common mode feedback through the compensation capacitors and interactions with the first stage feedback loop. The effect of the first interaction is to create an unwanted pole due to the capacitance at node 15, the sources of the second stage differential pair. Though the second interaction tends to counteract this effect, it is still advisable to minimize the capacitance at node 15 by using the buffer M53 to drive M1.

As in the first stage feedback loop, care must be taken to prevent excessive phase shifting in the buffers which level-translate back down to the current source FETs gate.

The computer-predicted parameters of the two common mode loops are listed on the following page.
<table>
<thead>
<tr>
<th></th>
<th>First Stage</th>
<th>Second Stage</th>
</tr>
</thead>
<tbody>
<tr>
<td>Low Frequency Gain</td>
<td>5.5</td>
<td>5.5</td>
</tr>
<tr>
<td>Crossover Frequency</td>
<td>14 MHz</td>
<td>22 MHz</td>
</tr>
<tr>
<td>Phase Margin</td>
<td>67°</td>
<td>60°</td>
</tr>
</tbody>
</table>

Each amplifier required 47 mW dc power and occupies approximately 0.658 mm² (1020 mil²).

Both the input amplifiers and the DC1 amplifiers were modeled using the block diagram of Figure 22. From Figure 22 the closed-loop transfer function is

\[
A(s) = \frac{V_o(s)}{V_i(s)} = \frac{a(s)}{1 + a(s)f(s)}
\]

where \(a(s)\) is the open loop gain,

\(f(s)\) is the feedback gain, and

\(a(s)f(s)\) is the loop transmission gain.

The stability of each amplifier was based on the loop transmission, \(a(s)f(s)\).

A schematic of the input differential amplifier is shown in Figure 23. Transistors \(Q_1\) and \(Q_2\) are used to provide the proper input bias level for the amplifier. Assuming the open loop gain is infinite, the closed loop gain is

\[
A(s) = \frac{C_{IN}}{C_{FB}} = \frac{3.0 \text{ pF}}{0.5 \text{ pF}} = 6
\]

where \(C_{IN}\) is the value of the input capacitance and

\(C_{FB}\) is the feedback capacitance.

Both the input capacitors and the feedback capacitors were formed using first and second polysilicon as the capacitor plates. The dielectric constant of the gate oxide between the two polysilicon levels determine the capacitance.
Figure 22.- Block Diagram of Single-Loop Amplifier.
Figure 23.-Differential Amplifier.
The amplifier closed loop gain is determined by a capacitance ratio that is also the ratio of the areas of the two polysilicon capacitors. Since these areas can be accurately controlled, the gain is very accurate and predictable. A gain of 6 was chosen to compensate for the attenuation of the MDACs. Based on computer modeling, 1 pF compensation capacitors were chosen.

A plot of the predicted open loop characteristics of the input amplifier is shown in Figure 24. From this plot the following parameters can be obtained:

- Crossover Frequency: 6 MHz
- Feedback Gain: 1/6
- Gain Bandwidth Product: 36 MHz
- Phase Margin: 63°

The predicted closed loop frequency response is shown in Figure 25. The -3 dB frequency is 9 MHz.

The design of the DCI amplifier is the same as the input amplifier, but instead of providing only signal inversion and gain, the DCI amplifiers function as integrators, integrating the CCD output signal onto the feedback capacitors. A model of the DCI circuit is shown in Figure 26. The CCD output is shown as the split electrodes with a clock driver for each side of the electrodes. Two series-pass MOSFETs (Q₁ and Q₂) are shown that gate the signal into the DCI amplifier. The DCI amplifier uses the φ₂⁺ pulse as a reset pulse. The timing of the DCI circuitry is shown in Figure 27.

To prevent the DCI amplifier from seeing a large common mode signal (CCD φ₂⁺), series-pass transistors are used to isolate the amplifier from the split electrodes. To keep the amplifier outputs from drifting apart when the amplifier is isolated, M7 and M23 (Figure 20) were added. When the reset pulse goes high, the outputs of each differential stage are shorted together, holding the outputs of the amplifier at the same bias point. When the reset
Figure 24.-Differential Amplifier Theoretical Open Loop Characteristics.
Figure 25.- Input Amplifier Closed Loop Frequency Response.
Figure 26. DCI Amplifier.
Figure 27. DCI Timing.
pulse is off and the DCI sample pulse is high, the amplifier outputs settle to a new bias point and the signal output of the amplifier is the difference between this new bias point and voltage swing of the amplifier.

While the CCD $\phi_2^\pm$ clock lines are held low, the DCI amplifier is reset with the DCI reset pulse ($\phi'_2$). Also, the DCI sample pulse is off, isolating the $\phi_2^\pm$ electrodes from the DCI amplifier. Before the $\phi_2^\pm$ electrodes are allowed to float, the DCI sample pulse goes high, connecting the amplifier to the split electrodes. Next, the reset pulse goes low, and the amplifier is ready to integrate the signal charge onto its feedback capacitors when the CCD $\phi_1$ clock goes low. A photograph showing the CCD clocks is shown in Figure 28. The time the CCD $\phi_2^\pm$ clock is floating is indicated by the arrows.

The gain from the CCD filter input to the DCI amplifier output can be calculated from the input charge of the CCD filter

$$Q_{in} = C_{in} V_{in},$$

where $C_{in}$ is the input well capacity and $V_{in}$ is the input voltage,

and the output charge

$$Q_{out} = N(\frac{1}{8}) C_{in} V_{in},$$

where $N$ is the number of points in the transform (32). The factor of $\frac{1}{8}$ arises when a non-dc input is assumed. Since

$$Q_{out} = C_{out} V_{out}$$

then

$$V_{out} = \frac{Q_{out}}{C_{out}} = \frac{16 C_{in} V_{in}}{C_{out}},$$

where $C_{out}$ is the feedback capacitor of the DCI amplifier.
Figure 28.- CCD Clocks.
Solving for $C_{\text{out}}$:

$$C_{\text{out}} = \frac{16 \, C_{\text{in}} \, V_{\text{in}}}{V_{\text{out}}} = 16 \times 0.26 \times \left(\frac{V_{\text{in}}}{V_{\text{out}}}\right) = 4.16 \, \text{pF} \left(\frac{V_{\text{in}}}{V_{\text{out}}}\right) .$$

To provide some gain to overcome the attenuation of the following ac-couple/dc-restore circuitry and the output source-follower, a value of 1.4 pF was chosen for the feedback capacitor. Substituting this value into the gain equation yields

$$\text{Gain} = \frac{V_{\text{out}}}{V_{\text{in}}} = \frac{16 \, V_{\text{in}}}{C_{\text{out}}} = \frac{16 \times 0.26 \, \text{pF}}{1.4 \, \text{pF}} = 3 .$$

A model for computer simulation of the DC1 amplifier only is shown in Figure 29. This model was constructed to examine the characteristics of the amplifier. The CCD split electrode has been modeled as an equivalent capacitor, and the on-resistance of the series-pass transistors has been calculated to be 160 $\Omega$, based on the physical size of the MOSFETs and the gate-to-source voltage. A model for calculating the CCD $\phi_2^+$ capacitance is shown in Figure 30. The values of the capacitors for two filters in parallel are:

- Depletion capacitance $C_{\text{dep}} = 4.6 \, \text{pF}$
- Oxide capacitance $C_{\text{OX}} = 41.4 \, \text{pF}$
- First poly-to-moat and metal-to-substrate $C_1 = 5.38 \, \text{pF}$
- Overlap of CCD $\phi_1$ $C_2 = 19.2 \, \text{pF}$
- Overlap to CCD $\phi_1$ $C_3 = 19.2 \, \text{pF}$.

These capacitances can be combined into one equivalent capacitor for gain calculations.
Figure 29. Model for DC1.
Figure 30.- Capacitance Model for DC1 Input.
The predicted open loop characteristics of the DCI amplifier are shown in Figure 31. From this figure the following parameters can be calculated:

<table>
<thead>
<tr>
<th>Parameter</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Crossover frequency</td>
<td>17 MHz</td>
</tr>
<tr>
<td>Feedback gain</td>
<td>1/32</td>
</tr>
<tr>
<td>Gain-bandwidth product</td>
<td>550 MHz</td>
</tr>
<tr>
<td>Phase margin</td>
<td>65°</td>
</tr>
</tbody>
</table>

The predicted closed loop frequency response of the DCI amplifiers is shown in Figure 32, and the -3 dB frequency is seen to be 25 MHz.

2. **Experimental Results**

Experimental results have been obtained for the input amplifiers using the test setup shown in Figure 33. A pulse generator was used as the inputs to the amplifiers, and attenuators were used to obtain low-level signals. Because there are two input amplifiers, the inverting inputs and the noninverting inputs were connected to allow testing of both amplifiers. The input signal and the inputs of the amplifiers were connected through the connection box in a number of ways, depending on the test performed. Because the output signal of the amplifiers has a large dc component, an ac-couple/dc-restore circuit was used. Finally, the output signal was sampled so that a digital voltmeter could be used to record the test results. As can be seen in Figure 33, both the input signals and all the output signals were recorded through the same ac-couple/dc-restore and sample-and-hold circuit.

The results of the amplifier tests are shown in Figures 34 - 36. Figure 34 shows the single-ended transfer characteristics of the amplifier. The single-ended gain is approximately 3, and output is linear up to about 0.4 V input. Figure 35 is a plot of the response of the amplifier to a common mode input signal. The differential mode characteristics are shown in Figure 36.
Figure 31.- DCI Diff-Amp Open Loop Performance.
Figure 32.- Closed Loop Frequency Response of DCI Amplifier.
Figure 33.- Differential Amplifiers Test.
Figure 34.-Single-Ended Transfer Characteristics of Input Differential Amplifier.
Figure 35.-Response of Input Differential Amplifier to Common Mode Input.
Figure 36. Differential Mode Characteristics of Input Amplifier.
Some photographs of the input amplifier in operation are shown in Figures 37 and 38. Figure 37 shows the single-ended output of the amplifier with a ramp input. Figure 38 shows two photographs of the input amplifier. In the top photograph the input and output signals of the amplifier are shown, and again the gain can be measured to be approximately 3. The bottom photograph is better for determining gain and linearity. The horizontal axis has been calibrated so that it corresponds to the differential input voltage. The vertical axis is the single-ended output of the amplifier. By comparing the amplifier output to the ideal straight-line output which has been superimposed, one can see the output becomes very nonlinear above 0.6 V input. The open-loop transient characteristics of the DCI amplifier are shown in Figure 39. The top photograph shows the open-loop dc transfer curve, and the bottom photograph shows the open-loop step response. The rise time is about 1.6 μs.

D. **Multiplying Digital-to-Analog Converters (MDACs)**

Four multiplying D/A converters (MDACs) are on the CZT IC. These perform the pre- and post-multipliers of the CZT algorithm. They also perform multiplication by the reference for the correlation and filtering configurations. Each MDAC output is an analog signal whose value is proportional to the product of an analog input signal and a digital word. For the CZT pre- and post-multiplications, the required chirp signals are stored digitally on chip in a ROM. For multiplication by a reference, the digital signal must be supplied externally.

1. **Design**

Each MDAC consists of an input sample-and-hold circuit, ROM sense and MDAC drive circuitry, a capacitor array and an output buffer. A photograph of two MDACs is shown in Figure 40.

The MDAC minus the sample-and-hold and ROM sense/MDAC circuitry is shown in Figure 41 with W/L and capacitor values indicated. This circuit
Figure 37.- Single-Ended Response of Differential Amplifier to Ramp Input.
Figure 38. Photograph showing the gain and linearity of input amplifiers.
Figure 39.-DC1 Amplifier Open Loop Characteristics.
Figure 40.- Photograph of Dual MDACs Used on CZT IC.
Figure 4.1.- MDAC.
performs a full four-quadrant multiplication using a capacitor divider network. For noninverting operation, a sampled input signal is applied to the \( V^+ \) line, and a reference voltage is applied to the \( V^- \) line. During the initial portion of each clock cycle, the common output node \( (V_{out}) \) of the capacitor array is clamped to \( V_{ref} \) through M150 by \( \phi_D \) (refer to Figure 44). Also, the switching signals \( v_i \) are high, and \( \bar{v}_i \) are low. (The \( v_i \) and \( \bar{v}_i \) waveforms are nonoverlapping complementary waveforms.) This connects the largest capacitor to the signal \( V^+ \) and all other capacitors to the reference \( V^- \) through the switching transistors M205 through M275. Next, \( \phi_D \) goes low, leaving the common node floating. A complementary pulse to \( \phi_D \) is capacitively coupled to the common node by M151 to reduce the level translation of \( V_{out} \) when M150 is turned off. Next, the \( \bar{v}_i \) and \( v_i \) signals corresponding to '1's at the \( i^{th} \) bits, \( b_i \), of the digital word are inverted, causing capacitors \( C_i \) to be switched from one input line to the other through the M216s. The \( \bar{v}_i \) and \( v_i \) corresponding to \( B_i = '0' \) remain unchanged. The result of switching capacitor \( C_i \) from one input line to the other is that the common node voltage, \( V_{out} \), is changed by an amount

\[
\pm (V^+ - V^-)[(C_o/2^i \times b_i)/(2C_o + C_{stray})].
\]

This change has a positive sign for \( i = 1 \) to 7 and a negative sign for \( i = 0 \), since \( b_0 \) is the sign bit. The resulting output, \( V_{out} \), for multiplication by an arbitrary digital word is given by superposition to be

\[
V_{out} = V_{ref} + (V^+ - V^-)(\frac{C_o}{2C_o + C_{stray}}) \times [-b_0 + \sum_{i=1}^{n} b_i 2^{-i}] .
\]

Two's complement coding is used for the digital word.

The output voltage \( V_{out} \) of the MDAC is equal to the product of the signal voltage \( (V^+ - V^-) \) and the digital input word times the attenuation factor \( C_o/(2C_o + C_{stray}) \). For \( C_{stray} = 0 \), the maximum possible gain is \( \frac{1}{2} \). The actual stray capacitance was calculated to be 5 pF, making the gain to the common node \( V_{out} = 10.56/(2 \times 10.56 + 5) = 0.4 \). The capacitor array was formed
by using first polysilicon to second polysilicon capacitance. First poly-
silicon was used as the output node, and the major source of the stray capaci-
tance is from the first polysilicon to the substrate. To control the capaci-
tance ratios in the capacitor divider network, the larger capacitors were
formed as binary multiples of a smaller capacitor. Capacitor $C_4$ is a
51 $\mu$m x 51 $\mu$m (2 mil x 2 mil) poly-poly capacitor with a value of about 0.66 pF.
Capacitor $C_3$ is obtained by tying two 51 $\mu$m x 51 $\mu$m poly-poly capacitors
together so that $C_3 = 2C_4 = 1.32$ pF. The three larger capacitors are formed
by tying 51 $\mu$m x 51 $\mu$m capacitors together ($C_2 = 4C_4$, $C_1 = 8C_4$, $C_0 = 16C_4$).
In this way, the correct capacitor ratios are obtained without maintaining
perfect control of the absolute capacitance. The capacitors $C_5$, $C_6$, and $C_7$
are formed by smaller poly-poly areas, since a small error in their value will
not significantly affect the output accuracy. An attempt was made in layout to
keep all 51 $\mu$m x 51 $\mu$m capacitors identical by placing dummy first poly around
certain parts of the array to maintain identical first poly edges. The $V^+$ and
$V^-$ inputs required for the MDAC are supplied by the buffered sample-and-hold
circuit shown in Figure 42. The $V^+$ and $V^-$ outputs are from identical stages
so that the circuit can be used in either an inverting or a noninverting mode.
For noninverting operation, the signal voltage is applied to the $+$ input, and
a reference voltage corresponding to the dc value of the signal voltage is
applied to the $-$ input. The signal is buffered by the first source-follower
(M101, M102, M103) and is sampled by the input sample-and-hold pulse through
M104. The holding capacitor consists of a 2 pF first polysilicon to second
polysilicon capacitor and the gate capacitance of M107 for a total of about
5.3 pF. The sampled signal is buffered by a second source-follower (M105, M106,
M107) before being applied to the MDAC. This second source-follower was
designed to drive the capacitive load of the MDAC at clock frequencies of up
to 2.5 MHz. This accounts for the large device size of M107. $\phi_1$ is a pulse
approximately complementary to the input sample-and-hold pulse. It is applied
to the gated capacitor M114 to reduce the nonlinear level translation occurring
when M104 is turned off. Dual series current sources are used in the source-
Figure 42.- Buffered Input Sample-and-Hold.
followers to decrease the effect of \( \lambda \) (the channel length modulation parameter) on the gain. The gates of the current sources are fed by the voltage division stage consisting of M108, M109, and M110. The gain of each source-follower is 0.9, making the gain of the entire buffered sample-and-hold circuit 0.81.

The switching signals \( \bar{v}_i \) and \( v_i \) are supplied by the ROM sense and MDAC drive circuitry shown in Figure 43. Each MDAC contains eight of these circuits. \( \phi_B \) and \( \phi_C \) are complementary nonoverlapping clocks. \( \phi_B \) comes up during the first portion of each clock cycle, pulling \( v_i \) up (if it is not already up), and charging the gates of M201 and M204 to the value of the ROM (or external word) output voltage. This voltage will be high if \( b_i = '1' \) and low if \( b_i = '0' \). If \( b_i = '1' \), the gate of M201 will be high and \( v_i \) will fall with \( \phi_B \). Then \( \phi_C \) comes up, pulling \( v_i \) up since the gate of M204 is high. M114 is a gated capacitor that bootstraps the gate voltage of M204 high enough to allow \( v_i \) to reach the full voltage of \( \phi_C \). At the end of the clock cycle, \( v_i \) falls with \( \phi_C \). If \( b_i = '0' \), the gate of M201 will be low, and \( v_i \) remains high when \( \phi_B \) falls. Since the gate of M204 is uncharged, \( \bar{v}_i \) does not come up with \( \phi_C \).

The switching waveforms and typical signal waveforms for a multiplication by 0.5 are illustrated in Figure 44. The digital word is 01000000. The \( v_i \) and \( \bar{v}_i \) pulses switch the bottom plate of capacitor \( C_j \) from the \( V^- \) line to the \( V^+ \) line (Figure 41) to accomplish the multiplication by 0.5. The MDAC output is valid while \( \phi_C \) is up after the signal has settled.

The only dc power consumed by the entire MDAC is in the four source-followers and the voltage divider of the buffered sample-and-hold stage and in the output buffer of the MDAC. The total dc power consumed by these stages in one MDAC is 40 mW.

The total attenuation of the signal from the buffered input to the buffered output is 0.3. This is the result of transmission through three source-followers with gains of 0.9 each, and the MDAC capacitor array with a gain of 0.4.
Figure 43.—ROM Sense and MDAC Drive Circuitry.
Figure 44.-Typical MDAC Operation (Multiplication by 0.5).
2. Experimental Results

Figure 45 is a plot of the transfer curve for each side of the input sample-and-hold of Figure 42. The gain is 0.84, and the nonlinearity is very low (< 0.2% of input span). The input sample-and-hold circuitry functions properly for inputs of 1 to 7 V.

The MDACs were evaluated using the test setup shown in Figure 46. The MDAC was tested with both an external digital signal and chirps stored in internal ROM.

The evaluation of the MDAC using the sine chirp of the ROM is shown in Figure 47 and Table II. Figure 47 shows the response of the MDAC to a dc input and the sine chirp. Table II shows the word number, its binary code, and the analog value of the binary code for the sine chirp. The MDAC output voltages were normalized and compared to the analog representation of the code stored in the ROM.

The errors were found to be not more than ± 0.25 least significant bit (LSB), or one part in 512. Since the ROM is accurate to within ± 0.5 LSB, the worst-case error from the MDAC should be ± 0.75 LSB.

The MDAC was also tested using external digital words, and the relative accuracy is plotted in Figure 48. The differential nonlinearity was measured to be 0.25 LSB. Other parameters measured on the MDACs using external digital words were digital feedthrough, analog feedthrough, analog nonlinearity, and analog differential nonlinearity. The results of these tests are summarized in Table III.

E. Read-Only Memory (ROM) and ROM Address

1. Read-Only Memory

The on-chip ROM that supplies the cosine and sine chirp signals to the MDACs is illustrated in Figure 49. Two ROMs like the one shown are on the
Figure 45.- Transfer Curve for Input Sample-and-Hold Circuit of MDAC.
Figure 46.—Block Diagram of MDAC Test.
Figure 47.- Response of MDAC to dc Input and Sine Chirp of Internal Rom $(v^+ - v^-) < 0$. 
<table>
<thead>
<tr>
<th>Word Number</th>
<th>Binary Code</th>
<th>Analog Value</th>
<th>MDAC Output*</th>
<th>Error †</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>00000000</td>
<td>0.0000</td>
<td>0.0000</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>00001100</td>
<td>0.0945</td>
<td>0.0951</td>
<td>0.077</td>
</tr>
<tr>
<td>2</td>
<td>00110001</td>
<td>0.3858</td>
<td>0.3870</td>
<td>0.154</td>
</tr>
<tr>
<td>3</td>
<td>01100010</td>
<td>0.7717</td>
<td>0.7725</td>
<td>0.103</td>
</tr>
<tr>
<td>4</td>
<td>01111111</td>
<td>1.0000</td>
<td>1.0000</td>
<td>0.000</td>
</tr>
<tr>
<td>5</td>
<td>01010001</td>
<td>0.6378</td>
<td>0.6388</td>
<td>0.128</td>
</tr>
<tr>
<td>6</td>
<td>11001111</td>
<td>-0.3858</td>
<td>-0.3843</td>
<td>0.192</td>
</tr>
<tr>
<td>7</td>
<td>10000010</td>
<td>-0.9921</td>
<td>-0.9933</td>
<td>0.154</td>
</tr>
<tr>
<td>8</td>
<td>00000000</td>
<td>0</td>
<td>0.0009</td>
<td>0.115</td>
</tr>
<tr>
<td>9</td>
<td>01111110</td>
<td>0.9921</td>
<td>0.9920</td>
<td>0.013</td>
</tr>
<tr>
<td>10</td>
<td>11001111</td>
<td>-0.3858</td>
<td>-0.3844</td>
<td>0.179</td>
</tr>
<tr>
<td>11</td>
<td>10101111</td>
<td>-0.6378</td>
<td>-0.6373</td>
<td>0.064</td>
</tr>
<tr>
<td>12</td>
<td>01111111</td>
<td>1.0000</td>
<td>1.0004</td>
<td>0.051</td>
</tr>
<tr>
<td>13</td>
<td>10011110</td>
<td>-0.7717</td>
<td>-0.7715</td>
<td>0.026</td>
</tr>
<tr>
<td>14</td>
<td>00110001</td>
<td>0.3858</td>
<td>0.3872</td>
<td>0.179</td>
</tr>
<tr>
<td>15</td>
<td>11110100</td>
<td>-0.0945</td>
<td>-0.0938</td>
<td>0.090</td>
</tr>
<tr>
<td>16</td>
<td>00000000</td>
<td>0.0000</td>
<td>-0.0002</td>
<td>0.026</td>
</tr>
</tbody>
</table>

*Normalized

†Fraction of LSB
Figure 48.- Relative Accuracy of MDACs.
<table>
<thead>
<tr>
<th>Characteristic</th>
<th>Specification</th>
</tr>
</thead>
<tbody>
<tr>
<td>Analog Nonlinearity (1)</td>
<td>0.2% of input span</td>
</tr>
<tr>
<td>Analog Feedthrough (2)</td>
<td>&lt; 0.5 mV at 100 kHz</td>
</tr>
<tr>
<td>Analog Differential Nonlinearity (3)</td>
<td>± 0.25 LSB</td>
</tr>
<tr>
<td>Digital Feedthrough (4)</td>
<td>± 1.25 mV</td>
</tr>
</tbody>
</table>

(1) Measured by holding digital input constant and varying the analog input.
(2) Variable analog input with digital word programmed for zero gain.
(3) Maximum deviation of any bit size from theoretical value of 1 LSB.
(4) Variable digital word with analog input held at zero.
Figure 49: 8-Bit by 17-Word ROM (Sine Chirp Coded).
IC. One supplies the cos chirp and the other supplies the sine chirp. The outputs $b_i$ are attached to the ROM sense circuit of Figure 43. These outputs can come from an external digital input by holding the external select line high and the internal select line low. To use the internal ROM, the internal select line is held high and the external select line is held low (to reduce capacitive loading of the ROM). There is actually a second set of internal/external select switches and outputs $b_i$ on chip. Each set feeds a pair of MDACs. A photograph of the ROM is shown in Figure 50. It occupies an area of 0.62 mm$^2$ (35.5 x 27 mil$^2$).

The ROM shown consists of 17 eight-bit words. One word consists of a horizontal row of eight possible transistors in Figure 49. A bit is coded as a "0" or a "1" by the presence or absence, respectively, of a transistor. The ROM address lines $a_j$ are fed by a 17-stage up/down ring counter. The ROM is addressed in an up/down manner since the 32-point chirp is symmetrical about the seventeenth point. Each word, $j$, is addressed sequentially by pulling the appropriate address line to a high voltage. All other address lines are held low. $\phi_1$ pulls the output lines $b_i$ high, and when $\phi_1$ falls, the output $b_i$ will fall if there is a transistor in bit $i$ of word $j$. If there is no transistor there, $b_i$ will remain high. For example, if word two (line $a_2$) is addressed in Figure 49, outputs $b_0$ and $b_1$ will be low "0," and outputs $b_2$ and $b_7$ will be high "1". To illustrate the ROM timing, refer to Figure 44.

The appropriate address line, $a_j$, comes up and down with $\phi_0$ so that $a_j$ is high when $\phi_1$ falls. The ROM output is valid only after $\phi_1$ has fallen. The fact that all ROM outputs are high when $\phi_1$ is high insures that M204 of the ROM sense circuit will discharge the gate of M206 of Figure 41 once each cycle so that no excess charge can build up on this node.

A plot of the sine chirp coded in the ROM is shown in Figure 51. This plot can be compared to the photograph in Figure 47.
Figure 50.- Photograph of ROM.
Figure 5L—Analog Representation of Sine Chirp Coded into ROM.
2. **Up/Down Ring Counter (ROM Address)**

The ROM address lines  \(a_0 - a_{17}\) are fed by a 17-stage shift register that has the ability to reverse the shifting direction. A three-stage version of this circuitry is shown in Figure 52. Each basic shift register stage (one bit) is outlined in Figure 52. \(\phi_1\) and \(\phi_2\) are the nonoverlapping two phase clocks input to the IC. The basic shift register consists of two dynamic inverters with transmission gates, and its operation depends on charge storage. The operation of shift register 0 (SRO) of Figure 52 is illustrated in Figure 53. When \(\phi_1\) comes up, the 0.2 pF capacitor on node 2101 is charged to the level of the input to SRO through M2100. Also, the 1 pF capacitor on the first inverter output node 2102 is charged to a high voltage through M2102. When \(\phi_1\) falls, node 2102 will discharge through M2101 if node 2101 is high; otherwise, node 2102 remains high. Thus, node 2102 is the inverse of the input to SRO. Next, \(\phi_2\) comes up, turning on M2103. If node 2102 is high, the charge on the 1 pF capacitor will divide between itself and the 0.2 pF capacitor on node 2103, and the voltage on node 2103 will be \([1 \text{ pF}/(1 \text{ pF} + 0.2 \text{ pF})] v_0\), where \(v_0\) was the voltage on node 2102 before \(\phi_2\) turned M2103 on. If node 2102 is low when \(\phi_2\) comes up, any charge on node 2103 will discharge through M2101, since the gate of this transistor must be high. \(\phi_2\) also charges the 1 pF output capacitor on the second inverter output node 2104 through M2105. When \(\phi_2\) falls, node 2104 will remain high if node 2103 is low or will discharge through M2104 if node 2103 is high. Node 2103 is now isolated from 2102, since M2103 is off, allowing node 2102 to change without affecting the output node 2104. Thus, the voltage at the output of SRO is that of the input delayed by one clock cycle.

To address the ROM properly, the shift register must shift one "1" back and forth across its 17 stages. This requires circuitry to detect the "1" when it reaches each end of the shift register and to reverse the transfer direction. It also requires circuitry to insure that only one stage of the shift register contain a "1," and that the other sixteen stages contain "0"s.
Figure 52. Three-Stage Up/Down Ring Counter.
Figure 53. Typical Shift Register Operation (SRO). Capacitive coupling, time delays, and MOSFET thresholds ignored. Previous SRO inputs have been '0' s.
The shift register must also have circuitry to synchronize it with an external pulse.

The direction of transfer is determined by the state of the static S-R flip-flop FF1. With flip-flop output node 2182 high and node 2183 low, the feed-forward transmission gates M2107, M2106, and M2116 are on, and the feedback transmission gates M2127, M2126, and M2117 are off. The output of each stage of the shift register is electrically connected to the input of the next stage, and transfer is in the forward direction. When the "1" reaches the final stage of the shift register, transistor M2128 will be on and the reset node 2180 will come up and down with $\phi_1$. This causes the flip-flop to change state, and the feed-forward transistors are turned off while the feedback transistors are turned on. Each stage's output is now electrically connected to the previous stage's input. Node 2111 rises to a high voltage, since it is now connected to node 2124. The "1" appears next at the output of the next-to-last stage (SR 1 in this example). "0"s are loaded into the last stage by M2127, which is tied to ground. Transfer continues in the reverse direction until the "1" reaches the output of the first stage. Then when $\phi_1$ comes up, M2108 is on and the set node 2181 comes up and down, causing the flip-flop to change stage again. The feed-forward transistors are now on, and the feedback transistors are off. The "1" next appears at the output of the second stage. "0"s are loaded into the first stage by M2107, which is connected to ground.

To insure that the shift register contains only one "1," two cases must be considered. The first case is that when power is supplied to the circuit, the shift register contains one or more "1"s. In this case, the output of the first stage of the shift register will eventually be high. When this occurs, M2108 will be on, and node 2181 will come up and down with $\phi_1$. This sets the inputs to each stage of the shift register low through M2109 and M2129, except for the second stage, which is set high by M2119. At the end of this clock cycle only the second shift register contains a "1," and transfer is in the forward direction.
The second case is that when power is supplied to the circuit, the shift register contains all "0"s. In this case, a 17-input dynamic NOR circuit sets the first stage to "1." The NOR circuit consists of transistors M2130 - M2133, in Figure 52. Node 2130 is pulled high with \( \phi_1 \). If the input to any stage is high, node 2130 falls with \( \phi_1 \), since at least one of the transistors M2130 - M2132 is on. If the inputs to all stages are low, then all transistors M2130 - M2132 are off, and node 2130 will remain high. This leaves M2134 on. Node 2131 will then come up and down with \( \phi_2 \). This discharges node 2103 through M2135 so that when \( \phi_2 \) falls, the output of the first stage, node 2104, will remain high or "1." After this occurs, the shift register contains a "1," and node 2131 should never come up again. Node 2130, however, will rise and fall with \( \phi_1 \) every clock cycle.

Three transistors, M2138, M2139, and M2191, are added to the circuit of Figure 52 to synchronize the counter with a sync pulse (M2119 is also only necessary for synchronization). The sync pulse is derived from another circuit. When it occurs, it comes up with \( \phi_1 \) and down with the rising edge of \( \phi_2 \). The set node 2181 is pulled up with \( \phi_1 \), since M2138 is on. This resets the inputs of all stages except the second low through M2109 and M2129. The second stage input is set high by M2119. Also, node 2103 is discharged by M2139 so that M2104 will not sink current from node 2111 to ground. The sync pulse is also applied to the gate of M2191 to make sure that the flip-flop comes up in the forward transfer state (node 2182 high). This transistor is necessary because it is possible for the reset pulse to occur when a "1" is in the last stage, in which case the reset line 2180 would also come up and down with \( \phi_1 \). Since the sync pulse remains high after \( \phi_1 \) is low again, the flip-flop will be set in the proper stage. At the end of the clock cycle, the output of the second stage will be "1," and the shift register will be transferring in the forward direction. (M2191 is necessary only to guarantee synchronization after the first sync pulse. This is not required here. If M2191 were removed, synchronization would be guaranteed after the second sync pulse.)
The shift register outputs go to the gates of transistors M2150 - M2152, which allow $\phi_D$ to pull up the ROM address line attached to the transistor whose gate is high. The depletion devices (M2160 - M2162) are attached to the ROM address lines to keep capacitive coupling in the ROM from pulling the other address lines up. The depletion device on the address line that is high will sink 0.6 mA of current from $\phi_D$. The $\phi_D$ driver is able to supply this current with very little drop in output voltage. The output pulse is illustrated in Figure 53. Figure 54 illustrates the counter in normal operation (i.e., one "1" bit, counter synchronized to sync pulse 3004 if it exists). It shows the first and last shift register outputs, the first and last ROM outputs, the relationship to the sync pulse, and the S-R flip-flop inputs and outputs. The flip-flop FF1 consists of a pair of static inverters with bootstrapped loads. M2187 and M2188 keep the gates of M2185 and M2186 from falling below a threshold below $V_{DD}$. M2189 and M2190 are gated capacitors that boot-strap the gates of M2185 or M2186 when the output 2182 or 2183 starts to come up. This turns the load devices on hard, decreasing the rise time and allowing the output voltage to reach $V_{DD}$. The set and reset transistors M2183 and M2182 are in parallel with the driver devices M2184 and M2181.

A photograph of the up/down counter is shown in Figure 55. Its area is approximately 0.75 mm² (40 x 29 mil²).

3. Synchronization

To operate two or more ICs together, the up/down counters must be synchronized to each other. It is required that the counter of an IC be 180° out of phase with the counter it is synchronized to. One cycle of the chirp z-transform algorithm consists of 64 clock cycles. Therefore, when two ICs are operated together, one should be on count 32 ($t_{32}$) when the other is on count 0 ($t_0$).
Figure 54.-Up/Down Counter Waveforms - Normal Operation (Synchronized to 3004).
Figure 55.- Photograph of Up/Down Counter.
In order to obtain this synchronization, each counter must have an external sync pulse output to provide a pulse to the other IC, and a sync pulse input to accept a pulse for synchronization to another IC. The output sync pulse is a square wave. The sync pulse input circuit accepts the square wave output from another IC and converts this to an internal pulse (3004) with duration of less than one clock cycle to reset the counter and external sync pulse generation (refer to Figures 52 and 54). The relationship of the sync pulses for two synchronized chips is illustrated in Figure 56. Also shown is the relationship of the sync pulses to the flip-flop FFl (Figures 52 and 54) output (2182) to the feed-forward transistors in the up/down counter. It can be seen that the up/down counters of the two ICs that have a period of 32 clock cycles are in phase, and the sync pulse output of the second IC is 180° out of phase with that of the first stage. The external sync pulse generator is shown in Figure 57. The circuit uses a set/rest flip-flop (identical to FFl in the up/down counter) and seven more transistors to implement a toggle flip-flop that changes state each time a pulse appears on the set line 2181 to FFl. This occurs once every 32 clock cycles in normal operation. The basic RS flip-flop consists of the unmarked transistors in Figure 57. Figure 58 illustrates the circuit's operation. Assume the output 2282 is low and 2283 is high. Then node 2286 will be low and node 2287 will be high, since \( \phi_2 \) will have turned M2289 and M2290 on and off. Node 2181 will come up and down with \( \phi_1 \) every 32 clock cycles. With the flip-flop in the above state, M2292 is on, and node 2281 will come up and down with the input node 2181. M2291 is off and node 2280 stays low. This causes the flip-flop to change state. The transistor M2293 is on during this process to keep node 2280 from being pulled up by capacitive coupling to the flip-flop. M2294 is off. Now output 2282 is high and 2283 is low. When \( \phi_2 \) comes up, node 2287 discharges through M2290, and node 2286 charges through M2289. The circuit is triggered during synchronization so that the output waveform at 2282 is 180° out of phase with the sync pulse. The duration of the internal sync pulse (3004) is longer than that of 2181, so that sync is assured. The
Figure 56.—Synchronization of Two CCD CZT ICs.
Figure 57.-External Sync Pulse Generator.
Figure 58.- External Sync Pulse Generator Timing — Normal Operation (Synchronized to 3004).
The relationship of the internal sync pulse (3004) to the generated external sync pulse is shown in Figure 58 during normal operation. It is out of phase with node 2282. The actual sync pulse generated for use off-chip comes from a driver that will be discussed later. The inputs to the driver are 2282 and 2283, and the output waveform is that of node 2282 with slightly more rise and fall time delays. This driver is necessary to drive the larger capacitance associated with going off-chip. The output pulse is called T2.

The circuit used to generate the short internal sync pulse from an applied external sync squarewave is called GEN30 and is shown in Figure 59. Figure 60 illustrates its operation. When the external sync input is low, node 3001 remains high after \( \phi_2 \) falls. Node 3002 comes up and down with \( \phi_1 \) every clock cycle, and node 3003 is charged to a threshold below \( V_{DD} \). When the external input comes up (which will be with \( \phi_1 \) plus a small delay), the output node 3004 comes up immediately, since M3003 is on. The gate of M3005 is bootstrapped by M3008 so that 3004 reaches the full potential of the input. Node 3001 falls immediately with the input rising edge, leaving 3002 floating high. When \( \phi_2 \) comes up again, node 3001 comes up, allowing 3002 to discharge through M3003. M3006 and M3007 are turned on, turning off M3005 and pulling the output node 3004 down. The output cannot come up again until M3004 charges node 3003 high. This cannot occur until after the input external pulse falls. In this way, GEN30 generates a pulse with a duration of about \( \frac{1}{2} \) clock cycle each time the external input comes up.

Figure 61 is a photograph of the sync outputs of two CZT ICs operating together. The sync output of one IC is fed into the sync input of the other. The sync output waveforms have a period 64 times that of the 10 kHz input clocks, \( \phi_1 \) and \( \phi_2 \). Note that the two sync outputs are 180° out of phase as in Figure 56.
Figure 59. - GFN 30 (Internal Sync Pulse).
Figure 60—GEN 30 Timing.
Figure 61.- Synchronization Outputs of Two CZT ICs Operating Together.
F. Timing Circuitry

1. Clock Generators

The derivation of the internal clocking pulses from the two-phase inputs $\phi_1$ and $\phi_2$ is illustrated in Figure 62. The 13 generators shown there use one or more of the circuits described below.

The first circuit will be referred to as Gl and is illustrated in Figure 63. This circuit configuration is used to produce most of the clock pulses. It requires a pullup pulse (node 4000) and a nonoverlapping pulldown pulse (node 4001). Assume that a pulldown pulse has occurred and the output (4006) and all internal nodes are low except for node 4003, which is high. When the pullup pulse occurs at node 4000, nodes 4002 and 4004 closely track the input. Nodes 4005 and 4006 are slightly delayed because 4003 is still high, keeping M4006 and M4008 on. Since 4005 is held down and 4004 rises quickly, the gated capacitor M4010 is charged to a level of $V_{DD} - V_T$. When 4002 has risen high enough to turn M4002 on, pulling node 4003 down, M4006 and M4008 are turned off. At this point nodes 4005 and 4006 begin rising rapidly, and node 4004 is bootstrapped by M4010 to a level 2 ($V_{DD} - V_T$). This increases the rise times and allows the output node 4006 to reach $V_{DD}$. The time delays for the actual circuits used are typically 20 ns. The risetimes are typically 20 ns to 30 ns. When the pulldown pulse occurs at node 4001, node 4002 is pulled down, allowing 4003 to be pulled up while node 4004 is pulled down. Nodes 4005 and 4006 are then pulled down. The time delay before node 4006 starts down is typically 10 ns, and the fall time is typically 10 ns to 15 ns. The device sizes required for the Gl generator to produce 20 ns risetimes depend on the capacitive load it must drive. Typical designs set the W of M4009 in mils equal to the capacitive load in pF and the W of M4008 was set to 1.5 times that of M4009. The values of the other transistors depend on internal node capacitance ratios. Therefore, each Gl driver required a custom design.
Figure 62.- Block Diagram of Clock Generators.
Figure 63. GI Clock Pulse Generator.
A special version of the G1 driver is required to allow the CCD electrodes to float. This driver, G2, is shown in Figure 64. The difference between G2 and G1 is that the gate of M4004 on G2 is tied to a third pulse, which comes up when it is desired to float the outputs. The G2 driver also has a second pair of output transistors so that one generator drives two of the four CCD outputs. A second G2 generator drives the other two. The $V_{DD}$ line to the second G2 generator can be trimmed to adjust for offsets in the DCI's.

The generator G3 shown in Figure 65 is a bootstrapped inverter circuit. The basic inverter consists of M4101 and M4102. M4103 keeps the gate of M4102 from falling below a threshold below $V_{DD}$. M4104 is a gated capacitor which bootstraps the gate of M4102 to a high voltage when the output node 4102 comes up. This speeds up the risetime by turning M4102 on hard. The generator G4 is an inverter whose pullup transistor is not bootstrapped. It is shown in Figure 66.

Figure 67 shows the generator G5. G5 produces a short pulse at node 4204 from a longer trigger pulse at node 4200. Before the trigger pulse occurs, node 4002 is pulled up by a set pulse on node 4201. When 4200 comes up, the output 4204 comes up immediately, since M4200 is on. The gate of M4200 is bootstrapped by gate-drain and gate-source capacitance so that 4204 reaches the same potential as 4200. While the trigger pulse is up, a pulldown pulse occurs on node 4203, turning off M4200 and pulling the output down.

Figure 68 shows the simple G6 generator. A pullup pulse and a pulldown pulse are required. The output is designed to have a maximum voltage of $V_{DD} - V_T$.

The connections of the generators G1 - G6 used to produce the internal timing pulses is shown in Figure 69. The approximate capacitive load of each
Figure 64. - CCD $\frac{\pi}{2}$ Clock Pulse Generator.
Figure 6. 63 Inverter.
Figure 66.- G4 Inverter.
Figure 67: G5 Generator.
Figure 68.- G6 Generator.
Figure 69(a) - Connections of the Clock Generators Showing Capacitive Load of Each Driver.
Figure 69(b). Connections of the Clock Generators Showing Capacitive Load of Each Driver.

11 = \phi_1, 12 = \phi_2
driver as calculated from the bar layout is indicated. The node numbers correspond to the names given each set of generators and to the SPICE modeling performed. Many clock waveforms required series connections of G1 drivers to generate propagation delays critical to the chip timing. Figure 70 shows the resulting internal timing pulses as predicted by the computer simulation program, SPICE, for a clock period of 400 ns (2.5 MHz operation).

2. Experimental Results

Photographs of some of the actual clock waveforms are shown in Figure 71 through 73. These pulses were monitored on-chip with an external probe. Figure 71 shows the leading edge of \( \phi_B \) and its pullup pulse \( \phi_1 \). The \( \phi_B \) leading edge is delayed from the \( \phi_1 \) leading edge through two G1 drivers, as seen in Figure 69(a). The actual delay as seen in Figure 71(a) is about 60 ns. The risetime is 20 ns. The delay predicted by computer simulation was 55 ns, and the predicted risetime was 20 ns, as shown in Figure 70. Figure 71(b) shows the trailing edge of \( \phi_B \) and its pulldown pulse \( \phi_2 \). No delay is added to the \( \phi_B \) trailing edge. The actual trailing edge delay is 30 ns, and the fall time is 15 ns. The predicted delay was 10 ns, and the predicted fall time was 15 ns, as shown in Figure 70. These results show close agreement between the computer simulations and the actual generator outputs. The \( \phi_B \) and \( \phi_C \) waveforms were designed to be complementary and nonoverlapping. Figure 72 illustrates the success of the design of these clocks. \( \phi_C \) is generated using the same dual G1 driver configuration as \( \phi_B \) with the pullup and pulldown pulses interchanged. Note that \( \phi_B \) and \( \phi_C \) are never high at the same time, and their leading and trailing edges do not overlap each other. Figure 72 may be compared to the computer-simulated timing of Figure 70.

A photograph of two of the larger clock generators (CCD \( \phi^\pm \)) is shown in Figure 73. These are G2 drivers as in Figure 64. It measures 1.75 x 0.4 mm\(^2\) (69 x 16 mil\(^2\)).
Figure 70.—System Timing.
Figure 71.-Clock Circuit Performance.
Figure 72.- Relationship of $\phi_B$ and $\phi_C$. 
G. Experimental Results

Evaluation of the CZT IC has been confined to obtaining power density spectra of real inputs using the system shown in Figure 74. This system requires only two MOS level clocks, voltage bias for the various circuits, and the squaring function. Figures 75 through 79 are photographs showing the operation of the CZT IC. Figure 75 shows the operation of the IC with 10 kHz clocks and no postmultiplication. The upper photograph shows the response of the MDACs to a dc input and the internal ROM. The bottom photograph shows the response of the IC to a 1.8 kHz sinusoid. The output shows the envelope of the real and imaginary outputs, since the input is not synchronized to the chip clocks. Similar operation of the IC is shown in Figure 76 with 1 MHz clocks. The input signal is a 440 kHz sinusoid with a dc offset. Figure 76 also illustrates a problem with the tapweights of the filters. The CCD tapweights were erroneously shifted one bit, causing the 32nd bit to appear as the first bit, while the first bit shifts to the second bit, and so on. This will affect only spectral power density analysis in that the output sequence is rotated. It will also affect function implementations requiring postmultiplications. To perform postmultiplications, a second chip will be required whose timing is delayed by one clock period. This error can easily be corrected with photomask changes.

Figure 77 shows the power density spectra for three different sinusoidal inputs with a dc offset. The output is observed to be delayed as the input frequency increases from 625 Hz to 4.0 kHz. Similar operation is shown in Figure 81 with 1 MHz clocks and the input varying from 125 kHz to 375 kHz. The apparent sidelobes are caused by the limited bandwidth and inaccuracies of the squaring devices. The response of the CCD CZT IC to a 312.5 Hz and a 625 Hz square wave input is shown in Figure 79. The output can be seen to decrease as $1/n^2$ (where $n$ is the harmonic index).
Figure 74.- A System for Obtaining Power Density Spectrum of a Real Input Using One CZT IC (50% Input/Output Duty Cycle).
(a) Response of MDACs to dc Input and Internal ROM

(b) Response of IC to 1.8 kHz Sinusoid

Figure 75.-Operation of CCD CZT IC with 10 kHz Clocks.
(a) Response of MDACs to dc Input and Internal ROM.

(b) Response of IC to 440 kHz Sinusoid.

Figure 76.-Operation of CCD CZT IC with 1 MHz clocks.
Figure 77.- Power Density Spectra for Three Sine Waves Obtained Using the 32-Point CCD CZT at a 10 kHz Sample Rate.
Figure 78 - Power Density Spectra for Three Sine Waves Obtained Using the 32 Point CCD CZT at 1 MHz Sample Rate.
Figure 79.- Response of CCD CZT IC to Square-Wave Inputs (10 kHz Clock Frequency).
H. Applications

The most straightforward application of the 32-point CCD CZT IC is to obtain power density spectra using the block diagram of Figure 74.

With a few external components, a complex input/output CZT can be performed with one chip as shown in Figure 80. The external components are necessary to disable the inputs when a postmultiply operation is needed for the true CZT. Because of this blanking, the system in Figure 80 has only a 50% input/output duty cycle. To obtain a 100% input/output duty cycle CZT, two chips are needed as shown in Figure 81. This configuration does not require the external circuitry of Figure 80.

The realization of a 16-point correlator using two CZT ICs is shown in Figure 82. This correlator convolves the real input signal with an impulse response $h(t)$. True correlation is obtained, since the pulse $T_1$ can blank the CCD input when required.

Another application (not shown) is the realization of a 16-point programmable transversal filter. This realization would require eight CZT ICs arranged as four correlators of the type shown in Figure 82. $T_1$ and $T_2$ will provide synchronization between the chips. These correlators would have the inputs tied together and their outputs summed.

These are only a few of the many applications the general-purpose CZT IC can perform, but they demonstrate its tremendous computational power.
Figure 80.-Realization of Complex I/O CZT with One Chip (50% Duty Cycle).
Figure 81. Realization of Complex I/O CZT Using Two CZT ICs (100% I/O Duty Cycle).
Figure 82.- Realization of Correlator Using Two CZT ICs (Real 25% Duty Cycle Input, Real 50% Duty Cycle Output).
SECTION III
64-POINT CCD ANALOG/ANALOG CORRELATOR

A. Design

The convolver is organized according to the block diagram of Figure 83. Two 64-stage CCDs shift charge in opposite directions. At each stage non-destructive outputs are obtained using floating gate amplifiers. Signals are buffered, and corresponding outputs from each stage are passed to analog multipliers. The outputs of the multipliers are summed to complete the function. Feedback linearization from the first buffer output is provided to remove circuit nonlinearities.

Parallel outputs are obtained using floating gates, as in Figure 84. A floating first poly electrode is placed underneath the phase-two clock. The floating gate is capacitively coupled to the CCD channel potential. The output voltage is attenuated approximately 50% due to parasitic capacitances, mostly capacitance to the phase-two clock. The floating gates are periodically reset through small series FETs to a reference.

The analog multiplier is shown in Figure 85, together with the required buffers. Each buffer is biased by a FET current source. The multiplier itself is a two-transistor circuit, which eliminates the effects of drain resistance nonlinearity, as shown below. The current in each of the multiplier FETs as a function of the terminal voltages is:

\[ I_{d2} = B(V_{gs2} - V_t - \frac{1}{2}V_{ds})V_{ds} + f(V_{ds}) \]
\[ I_{d1} = B(V_{gs1} - V_t - \frac{1}{2}V_{ds})V_{ds} + f(V_{ds}) \]  

(15)

where \( f(V_{ds}) \) is the next higher order approximation for the current, which takes into account the transistor threshold voltage variation with substrate voltage. It is only a function of \( V_d \) and \( V_s \). Subtracting the two currents we obtain:
Figure 83.-Analog Correlator Block Diagram.
Figure 84.-Floating Gate Reset Technique.
Figure 85: Analog Multiplier.
\[ I_{d2} - I_{d1} = B(V_{gs2} - V_{gs1}) V_{ds} \]  

since the drain and source voltages are identical for the two devices. Note that the drain nonlinearities, as well as the higher order terms, have cancelled. The residual nonlinearities that remain arise from variation in FET transconductance with gate voltage and from buffer output impedance.

A feedback linearization circuit has been incorporated into the input stage to control the dc voltage levels and eliminate nonlinearities in the CCD input to buffer output transfer function. It is shown in Figure 86. The input stage is identical to the other CCD stages and has its own floating gate output and buffer. Charge is input to the CCD stage through a FET, where the input signal is on the source (the input diode), and the feedback signal is on the gate (the feedback gate). As the charge fills up the well in the CCD input stage, the floating gate senses it, and the voltage level of the feedback gate drops until it is a threshold above the voltage on its source. Thus, the relative input and buffer output voltage levels depend only on the characteristics of the input FET, and not on all the other adjustments required to make the device operate, such as the floating gate reference voltage.

Note that the charge is metered into the input well in exactly the right amount to provide a linear output from the buffer, and the buffer nonlinearity caused by body effect and static loading becomes unimportant.

B. Performance Limitations in Design

1. Multiplier Nonlinearity

There are two main sources of multiplier nonlinearity. The first is due to the variation in FET transconductance with gate voltage. It comes from carrier velocity saturation effects to a small degree, but mostly from the variation in surface mobility as a function of surface concentration. That
effect is not generally modelled on SPICE simulation programs accurately, and in any case, such simulation was not available to the designer when the device was made. The only guidelines available were to attempt to keep the FET gate voltages relative to threshold fairly small, such as 3 or 4 V. Distortion from this effect was estimated at a fraction of a percent for 1 V signals.

The second major source of distortion came from the fact that the multiplier FETs did not have a voltage source on their drains, but a FET buffer with a finite output impedance. This resistance allowed the output voltage to be modulated by the loading. This problem was reduced by the feedback linearization circuitry. The buffer of the input stage was loaded with two FETs, both with their gates at the reference, or zero voltage level. The effects of this "average" load were therefore eliminated. However, the buffer output voltage could still be modulated by changes in the loading that occurred whenever a signal was presented to the gate of the multiplier FET. This would cause an attenuated gate signal to appear at the drain. Whereas the desired output current is of the form:

\[
I_o = B_m V_g V_d
\]

\[
V_g = V_{gs2} - V_{gs1}
\]  

When \( V_d \) is modulated by \( V_g \), \( V_d \) becomes

\[
V_d = V_d (1 - B_m Z_b V_g)
\]  

and the output becomes

\[
I_o = B_m V_g V_d (1 - B_m Z_b V_g)
\]

\[
= B_m (V_g V_d - B_m Z_b V_g^2 V_d)
\]

(17)

(18)

(19)
The amount of quadratic distortion is

\[ D = B_m Z_b = \frac{A B_m}{B_b V_{gst(b)}} \]  

(20)

where \( A \) is a factor that accounts for the increase in FET output impedance due to the capacitive source impedance at the gate:

\[ A = \frac{C_{\text{source}} + C_{gs}}{C_{\text{source}}} \]  

(21)

The amount of distortion can be decreased by making the buffers wide (low impedance) and the multiplier FETs high impedance. The dimensions used in the design were 7.2 mils wide x 0.3 long for the buffers versus 0.5 W x 2 mils long for the resistor FETs. The B-ratio was 96, resulting in calculated distortion of about a percent at 1 V signal levels with 500 \( \mu \)A buffer bias current.

2. Offsets

A severe performance limitation comes from the variations in offset voltages of the 64 buffers on each CCD. This adds a random offset term to each term of each multiplication, limiting filter coefficient definition, and thus filter stopband suppression. It was estimated that the offsets would be about 30 mV rms, or about 30 dB below the 1 V signals. Assuming the offsets to be totally uncorrelated, the processing gain of 8 (square root of N) would drop the sidelobe levels to -48 dB.

However, it was not expected that the offsets would be completely uncorrelated, though a design technique was used that helped that to occur. The buffer current sources were geometrically identical to the buffers themselves, so offsets in the current sources would tend to cancel those of the buffers. Since they were within 0.4 mil of each other, reasonable correlation was expected between those two offsets. This correlation will reduce the magnitude of offsets and decrease the correlation between the offsets of adjacent buffers.
3. **Input Stage Offsets and Nonlinearities**

There is still a residual nonlinearity left in the transfer function of the feedback linearized input stage. Since the output equilibrates at a threshold above the input, the nonlinearities in threshold voltage versus input level would be added to the signal. These were expected to be, however, on the order of only a few tenths of a percent.

Further problems were expected from the input stage in terms of defining the absolute dc level of the output. The threshold voltage shift would have to be compensated at the input to bring the output to the desired level. This level would then have the temperature dependence of a FET threshold voltage. In this type of input stage the output voltage approaches the correct level asymptotically as $1/T$. Assuming that it will never really get there, there will be a frequency dependent offset at the buffer outputs.

C. **Measured Performance**

1. **CCD Signal Capacity**

The output voltage swing available at the floating gate buffer outputs was found to be significantly smaller than the design specified. Although $2 \, V_{PP}$ had been desired, only about $0.8 \, V_{PP}$ was generally measured. Several factors were responsible for this severe problem. First, the CCD had been designed for $5 \, V$ implanted wells, but the measured change in threshold voltage was only $3.2 \, V$. This would translate to about $2.7 \, V$ change in the CCD surface potential, or about 85% of the threshold change.

The interlevel (between first and second poly) oxide thickness was reduced between the time the device was designed and the time it was processed. This increased the parasitic loading on the floating gate and decreased the signal gain. At the calculated gain of 50%, the floating-gate output would be $1.36 \, V_{PP}$. 
The charge capacity of the phase-one well was smaller than the phase-two well that contained the floating gate, partly because the reduced interlevel oxide thickness increased the charge capacity of the phase-two well and partly because of design error. The charge capacity of the phase-one well was only about 80% of that of the well with the floating gate. That 20% reduction in output voltage swing would reduce the available voltage to 1.09 V.

Finally, with a buffer gain of 0.8, the buffer output swing would be 0.87 V_{pp}, which is in agreement with experiment.

This limitation can be overcome in a future design, since it was imposed mostly by the two-phase structure and by the inappropriate sandwiching of the floating gate under another gate, which produced much attenuation of the signal.

In general, the signal levels used were on the order of +/- 1/3 V. The measured dynamic range, limited by signal size on the top end and noise on the bottom end, was 75 dB, using external BIFET opamp output amplifiers. In a future design with 1 V signal levels at each multiplier input, the dynamic range would be increased by 20 dB.

2. Feedback Linearized Input Stage

The feedback linearized input stage performed as designed. Figure 87(a) shows a triangle wave at the CCD input with the identical triangle wave offset by a threshold at the feedback gate output. The feedback gate waveform is a sampled data signal, and the reset portion of each cycle is evident in the photo. Note that this signal is about 1.9 V_{pp}, much larger than the CCD is capable of handling. This signal is permitted at the input stage by overfilling the well. The barely visible intermediate voltage waveform in the photo shows the output voltage after charge transfer when the excess charge is left in the input stage, since all of it could not be transferred out.
Figure 87(a).- CCD Input Stage Feedback—Linearized Output.

Figure 87(b).- CCD Serial Output vs Input.
The linearity of the transfer function from the CCD input diode to the serial output buffer, which should have the same waveforms as the feedback gate, was measured using a spectrum analyzer. The particular chip used had a smaller signal capacity than observed on chips from other slices. Output voltage signal swing was $0.6 \, \text{V}_{pp}$ for CCD1 (which drove the drain of the multiplier FETs) and $0.7 \, \text{V}_{pp}$ for CCD2. The spectrum analyzer outputs showed good linearity until the limits of signal swing were reached. At that point the distortion increased due to CTE loss in the case of nearly empty wells, or due to a combination of CTE loss and CCD flooding in the case of overfull or nearly full wells.

Figures 88(a) and 88(b) show CCD1 output linearity with a $4 \, \text{V}_{pp}$ and $0.5 \, \text{V}_{pp}$ signal swings, respectively. Distortion in the two cases was 0.2% and 0.28%. Figures 89(a) and 89(b) show CCD2 output linearity with $0.4 \, \text{V}_{pp}$ and $0.55 \, \text{V}_{pp}$ signal swings. Distortion in the two cases was 0.27% and 1.5%.

It is believed that performance of the input stage circuit was good, but signal swings were small enough to prohibit truly demanding tests. It is also believed that the circuit had enough problems in terms of defining dc levels, frequency dependent offsets, and the possibility of instability in some applications that its duplication in other circuits is not recommended. Other similar circuits will be discussed later.

3. **Multiplier Linearity**

The multipliers were tested using the circuit of Figure 90. The CCDs were disabled, and the buffer FETs were driven through the floating gate reset
Figure 88(a).-CCDI Serial Output Linearity 0.4 $V_{pp}$ Signal.

Figure 88(b).-CCDI Serial Output Linearity 0.5 $V_{pp}$ Signal.
Figure 89(a).- CCD2 Serial Output Linearity 0.4 V pp Signal.

Figure 89(b).- CCD2 Serial Output Linearity 0.55 V pp Signal.
Figure 90. - Multiplier Linearity Test Circuit.
FETs from the floating gate reference. This circuit was placed inside the feedback loop of an opamp, which sampled the serial output of the CCD, to accomplish the feedback linearized transfer function. All multipliers were tested in parallel. Note that this arrangement allowed the multiplier to be tested with larger signals than the CCD would deliver to this circuit.

The multipliers were tested using a spectrum analyzer. The signal on the CCD1 buffers (at the multiplier FET drains) was at 2 kHz, while the CCD2 buffers were given a 10 kHz sine wave. Figures 91(a) and 91(b) show the spectral output with 0.6 V \text{pp} and 2 V \text{pp} sine wave inputs on both terminals. In each case, 2 kHz and 10 kHz feedthroughs due to offsets can be seen. The largest distortion terms are at 18 and 22 kHz from the buffer output impedance effects. Distortion products at 6 and 14 kHz are due to a buffer distortion effect.

At 0.3 V peak the measured multiplier distortion was about 0.3%, while at 1 V peak it measured about 1%. At the 1 V signal levels the multiplier reference FET gate voltage was increased to avoid putting the multiplier FETs in the saturation region of operation with the larger signals.

4. Offsets

The performance of the unit in terms of offset amplitudes was surprisingly good. Offsets of 30 mV \text{pp} across the entire chip were measured for CCD1. Initially, offsets were measured directly in the time domain. 300 mV impulses were sent through both CCDs, giving a correlation peak equal to full scale for a single multiplier. This is shown in Figure 92(a), where the correlation peak has an amplitude of 1 V out of the current differencing amplifiers. This waveform also shows the offsets both impulses measure as they shift along the chip.
Figure 91(a). Multiplier Output $V_{in} = \pm 0.3 \, V_{pk}$. 

Figure 91(b). Multiplier Output $V_{in} = \pm 1 \, V_{pk}$. 
Figure 92(a).- Convolution of Two 300 mV Impulses (CCD2 Offsets).

Figure 92(b).- Convolution of Impulse with CCD Offsets (CCD1 Offsets).
With impulses sent through only one of the CCDs the measured result would be an indication of the offsets of the opposite CCD tap. Figure 92(b) shows on a 2X expanded scale the results of those measurements. The top trace shows the offsets of CCD2. The offsets are within 60 mV_{pp}, except for the two endpoints, which are offset outputs from the dummy multipliers at each end of the convolver. The bottom trace shows the offsets of CCD1. They are within 30 mV_{pp}.

The offset performance of CCD1 is better than that of CCD2 for two reasons. First, the CCD1 output buffers must drive the drains of the multiplier FETs and therefore have a rather large geometry. The output buffers of CCD2 must only drive FET multiplier gates and therefore were physically smaller by a factor of 4. Since offset variations would be expected to average out over area, the offsets of these smaller buffers should be twice as large as those of CCD1.

The second reason that the offsets of the CCD2 terms were larger is that included in those offset terms is any threshold difference between the two multiplier FETs. While midway in area between the two buffer types, the corresponding multiplier FET pairs were not as physically close together, which would cause threshold variations to have a smaller degree of correlation between the two.

Therefore it is logical that CCD1 should have the better offset performance.

Knowing the absolute limits on the offset voltages is not enough to describe them adequately, however. Even with a knowledge of the rms value of the offsets, it is not known how they would affect the performance of a filter. For example, until it is known what the correlation is between the different offsets. If it can be assumed that the offsets are uncorrelated, then their spectrum is white, and there is a processing gain of the square root of N in summing them together.
For this reason it is advantageous to make frequency domain measurements of the offset performance. Such measurements will also give results that have a much higher signal-to-noise ratio, since the offsets of all the multipliers are being combined.

The offsets of CCD1 were measured by clocking zeroes through it, while running CCD2 from the spectrum analyzer tracking generator. Full scale can also be defined by running a large amplitude dc level through CCD1, thus implementing a square wave impulse response, or a sin(x)/x frequency response. Figure 93(a) shows the resulting outputs with full scale defined as a 0.3 V square box response in CCD1. The CCD clocking frequency is 25 kHz. The top trace shows the sin(x)/x frequency response, while the bottom trace shows the response of the offsets alone. Except below 1 kHz, the response is 50 dB below full scale. The increased amplitude at low frequencies is due to very slow variations in offset across the length of the chip. Figure 93(b) shows the same response out to 10 kHz, and Figure 94(a) shows the same result out to 25 kHz. The offset output is at a level of -50 dB out to the Nyquist frequency. Note that in Figure 94(a), there are strong offset components at the Nyquist frequency. This is due to the buffers being a mirror-image design: alternate buffers are mirror images of each other, so that the two power supply buses can be shared. This mirror-image design should therefore be avoided in future implementations.

Figure 94(b) shows the same measurements applied to determining the offset performance of CCD2. Offsets are about 10 dB worse than for CCD1. Figures 95(a) and 95(b) show CCD2 output for full scale and offset output, respectively, out to the sampling frequency.
Figure 93(a).- CCD1 Frequency Domain Offset Measurements (Low Pass Output to 2.5 kHz).

Figure 93(b).- CCD1 Low Pass and Offset Error Outputs to 10-kHz (Offset Error Output).
Figure 94(a). - CCD1 Low Pass and Offset Error Outputs to 2.5 kHz.

Figure 94(b). - CCD2 Low Pass and Offset Error Outputs to 2.5 kHz.
Figure 95(a).- CCD2 Low Pass Output to 25 kHz.

Figure 95(b).- CCD2 Offset Error Output to 25 kHz.
5. **Multiplier Scale Factor Matching**

Another measure of performance of the multiplier is the degree of matching of multiplier scale factors across the length of the chip. This matching was found to be within about plus or minus 1% over the entire chip. It was measured by sending an impulse through CCD1, with either plus or minus full scale in CCD2. The difference between these outputs is the multiplier scale factor. It was found that the scale factor changed very slowly, almost linearly across the chip, as if it were due to a very slow change in the oxide thickness in the multiplier FETs. Since the scale factor variations were spatially very slow, the test impulse was made four samples wide to increase the S/N ratio. Figure 96(a) shows the convolver output when the test impulse was convolved with either a plus or minus dc level. The up and down variations of the two outputs are due to offsets, while the difference in slopes is due to scale factor variations. Figure 96(b) shows the same type of measurement with the dc output levels subtracted out. The two outputs should now be superposed. The difference in slopes is seen to be very small.

6. **Evaluation**

There were several aspects of performance of this device that were interesting and important because they had not been tried before. The CCD feedback linearized input stage performed as expected and represents an important portion of the system. It also has the potential for use in a modified form in other CCD structures.

The CCD floating gate output performed as designed, although in the future it is recommended that alternate layouts be used that do not attenuate the CCD signal as much. This attenuation, as well as other design and processing problems, limited the signal swings to about 0.8 V<sub>pp</sub>, which was the worst limitation of the device.
Figure 96(a).- Multiplier Scale Factor Accuracy; Convolution of CCD1 (Impulse of Four Samples) with CCD2 (Full Scale Dc).

Figure 96(b).- Multiplier Scale Factor Accuracy; Multiplier Scale Factor Error.
The analog multiplier performed as expected, having about 1% distortion level with 1 V signals. The buffer and multiplier offset variations were significantly better than expected, and gave perhaps the most important find from this circuit: that it is possible to achieve low offset variations in this type of correlator, with sidelobe levels typically 60 dB below 1 V signals in a 64-stage device.

Some sample outputs of the convolver in operation are shown in Figures 97(a) and 97(b). Figure 97(a) shows the convolution of two triangle waves, whose periods are very long compared to the 64 stages of convolution. The parabolic outputs are essentially the multiplication of the two triangle square waves. Again, the period of the signals is long compared to the convolution length, with the result that most of the time the output is multiplying a dc level by itself, yielding a positive result. At the square wave transitions the transitions approach the center of the convolver from opposite sides; and the result becomes the multiplication of two out-of-phase square waves, yielding a negative result.

D. Directions for Future Designs

1. System Organization

It is not felt that there are any advantages at this time to the organization wherein both signals are sent through CCDs. More preferable would be an organization where the impulse response is loaded onto an array of MOS capacitors, which would then drive the multiplier FET drain buffers. This has the advantage that a signal can be fed back after the buffer to the sample-and-hold circuitry, so that the offset term in the impulse response can be eliminated. It would also permit longer storage times of the impulse response.
Figure 97(a).-64-Point Convolution of Two Low-Frequency Triangle Waves.

Top and Middle:
CCD1 and CCD2,
0.5 V/Div

Bottom:
Correlator Out,
2 V/Div
5 ms/Div

Figure 97(b).-64-Point Convolution of Two Low-Frequency Square Waves.

Top and Middle:
CCD1 and CCD2,
Out, 0.5 V/Div

Bottom:
Correlator Out,
1 V/Div
2. **CCD Design**

   The design of the CCD will undoubtedly be quite different in future devices. The movement at TI to four-phase structures will change the ground rules for design radically. The floating gate will probably become one of the four phases, guarded on one side from moving clocks by a shield gate. The use of four-phase CCDs will result in larger signal capacity, eliminating the small signal problem of this device. In any design it will be better not to cover the floating gate with another clock electrode.

3. **Input Stage Design**

   It is believed that the use of a linearized input stage is mandatory. The type of circuit used here performed adequately, but it would be desirable to use a circuit that had no dc level shift from the input to the output, such as a potential equilibration CCD input with feedback.

   Care should be taken in the design of the input stage with respect to the loading on the output buffer. In this design a dummy stage was added with the gates of both multiplier FETs tied to the reference. This will introduce signals into the output summing bus unless the thresholds of the dummy multiplier FETs are matched exactly. They will not be matched, so the FETs should have their outputs connected to a different output ground.

4. **Buffer Design**

   To control offsets, it is recommended that large geometry buffers be used. This is, of course, necessary in the case of the buffers that drive the drains of the multiplier FETs, but should be done also with the other buffers.

   The use of mirror-image geometries from stage-to-stage in the layout of the buffers is discouraged because it generates offset components at the Nyquist frequency. The buffers and their current sources should be of matching geometries and as physically close as possible, so that their offsets will match, which will tend to make them cancel.
5. **Multipliers**

Several multiplier topologies are being investigated by various groups. They range from a single-transistor, time multiplexed arrangement to a four transistor bridge, which has higher linearity. Characteristics of each of these circuits are desirable, such as low offsets and high linearity, and the different possibilities should be investigated.

E. Applications and Summary

1. **Applications**

It is believed at this time that one of the stronger developing applications for analog/analog correlators (AAC) is adaptive filtering. It is possible that the needed circuitry for updating tap weights could be placed on the chip, resulting in a device that needed very little external support and could operate with high accuracy and dynamic range. The adaptive filtering algorithm would eliminate the effects of multiplier offset and scale factor errors and automatically update the tap weights. In this application an AAC would be much simpler to use than an analog/binary correlator, making maximum use of the strengths of the AAC (monotonicity, dynamic range), while eliminating effects from its main drawback -- offsets.

While adaptive filtering may be a strong single application, there are so many others that the analog/analog correlator should be considered a general-purpose part.

2. **Summary**

A 64-stage analog/analog correlator has been designed, built, and evaluated. Performance was found to be surprisingly good in spite of a major defect that reduced signal voltages by 10 dB from the design goals. The defect was in the design of the CCD floating gate outputs and limited signal swings to about 0.7 V pp. Even with those limitations, dynamic range was measured at
75 dB, which means 95 dB will be achieved with the proper signal swings. When operated as a \( \sin(x)/x \) lowpass filter, sidelobes due to multiplier offsets were 50 dB down from the in-band output. The -60 dB error sidelobes are therefore expected with a corrected design. Multiplier errors were found to be less than 1% at the designed signal levels and less than 0.3% at the measured smaller levels. The correlator was only operated at low speeds (100 kHz max), where speed was entirely limited by the peripheral electronics.
SECTION IV
CCD BINARY/ANALOG CORRELATOR

A. Introduction

Classification of multispectral image data is a complex data processing function for satellites that is presently performed on the ground. The goal is to perform this pattern classification in the satellite to reduce the data transmission rate required for the down link.

CCD binary/analog correlators provide a hardware implementation of the matrix multiplication function which is the kernel of the pattern classification algorithm. At present, the CCD binary/analog correlator appears to offer the optimum implementation of the matrix multiplication function. The advantages of the binary/analog correlator are given below.

- High accuracy. Since the weighting coefficients are represented in binary form, accuracy of up to 10 bits can be implemented.

- Arbitrarily long reference retention time. Since the reference is stored digitally in static shift registers, it does not need to be refreshed as the analog/analog correlator does.

- The binary/analog correlator is amenable to general-purpose IC design such that a single design can be used in a variety of system applications.

The problem is to classify vectors \( X \)

\[
X = \begin{bmatrix}
X_1 \\
X_2 \\
\vdots \\
X_N
\end{bmatrix}
\]

into classes \( w_j \). \( X \) is an \( N \)-element column vector, and \( N \) is the number of different wavelength sensors. The value \( X_k \) represents the amplitude of the signal in the \( k^{th} \) sensor.
A Bayes optimal pattern classifier is one which classifies patterns \( X \) into classes \( w_j, j = 1, m \) in such a way as to maximize the discriminant

\[
g_i(X) = - \sum_{j \neq i} p(X|w_j) p(w_j)/p(X) .
\]  

(23)

In the above equation

- \( p(X|w_j) \) is the conditional probability density function which gives the probability of occurrence of the pattern \( X \), given that \( X \) is in fact from the class \( w_j \);
- \( p(w_j) \) is the a priori probability of \( w_j \);
- \( p(X) \) is the a priori probability of \( X \).

Maximizing the above discriminant \( g_i(X) \) is equivalent to minimizing the expected loss \( L_X(i) = -g_i(X) \), which is given by

\[
L_X(i) = \sum_{j \neq i} p(w_j|X) .
\]  

(24)

In other words, \( L_X(i) \) is the sum of the probabilities of classifying \( X \) in the incorrect class \( (j \neq i) \). Equation (23) is obtained by the use of Bayes' rule

\[
p(X, w_j) = p(X|w_j) p(w_j) = p(w_j|X) p(X) ,
\]  

(25)

from which

\[
p(w_j|X) = p(X|w_j) p(w_j)/p(X) .
\]  

(26)

Since \( p(X) \) is not a function of \( i \), maximizing \( g_i(X) \) is equivalent to maximizing

\[
g'_i(X) = - \sum_{j \neq i} p(X|w_j) p(w_j) ,
\]  

(27)
which, in turn, can be written as

\[ g'_i(x) = p(x|\omega_i) p(\omega_i) - p(x) . \]  

Equation (29) is maximum if, and only if,

\[ g''_i = p(x|\omega_j) p(\omega_j) \]

is maximum. Thus, the decision rule is: \( x \in \omega_j \) if, and only if,

\[ p(x|\omega_i) p(\omega_i) \geq p(x|\omega_j) p(\omega_j) \quad j \neq i . \]

This is commonly referred to as the maximum likelihood decision rule.

A quadratic classifier utilizes a Gaussian probability distribution function. In one dimension this takes the form

\[ p(x|\omega_i) = (2\pi \sigma_i^2)^{-\frac{1}{2}} \exp\left[- \frac{(x - \mu_i)^2}{2\sigma_i^2}\right] . \]

Following Equation (29), it is desirable to maximize the discriminant

\[ g''_i(x) = \frac{p(\omega_i)}{\sqrt{2\pi} \sigma_i} \exp\left[- \frac{(x - \mu_i)^2}{2\sigma_i^2}\right] . \]

Maximizing \( g''_i(x) \) is equivalent to maximizing

\[ g''_i(x) = \log g''_i(x) = \log p(\omega_i) - \frac{1}{2} \log 2\pi \]

\[ - \log \sigma_i - \frac{(x - \mu_i)^2}{2\sigma_i^2} . \]
Since the constant term in $-\frac{1}{2} \log 2\pi$ appears in all of the $g_i''(x)$, it can be dropped, yielding the discriminant

$$g_i''(x) = \log p(w_i) - \log \sigma_i - \frac{(x - \mu_i)^2}{2\sigma_i^2}. \quad (34)$$

Returning now to N-dimensional space and assuming that all classes are equally likely on an a priori basis, the discriminant reduces to

$$g_i(x) = -\frac{1}{2} \log |\Sigma_i| - \frac{1}{2}(x - U_i)^T \Sigma_i^{-1} (x - U_i), \quad (35)$$

where

$$\Sigma_i = \begin{bmatrix}
\sigma_{i11} & \sigma_{i12} & \cdots & \sigma_{i1N} \\
\sigma_{i21} & \sigma_{i22} & \cdots & \sigma_{i2N} \\
& \ddots & \ddots & \ddots \\
\sigma_{iN1} & \sigma_{iN2} & \cdots & \sigma_{iNN}
\end{bmatrix} \quad (36)$$

and

$$U_i = \begin{bmatrix}
\mu_i^1 \\
\mu_i^2 \\
\vdots \\
\mu_i^N
\end{bmatrix} \quad (37)$$

The simplified discussion given above indicates the importance of matrix multiplication to the pattern classification algorithm. In particular, computation of the form $X^T CX$ forms the kernel of the pattern classification algorithm. This computation can be implemented with binary/analog correlators.
Section IV.B discusses the binary/analog correlator technology and presents results obtained with a 32-stage x 4 bit test device.

Section IV.C discusses the application of the binary/analog correlator to the computation of $X^TCX$.

Section IV.D discusses the preliminary design of an IC to implement a classifier having 16 sensors with 8-bit accuracy.

Section IV.E discusses the design and implementation of a 32-stage by 8-bit binary/analog correlator unit suitable to implement a classifier having $M = 8$ bits and $N = 4$ sensors using two of the existing 32-stage by 4-bit devices.

B. Binary/Analog Programmable CCD Correlator Technology

The concept of the binary/analog correlator is illustrated in Figure 98. The weighting coefficients ($h_0 h_1 \ldots h_{N-1}$ of Figure 1) are made electronically programmable by decomposing each one into a binary representation. $h_n$ is represented with $M$-bit accuracy by

$$h_n = \sum_{k=0}^{M-1} h_n^k 2^{-k}.$$  (38)

For $M$-bit accuracy, $M$ parallel CCD binary/analog correlators are put on a chip as shown in Figure 98. The most significant bit $h_n^0$ of each coefficient is loaded into the static shift register (coefficient store) shown as elongated rectangles in the filter at the top of the figure. The second most significant bit $h_n^1$ of each coefficient is loaded into the second coefficient store, and the least significant bit $h_n^{M-1}$ is loaded into the coefficient store shown at the bottom of the figure. The analog input signal to be filtered is applied without attenuation to the top filter (most significant bit). The input is attenuated by a factor of two at the input of the second filter (second most significant bit), and it is attenuated by a factor of $2^{M-1}$ at the input to the bottom filter (least significant bit). This attenuation is performed using capacitive ratio techniques similar to those employed in MOS MDAC technology.
Figure 98.- Block Diagram of the Binary/Analog Correlator.
The coefficients stored in the static shift registers then control the weighting of the transversal filters, and when the outputs of each filter are summed together as shown, the result is

\begin{align}
H(z) &= \frac{V_{\text{out}}(z)}{V_{\text{in}}(z)} = \sum_{n=0}^{N-1} h_n z^{-n} + 2^{-1} \sum_{n=0}^{N-1} h_{n+1} z^{-n} + \ldots + 2^{-(M-1)} \sum_{n=0}^{N-1} h_{n+M-1} z^{-n} \\
&= \sum_{n=0}^{N-1} \left[ \sum_{k=0}^{M-1} \left( h_n 2^{-k} \right) \right] z^{-n} \\
&= \sum_{n=0}^{N-1} h_n z^{-n}.
\end{align}

In the programmable CCD correlator the relative timing of charge transfer is selectively programmed. The structure is sketched in Figure 99. It consists of a conventional serial $4\phi$ CCD shift register in which phase-and-a-half clocking is employed. All of the $\phi_4$ electrodes are connected to a common output bus line, which is held by the output circuitry to an intermediate potential, i.e., between the OFF potential (0 V) and the ON potential (15 V). The $\phi_2$ electrodes are individually clocked by a program stored in a digital shift register that is physically parallel to the CCD channel. The $\phi_1$ electrodes are used as barriers at the time the charge is sensed and are clocked with a common bus line. The $\phi_2$ electrodes are biased at a constant potential slightly above 0 V by a common bus line. The operation is detailed below.

Assume that at some time in the clock cycle $t_1$ (see Figure 99) all the signal charge packets in the CCD reside under $\phi_2$ electrodes that are all in the ON state. At a later time $t_2$ some of the $\phi_2$ electrodes are selectively turned OFF (those corresponding to program bits that are equal to 1's). Immediately after $t_2$, the signal charge packets will transfer from the OFF $\phi_2$ electrodes through the $\phi_3$ electrodes to the corresponding successive $\phi_4$ electrodes which are maintained at an intermediate potential between the ON
Figure 99.- Schematic of the Operation of the Programmable CCD Correlator.
\( \phi_1 \) level and the OFF \( \phi_1 \) level. The transferred charges will be sensed and summed by the output integrator. Later in the cycle at \( t_3 \), the remaining \( \phi_2 \) clocks are returned to ground potential and the charge packets that did not transfer at \( t_2 \) will transfer to \( \phi_4 \) electrodes. At time \( t_0 \) the \( \phi_1 \) barrier electrodes and \( \phi_2 \) programmed electrodes turn ON, transferring the charges from the \( \phi_4 \) sense electrodes. At time \( t_0 \) the \( \phi_1 \) electrodes turn OFF, leaving the charges under the \( \phi_2 \) electrodes and the cycle is repeated. The \( \phi_3 \) electrodes serve as a buffer to prevent the transition of the \( \phi_2 \) electrodes at time \( t_2 \) from capacitively coupling to the integrator inputs (\( \phi_4 \)). Charge is never stored under the \( \phi_3 \) electrodes. In the time interval between \( t_2 \) and \( t_3 \) the output signal is available and is given by

\[
Q_{\text{out}}(n) = \sum_{i=1}^{N} h_i \times q_s(n - i). \tag{42}
\]

As described thus far, the device required unipolar signals. To allow signal of either sign and to allow the CCD to operate with a fat zero charge for better charge transfer efficiency, it is desirable to operate two structures like the one just described in parallel in a differential mode. A fat zero charge \( Q_{fZ} \) is inserted into each size of the differential pair, and a signal charge \( Q_{\text{sig}} \) is added to one side and subtracted from the other side. The output charge will be given by

\[
Q_{\text{out}}(n) = \sum_{i=1}^{N} [h_i \times (Q_{fZ} + Q_{\text{sig}})] - [h_i \times (Q_{fZ} - Q_{\text{sig}})] \tag{43}
\]

\[
Q_{\text{out}}(n) = \sum_{i=1}^{N} 2h_i Q_{\text{sig}}. \tag{44}
\]
For some applications a single-bit coefficient is adequate, but for applications requiring coefficients with multiple-bit accuracy, several differential channels may be binary weighted and combined as illustrated in Figure 100. This figure illustrates a configuration for correlating N analog samples with N digital words, each of which is represented by M bits. In this scheme the weighting is applied to the analog signals at the input of the CCD shift registers. The summing of the M parallel channels is accomplished simply by a parallel connection of the $\phi_4^+$ clock buses and of the $\phi_4^-$ clock buses. Thus, only one output integrator is needed.

The $\phi_2$ electrode clocking will be accomplished by the same shift register that stores the digital code. The circuit diagram for one stage of the shift register is shown in Figure 101. This shift register operates with three non-overlapping clocks as shown. Another signal R is used to control whether the shift register is connected serially for loading or whether each bit is continuously recirculated within the same stage in a storage mode.

The device illustrated in Figure 102 has been built to demonstrate the binary/analog concept discussed above. The device has 32 programmable taps, each having 4-bit precision. The experimental accuracy of the taps, illustrated in Figure 103, is ±1% consistent with $\frac{1}{8}$ LSB of 8-bit taps. The experimental impulse response shown in Figure 103 has a digital code -1 -7/8 -6/8 ... +7/8 in order to estimate weighting coefficient accuracy.

Figure 104 illustrates operation of the test device as a bandpass filter. Figure 104(a) is a photo of the impulse response and Figure 104(b) is of the frequency response. The center frequency is $\frac{1}{4}$ the clock frequency ($f_c = 5$ kHz), and the impulse response is weighted with a Hamming window quantized to 4-bit accuracy. The sidelobes in Figure 104(b) are within 1 dB of the sidelobes predicted by a computer simulation of this filter with 4-bit tap weight quantization. Figure 105 shows the frequency response of the binary/analog correlator as a bandpass filter with rectangular weighting operating at a 500 kHz
Figure 100. Schematic of a Programmable CCD Correlator with M-bit Accuracy on the Weighting Coefficients.
Figure 101.-Shift Register and Clock Generator for CCD Correlator.
Figure 102.—Photograph of a 32-Stage Binary/Analog Correlator with Four-Bit Weighting Coefficients. This IC has eight CCD filters (two for each bit) and static shift registers to load and store 32 four-bit words. Two of these ICs can be operated in parallel to achieve eight-bit weighting coefficients. Circuitry is included on-chip to facilitate microprocessor control.
IMPULS RESPONSE

Figure 103.-Experimental Impulse Response of 32-Stage Binary/Analog Convolver.

CODE:

\[ h_1 = h_{17} = -1 \]
\[ h_7 = h_{18} = -7/8 \]
\[ h_9 = h_{19} = -6/8 \]
\[ h_{10} = h_{20} = -5/8 \]
\[ h_{32} = h_{32} = +7/8 \]

Each tap can be programmed to any arbitrary 4-bit code.
(a) Impulse Response. (b) Frequency Response.

Figure 104.-Response of the Test Device Operated as a Bandpass Filter.
Figure 105.- Frequency Response of the Binary/Analog Correlator Programmed to Give a Bandpass Filter with Passband at 125 kHz (Clock Frequency is 500 kHz). A rectangular window function is used. The four bits used to code the first five weighting coefficients are shown. Note 2's-complement arithmetic is used to code the negative values ($h_3$) with $b_0$ the sign bit.

<table>
<thead>
<tr>
<th>$b_0$</th>
<th>$b_1$</th>
<th>$b_2$</th>
<th>$b_3$</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
</tbody>
</table>

etc.
clock frequency. Again, the sidelobes are within 1 dB of the computer-simulated response. The maximum clock frequency here is limited by the slew rate of the external integrating amplifier.

Figure 106 illustrates the use of the correlator for convolving p-n sequence codes. The 13-point Barker code impulse response is used. The 3 code words used for the p-n weighting are +7/8, 0, and -7/8. Shown are the correlator input, the filter impulse response, and the correlator output. The output is either 0 or +1 until the correlation peak occurs, at which time the output is +13.

C. Application of Binary/Analog Correlator to Pattern Classification

In a pattern classification system the following operation is needed:

$$X^T C X$$  \hspace{1cm} (45)$$

where $X$ is the column matrix of sensor outputs and $C = A^T A$ is a feature matrix. Let $Y = AX$ and $Y^T = X^T A^T$. To evaluate Equation (45) it is sufficient to compute

$$Y^T Y = \sum_{i=1}^{N} (y_i)^2$$  \hspace{1cm} (46)$$

where

$$y_i = \sum_{j=1}^{N} a_{ij} x_j$$  \hspace{1cm} (47)$$

The computation of $y_i$ can be accomplished using a binary/analog correlator described in Section IV.B.

Figure 107 illustrates the use of the binary/analog correlator structure in a system for the computation of $Y^T Y$. The system requires $N$-stages by $M$-bit binary/analog correlators. $N$ correlators are required for parallel computation
Figure 106.- Matched Filter Operation of the Test Device Using a 13-Bit Barker Code.
Figure 107.- Block Diagram for Computing $Y_T Y$. 

N-N Stage X M Bit Binary/Analog Correlators

$X \rightarrow a_{1j} \rightarrow (\cdot)^2 \rightarrow a_{2j} \rightarrow (\cdot)^2 \rightarrow a_{3j} \rightarrow (\cdot)^2 \rightarrow \ldots \rightarrow (\cdot)^2 \rightarrow \Sigma \rightarrow Y_T Y$
of $y_i$'s. The binary code stored in the $i^{th}$ correlator is the $i^{th}$ column of the $A$ matrix. The output of each correlator is valid after $N$ clock cycles, at which time $X$ has been completely loaded into each correlator. The correlator outputs are squared and then summed together to obtain $Y^TY$.

D. Preliminary Design of Matrix Multiplier IC

In the desired system, the number of different colored sensors is $N = 16$, and $M = 8$ bits of accuracy are thought to be required.

In integrating the system of Figure 107 for 16 sensors, size limitations make it desirable to reduce the number of correlators on one chip to four and multiplex four chips together. The number of squaring circuits can be reduced to one by sampling the correlator outputs after $N$ clock cycles and multiplexing these samples off chip. A block diagram of such a chip is shown in Figure 108. A preliminary layout of the IC is shown in Figure 109. This IC contains filters for four columns of the $16 \times 16$ feature matrix. The feature coefficients are loaded through eight reference inputs, one input for each bit of the reference. The reference inputs are TTL levels. A five bit TTL level address (four bits for row plus address enable) signal is needed to select the matrix column. An input load pulse is needed to transfer the input data to the addressed matrix column. The input signal attenuation needed for each column is realized using capacitive ratio techniques. The differential current integrator (DCI) consists of an MOS amplifier configured as an integrator. The sample-and-hold circuitry and multiplexer are realized using standard MOS techniques. All clocks are derived on-chip from the two-phase master clock. The specification on clock rate is 500 kHz (1 MHz master clock). However, the design goal will be 2 MHz (4 MHz master clock). Provisions are included for asynchronous operation wherein the data can be loaded into the correlators at high speed (greater than 2 MHz), and then the clock can be stopped to provide sufficient time ($\sim 2$ $\mu$s) to read out the matrix product with high accuracy. Synchronizing signals for both inputs
16-Stage, 8-Bit
Binary/Analog Correlator

Figure 108.- Block Diagram of Binary/Analog Correlator IC.
Figure 109.- Preliminary Design of Four-Column by 16-Stage by Eight-Bit CCD Matrix Multiplier IC.
and outputs are provided to allow interfacing to external equipment. Also, there is a provision for clearing the stored feature coefficients to zero using a digital input control signal. TTL/MOS buffers are provided for all digital inputs except the master clock. Sharp transition edges are required, and the TTL/MOS buffers required to provide these sharp edges at high speed would consume too much power to include on-chip. Dynamic range (maximum output signal/rms output noise) is expected to be 60 dB after the squaring operation.

Figure 110 illustrates the system configuration. Since this system uses parallel processing, an output is available from the accumulator every 16 clock cycles. The multiplexer is clocking the 16 correlator samples off chip to the squarer and accumulator while the correlators are being loaded with new sensor data.

The matrix multiplier system of Figure 108 must be duplicated for each feature. The pattern classification decision is then made based on the feature matrix multiplier that gives the largest result.

E. Four-Sensor Demonstration Unit

Using the 32-stage by 4-bit binary/analog correlator test devices described in Section IV.B, a 32-stage by 8-bit binary/analog correlator was constructed. This unit is suitable to implement a classifier having four sensors and 8-bit resolution. The unit is illustrated in Figure 111. To avoid using eight chips in this system, parallel processing is not used. All 16 coefficients of the 4 x 4 feature matrix are stored sequentially in the correlator. Two chips were required for eight-bit resolution. The system is operated by clocking sequences of 32 pulses through the CCDs consisting of the four-term sensor vector and 28 "zeros." The accumulator samples every fourth clock cycle. The correlator output sequence is \(y_1y_2y_3y_4\) for feature 1 and \(y_1y_2y_3y_4\) for feature 2. A squarer and accumulator are required to sum \(y_i^2\) for \(i = 1\) to 4 for each feature. The output will be available once every 16 clock cycles.
Figure 110.- System Block Diagram for Pattern Classifier.
Figure 111.- Block Diagram of Pattern Classifier Using 32 x 2-Bit Binary/Analog Correlator.
Figure 112 illustrates operation of the correlator with eight-bit accuracy. Under microprocessor control the digital code is ramped from all +127/128's to all -1's in twos-complement form. The output of the correlator (top trace) was subtracted from a 10-bit D/A converter with the same input code. The difference (bottom trace) was ±20 mV from zero error. Since the binary/analog output has a 10 V swing, this represents an accuracy of ±2/5 LSB at eight-bits resolution.
Binary/Analog Output
(10 V Swing)

Difference Between
0/4 Converter and
Binary/Analog Outputs (±20 mV Error)

Figure 112.- Illustration of 8-Bit Accuracy of
The Binary/Analog Correlator.
SECTION V
CONCLUSIONS

The objective of this program was to develop different CCD techniques for performing programmable correlation for preprocessing environmental sensor data preparatory to its transmission to the ground. Two separate ICs were developed under this program and a third IC was evaluated.

The first IC was a CCD chirp z-transform IC capable of performing a 32-point DFT at frequencies to 1 MHz. This IC, suitable for frequency domain correlation, contained MOS MDACs, CCD transversal filters, ROM, MOS amplifiers, and all needed control circuitry.

All on-chip circuitry operated as designed with the exception of the limited dynamic range caused by a fixed pattern noise due to interactions between the digital and analog circuits. Also, the tap weights of the CCD filters were erroneously shifted one bit, affecting spectral power density analysis. Both problems can easily be corrected with a minor redesign.

The second IC developed under this program was a 64-stage CCD analog/analog correlator for performing time domain correlation. Dynamic range was measured at 75 dB and potential for 95 dB dynamic range exists. Multiplier errors were found to be less than 1% at designed signal levels and less than 0.3% at the measured smaller levels. The main drawback of the analog/analog correlator is multiplier offsets, but applications exist that eliminate these effects and take advantage of the analog/analog correlator's monotonicity and dynamic range.

During the course of this program, a prototype IC for performing time domain correlation was evaluated. This IC offers digital programmability and appears to offer the optimum implementation of the matrix multiplication function needed in the pattern classification algorithm. The prototype IC was 32 stages long by 4 bits wide and operated at 500 kHz clock rate. The output dynamic range was limited to 48 dB by input signal feedthrough, but random output noise was measured.
to be 70 dB below the maximum output signal. The feedthrough was caused by coupling on chip and can be eliminated in future designs to be 70 dB below the maximum output signal.

It is our strong recommendation that a binary/analog correlator unit be developed suitable to implement a classifier having 8-bit accuracy and 16 sensors. We also recommend that the applications of the CZT IC to image processing problems (two dimensional processing of imagery) be explored.

In summary, the results of Contract No. NAS1-14290 were excellent. The first fully integrated CZT IC was designed and evaluated. An analog/analog correlator was designed and evaluated and an alternate approach to time domain correlation, the binary/analog correlator, was evaluated. A great deal was learned under the contract that will enable us to implement, in hardware, a pattern classifier suitable for operation in a satellite.
REFERENCES


The goal of Contract No. NAS1-14290 was to develop different charged-coupled devices (CCDs) for performing programmable correlation for preprocessing environmental sensor data preparatory to its transmission to the ground. Two separate ICs were developed under this program, and a third IC was evaluated. This final report documents the results of this contract.

The first IC was a CCD chirp z-transform IC capable of performing a 32-point DFT at frequencies to 1 MHz. This IC, suitable for frequency domain correlation, contained MOS MDACs, CCD transversal filters, ROM, MOS amplifiers, and all needed control circuitry. All on-chip circuitry operated as designed with the exception of the limited dynamic range caused by a fixed pattern noise due to interactions between the digital and analog circuits.

The second IC developed under this program was a 64-stage CCD analog/analog correlator for performing time domain correlation. Multiplier errors were found to be less than 1% at designed signal levels and less than 0.3% at the measured smaller levels. The main drawback of the analog/analog correlator is multiplier offsets, but applications exist that eliminate these effects and take advantage of the analog/analog correlator's monotonicity and dynamic range.

During the course of this program, a prototype IC for performing time domain correlation was evaluated. This IC offers digital programmability and appears to offer the optimum implementation of the matrix multiplication function needed in the pattern classification algorithm. The prototype IC was 32 stages long by 4 bits wide and operated at 500 kHz clock rate.