## Measuring the Gap between FPGAs and ASICs

Ian Kuon, Jonathan Rose

Year of Publication: 2006 Area: Architecture

Everyone understands that programmability has a cost. This paper is one of the most cited papers in this collection because it quantifies the cost of programmability. The abstract claims the core area for an FPGA is, on average, a surprising 40 times higher than a standard cell ASIC and is motivational to all work improving FPGA architectures and structured ASICs.

Prior to this paper, most comparisons were anecdotal characterization of small circuits and tended to only compare FPGAs with mask-programmable gate arrays, suggesting only a 10× area penalty. However, by 2006, ASIC CAD had improved and synthesized standard cell designs were the more common choice for ASIC implementations.

In defense of the FPGA architects of the world, the highly-cited 40× result is exaggerated, because it considers only core area, and is obtained by considering designs with both logic and arithmetic in an FPGA architecture lacking hardened multipliers. This paper breaks down the benchmark suite into four classes based on whether they contain arithmetic or memory in addition to unstructured logic and registers. In the class containing logic and arithmetic, the FPGA architecture that includes hardened multipliers has an area ratio of 28 versus the ASIC. Perhaps the more enduring contribution of this paper is the demonstration of the correlation between benchmark results and FPGA architecture features like memories and DSPs. The contribution of hardened components to optimizing cost and performance cannot be ignored. In modern FPGAs, the decisions about what components to harden and how are as important as the traditional FPGA architecture questions like LUT size and interconnect topology.

Benchmarking papers like this are always controversial because they either make imperfect comparisons between different quantities or they use abstractions that make the comparisons more equivalent but less meaningful. This paper does an exemplary job of making the comparisons, and describing in detail exactly how those comparisons are made to allow the readers to form their own conclusions from the results.

Herman Schmit

| Table 2: Area Ratio (FPGA/ASIC) |           |            |            |                  |
|---------------------------------|-----------|------------|------------|------------------|
| Name                            | Logic     | Logic<br>& | Logic<br>& | Logic,<br>Memory |
|                                 | Only      | DSP        | Memory     | & DSP            |
| booth                           | 33        |            |            |                  |
| rs_encoder                      | 36        |            |            |                  |
| cordic18                        | <b>26</b> |            |            |                  |
| cordic8                         | 29        |            |            |                  |
| des_area                        | 43        |            |            |                  |
| des_perf                        | 23        |            |            |                  |
| fir_restruct                    | 34        |            |            |                  |
| mac1                            | 50        |            |            |                  |
| aes192                          | 49        |            |            |                  |
| fir3                            | 45        | 20         |            |                  |
| diffeq                          | 44        | 13         |            |                  |
| diffeq2                         | 43        | 15         |            |                  |
| molecular                       | 55        | 45         |            |                  |
| $rs_decoder1$                   | 55        | 61         |            |                  |
| rs_decoder2                     | 48        | 43         |            |                  |
| $\operatorname{atm}$            |           |            | 93         |                  |
| aes                             |           |            | 27         |                  |
| aes_inv                         |           |            | 21         |                  |
| ethernet                        |           |            | 34         |                  |
| serialproc                      |           |            | 42         |                  |
| fir24                           |           |            |            | 9.8              |
| pipe5proc                       |           |            |            | 25               |
| raytracer                       |           |            |            | 36               |
| Geomean                         | 40        | 28         | 37         | 21               |