| |
This white paper compares the performance of DSP applications in Altera FPGAs with popular DSP processors as well as competitive FPGA offerings. With higher performance, you can easily time-division-multiplex your DSP design to increase the number of processing channels, reducing the overall cost of your system. Table 1 shows the performance advantages Altera offers over other silicon solutions for DSP systems. Table 1. Altera DSP Performance Advantage
Figure 1 compares design performance in Altera Stratix II and Cyclone II devices to Xilinx Virtex-4 and Spartan-3 devices, respectively. Figure 1. DSP Proprietary IP & Open Core Results Comparison The Stratix II devices achieved an f MAX of over 350 MHz in 9 of the 17 designs, and two FIR designs exceeded 400 MHz. In comparison, only 2 of the 17 designs in Virtex-4 devices operated above 350 MHz. The Cyclone II devices achieved an f MAX of over 200 MHz in 9 of the 17 designs, and one FIR design exceeded 300 MHz. None of the 17 designs in Spartan-3 devices operated above 200 MHz. Performance Comparison Metrics There are many ways to compare the performance of different DSP solutions, and each provides a different level of accuracy. The following are three ways to compare DSP performance.
The performance comparisons in this white paper use DSP IP benchmarks and application level benchmarks. The DSP IP performance data is based on both open and proprietary IP cores comparing Altera’s Stratix II and Cyclone II FPGAs with Xilinx’s Virtex-4 and Spartan-3 devices, respectively. The application level benchmark data is based on real DSP systems for comparison of Altera’s first generation Stratix FPGAs against popular DSP processors. BDTI Benchmarks - FPGA vs. DSP Processor Berkeley Design Technology Inc (BDTI) is the leading provider of independent DSP benchmarks and publishes periodic analysis, FPGAs for DSP, comparing the FPGA performance vs. common DSP processors. The latest benchmark based on an orthogonal frequency division multiplexing (OFDM) system shows that Altera’s first generation Stratix FPGAs provide over 95% cost reduction per channel compared to other DSP processor. (See Table 2). Table 2. BDTI Benchmark Results on OFDM System Comparing Stratix FPGAs & Other DSP Processors.
Note to Table 2: OFDM Receiver System Information The benchmarked OFDM receiver system uses algorithms ranging from table look-ups to MAC-intensive transforms. The data sizes ranges from 4 to 16 bits while the data rate ranges from 40 to 320 Mbps. Data includes real and complex values. See Figure 2. Figure 2. OFDM System Block Diagram Input and output precision is 8-bit. This FIR filter in this design is a 127-tap complex FIR with real coefficients and the FFT is a 256-point complex FFT with input and output in natural order. The Slicer is a QAM-256 demapper. Soft decision Viterbi Decoder is used in this design. For even higher performance, based on the benchmark results using real customer designs, Altera’s Stratix II FPGAs offer an average of 50% higher performance than Stratix FPGAs. See the Stratix II Performance & Logic Efficiency Analysis White Paper for more details. FPGA vs. FPGA DSP IP performance benchmarks compare both high-performance, high-density FPGAs and low-cost FPGAs.
The DSP IP performance benchmark uses Altera and Xilinx proprietary IP cores and open cores from www.opencores.org. Benchmarking Methodology & Setup Benchmarking an FPGA performance is a very complex task. A poor benchmarking process can provide inconclusive and incorrect results. Altera has invested significantly to develop a rigorous and scientific benchmarking methodology that is endorsed by industry experts as a meaningful and accurate way to measure FPGA performance. For detailed benchmarking methodology, refer to the FPGA Performance Benchmarking Methodology White Paper. Table 3 shows the benchmark setup. Table 3. Benchmark Setup
Notes to Table 3: Proprietary IP & Open Core Designs Proprietary IP cores are cores generated from Altera’s MegaWizard and Xilinx’s CORE Generator tools. For proprietary IP core comparison, Altera used three types of common DSP IP cores with a total of nine designs:
These IP cores are generated from each FPGA vendor’s tool and benchmarked without further manual optimization. For open core comparison, Altera selected and benchmarked six different DSP-related open IP cores from www.opencores.org. Cores are chosen if its popularity statistics on this web site is greater than 10%. In addition, the complex FFT core is chosen because it is commonly found in DSP designs. The selected open cores are written in generic HDL code except for the use of FPGA-specific primitives in original designs, such as instantiations of embedded memory blocks and multipliers. To allow the compilation of such designs for different FPGAs and to provide a fair comparison, FPGA-specific primitives in each design are converted to use the embedded features of a specific FPGA to achieve the best performance. After FPGA-specific primitives are converted, the open cores are benchmarked without futher manual optimization to keep them as close as possible to their original state. More information for both the proprietary IP and open cores is available in the appendix. High-Performance FPGA Proprietary IP & Open Core Comparison For high-performance and high-density FPGAs, Altera’s Stratix II family offers up-to 1.8x higher performance, and an average of 1.2x higher performance, than Xilinx Virtex-4 FPGAs. See Figure 3 for relative performance comparison and Table 4 for detailed performance data for Stratix II and Virtex-4 families. Modern FPGAs embed dedicated multipliers to increase the speed of multiply-accumulate operations that are essential for many DSP designs. However, the best system performance relies on more than raw multiplier speed. It is critical to couple these multipliers with a complementary logic structure and routing fabrics of the same performance. The Stratix II family seamlessly integrates DSP blocks that operate at up to 450 MHz with high-performance adaptive logic modules (ALMs) and routing fabric to offer the highest system performance for your DSP designs. As shown in Figure 1, The Stratix II device family operated at over 350 MHz in 9 of the 17 designs, and two FIR designs exceeded 400 MHz. In comparison, only 2 of the 17 designs in Virtex-4 devices exceeded 350 MHz, well under the performance claimed in the Virtex-4 data sheet. This shows that high system performance can only be achieved by having an intelligent combination of embedded features and fabrics. Figure 3. Stratix II vs. Virtex-4 Proprietary IP & Open Core Relative Performance Comparison Table 4. Detailed Stratix II vs. Virtex-4 DSP Proprietary IP & Open Core Benchmark Data
Low-Cost FPGA Proprietary IP & Open Core Comparison Altera’s low-cost Cyclone II FPGAs offer up to 2x higher performance, and an average of 1.5x higher performance, than the Xilinx Spartan-3 family. Based on the benchmarked data, the Cyclone II device family operated at over 200 MHz in 9 of the 17 designs, and one FIR design exceeded 300 MHz. None of the 17 designs in Spartan-3 devices operated above 200 MHz. In addition, Cyclone II FPGAs outperform Spartan-3 devices in all designs benchmarked. This performance advantage can directly translate to higher channel count or lower cost for typical designs. Figure 4 shows the relative performance comparison between Cyclone II and Spartan-3 FPGAs. Table 5 shows detailed performance data for Cyclone II and Spartan-3 FPGAs. Figure 4. Cyclone II vs. Spartan-3 Proprietary DSP IP Core Relative Performance Comparison Table 5. Detailed Cyclone II vs. Spartan-3 DSP Proprietary IP & Open Core Benchmark Data
Note to Table 5: Conclusion Based on the benchmarking results from BDTI as well as Altera’s rigorous benchmarking methodology, Stratix II and Cyclone II FPGAs provide a performance advantage over both popular DSP processors and the competing FPGAs. High system performance for DSP applications cannot be achieved by simply embedding dedicated multipliers – it is an aggregate result of high-performance multipliers and performance-matching logic structure and routing architecture as implemented in Stratix II FPGAs. In addition, Altera’s Quartus II development software and DSP Builder provide a simple way to access the DSP performance in Stratix II and Cyclone II FPGAs without time-consuming manual optimization.
Higher DSP performance directly translates to cost savings in typical designs by increasing time-division-multiplexing and, therefore, increasing the total number of processing channels available in your system. Altera offers a comprehensive DSP solution consisting of a complete integrated software environment, performance-optimized devices, DSP intellectual property (IP) cores, development kits, reference designs, and customer training. For more information, visit www.altera.com/dsp.
Appendix
References
September 15, 2005 Comments on this article? Send them to comments@fpgajournal.com
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
All
material on this site copyright © 2006 techfocus media, inc.
All rights reserved.
FPGA and Structured ASIC Journal Privacy Statement |