# Implementation of 64-Point Reconfigurable FFT Processor for ASIP Architecture

**B.S.Mahant<sup>1</sup>**, L.P.Thakare<sup>2</sup>

#### Abstract

In this paper a novel architecture of ASIP for a reconfigurable FFT is proposed. The proposed design implements a reconfigurable 64 points FFT processor for unsigned real numbers. By changing the value of integer constant we can design 2, 4, 8, 16, 32 and 64 point FFT which incorporates a high speed ASIP. In OFDMA system there is a need of alterable point FFT processor. Hence, the design meets the requirement of OFDMA system. DIF-FFT algorithm is implemented using VHDL language and Xilinx 9.1i is used for simulation results. The clock frequency is limited to 272.769 MHz for design stability and real time processing.

#### Keywords

FFT, DIF-FFT, OFDMA.

### I. Introduction

FFT module is core part of OFDMA system as defined by IEEE Std 802.16-2005. The core module of OFDMA physical layer is the FFT module, which can be used in the FFT points are: 2048 points, 1024 points, 512 points, 128 points and 64 points. The design about variable point FFT processor is just based on FFT module in OFDMA system application. Hence this paper uses a reconfigurable FFT design for implementation of ASIP in OFDMA application for unsigned real number.

A variety of ASIP implementations have been presented for FFT algorithms. [3] used 2D FFT Algorithm to design the FFT Processor. According to the idea of two-dimensional Fourier algorithm, i.e. N = 128, N1 = 2, N2 = 64. From 128 = 2 \* 64. When achieve 128-point FFT, Firstly the data is arranged in 64 lines and 2 rows, secondly the input data will transform the 64 points FFT, then the result

multiplies twiddle factor, Thirdly, the result do 2 points FFT. For the classic Cooley-Tukey FFT (CT-FFT), different ASIP implementations have been presented.

In [4] instruction capable of calculating a whole butterfly operation is implemented and the computation resources are distributed into three execution stages to reduce the clock cycle. However, long pipelines consume more energy, and the speedup from one integrated butterfly computation is still not enough to meet the high throughput requirement of the demanding IEEE 802.15.3a UWB communication standard. In [5] Hardware Extension is used for computations composed by four parallel butterfly units. A separate custom register file (CRF) is used to store all the intermediate data of the FFT computations in each epoch, and an on-chip ROM for the intra-epoch coefficients is used. An address changing logic (AC) is added in the decoder to give the right data address and coefficient address. It utilizes parallelism in the data path for performance in dissipation for the wide instruction length of 619 bits. The Xtensa ASIP design for FFT adds a set of, Therefore, after analyzing and comparison of various FFT algorithms, Decimation-In-Frequency FFT algorithm have been chosen. In order to ensure precision, the floating-point system is used in the design.

The paper presents a variable point FFT processor of a pipeline structure which is reconfigurable. The article is structured as follows. Fourier Transforms which is chosen in this design are illustrated in Section II. In Section III, the novel design of ASIP using reconfigurable FFT is proposed. In Section IV, the implementation and simulation is detailed. At last, the concise conclusions remark this paper.

# **II. Methodology**

FFT is defined by the formula:

 $Xk = \sum_{n=0}^{N-1} x(n) W_N^{nk} \quad k=0,1,...N-1 \quad (1)$ Equation(1):N is Transform length  $W_N^{nk} = e^{-j2\pi/N}$  is Twiddle Factor.

**B.S.Mahant**, Electronics Engineering Department, G. H. Raisoni College of Engineering, Nagpur.

L.P.Thakare, Electronics Engineering Department, G. H. Raisoni College of Engineering, Nagpur.

International Journal of Advanced Computer Research (ISSN (print): 2249-7277 ISSN (online): 2277-7970) Volume-3 Number-2 Issue-10 June-2013

The conventional method of FFT calculation involves  $N^2$  complex multiplication and N(N-1)complex additions.

The radix-2 Cooley-Tukey algorithm performs the same computation involving  $(N/2)\log_2 N$  complex multiplication and  $(N)\log_2 N$  complex additions. Algorithm is divided into frequency based (DIF)Fast Fourier Transform. The basic idea of these algorithms are that the N point FFT is divided into smaller and smaller parts until only two points FFT(Radix-2) is reached. Twiddle factor is considered as (-1) for all the butterfly computations.

# **III.Design of ASIP**

Reconfigurable FFT module is core part of ASIP which can be used in various FFT points: 64 points, 32 points, 16 points, 8 points,4 points,2 points. The design is based on select module .It is based on input constant value given to the constant K to select the required points FFT: 64 points,32 points,16 points,8 points,4 points,2 points. By defining this value the algorithm is compiled .After validity the architecture of the processor was modeled in VHDL language and functionally verified by Xilinx 9.1i software and timing simulations were obtained.

### **IV. Implementation and Simulation**

#### A. RTL Schematics

The proposed design was synthesized at 3.66ns clock cycles. The RTL Block Diagrams for Reconfigurable FFT module for constant value of K: 64 is shown in Figure.1.



Figure.1: RTL View for 64 point FFT

B. Behavioral Simulation



Figure.2: Simulation of 2 point FFT

When integer is 2 it gives 2 point FFTOutput for input  $\{1,1\}$  comes to  $\{2,0\}$  is shown in Figure 2.



Figure.3: Simulation of 4 point FFT

Figure.3. shows that when integer is 4 it gives 4 points FFT.O/P for  $\{0,1,2,3\}$  comes to  $\{6,-4,-2,0\}$  assuming twiddle factor (-1).

| Now:<br>2000 ns                   |   | 1000 | 12 | 0 | 14 | 400 | 1 | 1600<br> | 1 | 1800 | I |
|-----------------------------------|---|------|----|---|----|-----|---|----------|---|------|---|
| <b>a <mark>Şil</mark>iyo]3:0]</b> | 8 |      |    |   |    |     |   |          |   |      |   |
| <b>a <mark>Şıl</mark>yı (3:0)</b> | 0 |      |    |   |    |     |   |          |   |      |   |
| <b>1 și </b> y2[3:0]              | 0 |      |    |   |    |     |   |          |   |      |   |
| <b>a <mark>Şıl</mark>y3</b> [3:0] | 0 |      |    |   |    |     |   |          |   |      |   |
| <b>a <mark>Şıl</mark>y4(3:0)</b>  | 0 |      |    |   |    |     |   |          |   |      |   |
| <b>a <mark>şil</mark>y5[3:0]</b>  | 0 |      |    |   |    |     |   |          |   |      |   |
| <b>a <mark>Şil</mark>y6[3:0]</b>  | 0 |      |    |   |    |     |   |          |   |      |   |
| ∎ <mark>\$</mark> ¶y7[3:0]        | 0 |      |    |   |    |     |   |          |   |      |   |

Figure.4: Simulation of 8 point FFT

Figure.4.shows the output for 8 points FFT For inputs  $\{1,1,1,1,1,1,1,1\}$  output is  $\{8,0,0,0,0,0,0,0,0\}$  with twiddle factor (-1)



Figure.5: Simulation of 16 point FFT



Figure.6: Simulation of 32 point FFT

For all the inputs considered as 1 Figure.6. gives the output with time cycle more than 1000nsec.

| Now:<br>4000 ns |      | 1000 |         | 2000    |        |        |         | 1            | 3000   |       |        |      |            | 4000       |  |  |
|-----------------|------|------|---------|---------|--------|--------|---------|--------------|--------|-------|--------|------|------------|------------|--|--|
| B 84 y37[3:0]   | 4ħF  | 47hX | (4hD)   | 4hF     | 411    | (4hE)  | 4716    | (4hD)        | 4hF    | (4ht) | (41E)  | 4716 | X4hD       | <b>h</b> i |  |  |
| B ( y38[3:0]    | 4ħF  | 4700 | (4h1)   | ATHE    | (4119) | (4714) | 41h8    | X 4m7)       | 4hF    | 419   | (4'h4) | 4118 | X 4h7      | Ant        |  |  |
| B4 y39[3:0]     | 4h1  | 470X | X 41    | h1      | 4'hD   | (4hA)  | 4710    | X 4m1        |        | (41D) | (4'hA) | 4110 | X 4711     |            |  |  |
| B 84 y40[3:0]   | 4ħ9  | 4705 | (4h3)   | 419     | 4hF    | -      | 4100    | X 4h7        | 419    | (4hF) | <      | 4%C  | X 417      | h          |  |  |
| B 84 y41[3:0]   | 4ħ7  | 4thX | 418     | 4       | h7     | (412)  | 4'h0    | X 415)       | 417    |       | (4h2)  | 4110 | ( 4h5 )4h7 |            |  |  |
| B \$4 y42[3:0]  | 4ħ7  | 41hX | (4hF)   | 417     | 473    | (4hC)  | 47\A    | X 4713)      | 477    | (413) | (4hC)  | 4ħA  | X 473      | (h)        |  |  |
| B 64 y43[3:0]   | 4h5  | 4700 | (4hF)   | 4115    | 4hF    | (4716) | 41hA    | X 479        | 4115   | 4hF   | 416    | 416  | X 419      | 4 hi       |  |  |
| B 84 y44[3:0]   | 4115 | 471X | 419     | (4h5)   | (4h9)  | (41)4  | 4118    | X 479        | 4h5    | (419) | (4'h4) | 4118 | X 419      | Ans        |  |  |
| ■ 84 y45[3:0]   | 4ħ7  | 47hX | (4h9)   | 4h7     | (4h9)  | (4ħA)  | 4714    | X 4hF        | 4m7    | 419   | (4hA)  | 4114 | X 4hF      | 4m7        |  |  |
| D 64 y46[3:0]   | 4ħF  | 4hX  | (4h9)   | 4hF     | (419)  | (4h0)  | 4716    | X 419)       | (4hF)  | (419) | 410    | 416  | X 419      | The        |  |  |
| B 84 y47[3:0]   | 4h9  | 475X | × 4711) | 4119    | (4%D)  | (472)  | 416     | X 4h7        | (4119) | (4hD) | (4'h2) | 4116 | X 4m7      | Ans.       |  |  |
| B \$4 y48[3:0]  | 4hD  | 4700 | (4h3)   | (4hD)   | 411    | (4hE)  | 4'114   | X 479)       | 4hD    | 411   | (47E)  | 4754 | X 4719     | InE        |  |  |
| B 84 y49[3:0]   | 4ħ3  | 470X | (4hF)   | (4h3)   | 4'hD   | (4714) | 4118    | X 4hF        | (4h3)  | 4hD   | (4'h4) | 4118 | X 4hF      | n:         |  |  |
| B \$4 y50[3:0]  | 4ħF  | 47hX | (4m7)   | 4hF     | 419    | (      | 4h2     | X 4THD X 4TH |        | 419   | 4m2    |      | X 4hD      | hi         |  |  |
| B 64 y51[3:0]   | 4h5  | 4hX  | ATHB )  | 4115    | 419    | (4hC)  | 4hA     | (4hB)        | 4115   | (419) | (4hC)  | 4hA  | X 4'hB     | Ins        |  |  |
| B4 y52[3:0]     | 4ħD  | 4798 | ( 4h5 ) | 4hD     | (4h3)  | (4hE)  | 4110    | X 4hB        | 4hD    | 413   | (4hE)  | 4710 | X 4hB      | Inc        |  |  |
| B 84 y53[3:0]   | 4h7  | 411X | × 419   | X 4m7 X |        | 454    | X 419 X |              | 417    |       | 4114   |      | 4h7        |            |  |  |

Figure.7: Simulation of 64 point FFT

With integer kept at 64 the design behaves as 64 points FFT. After this any value of integer say 128,256 and so on will give default results as in Figure.7.

# V. Results

The FFT core in ASIP is computed with a speed of 272.769MHzs which is faster than 250MHzs [3] and 100MHzs [2].

# **VI.** Conclusions

ASIP architecture for Reconfigurable FFT was proposed and implemented with high speed

performance. The successful completion of the design altered 2,4 ,8 ,16 ,32 and 64 points FFT. Computation, design for 64 points Reconfigurable FFT limited to 272.769MHzs clock frequency, with the overall timing design stability is obtained. The design can be applied to real-time unsigned signal processing system, which completes the main computing modules in the OFDMA system.

### References

- Hanan M. Hassan, Ahmed F Shalash and KarimMohd "FPGA Implementation of an ASIP for high throughput DFT/DCT 1D/2D engine" 2012 IEEE.
- [2] M.BabylathaRajan and V.Pushparahavan "64-Point Radix 4 FFT Processing Using DBNS" International Conference on Computer and Information Technology, Bangkok, 2012.
- [3] WANG Xiu-fang, HOU Zhen-long "Design and Implement of FFT Processor for OFDMA System using FPGA" 2011 IEEE.
- [4] Akshay Sridharan, A Fiji "Low Power Hardware Implementation of High Speed FFT Core" International Conference on Advances in Computer Engineering 2010 IEEE.
- [5] Xian Guan, Hai Lin and YunsiFei, "Design of an Application-specific Instruction Set Processor for High- throughput and Scalable FFT" 2009 IEEE.
- [6] Joseph R. Cavallaro, PredragRadosavljevic"ASIP Architecture for Future Wireless Systems: Flexibility and Customization" Nokia Corporation, Texas Instruments, Inc 2008 IEEE.
- [7] Jacobson, Anthony T., Dean Nguyen Truong, and Bevan M. Baas. "The design of a reconfigurable continuous-flow mixed-radix FFT processor." In Circuits and Systems, 2009. ISCAS 2009. IEEE International Symposium on, pp. 1133-1136. IEEE, 2009.
- [8] Chidambaram, Ramesh, Rene van Leuken, Marc Quax, Ingolf Held, and Jos Huisken. "A multistandard FFT processor for wireless systemon-chip implementations." In Circuits and Systems, 2006. ISCAS 2006. Proceedings. 2006 IEEE International Symposium on, pp. 4-pp. IEEE, 2006.
- [9] M. Nicola, G. Masera, M. Zamboni, H. Ishebabi, D. Kammler, G. Ascheid, and H.Meyr." FFT processor: A case study in ASIP development" In IST Mobile Summit, 2005.



#### Miss.Bhagyashree S. Mahant

B.E., M.Tech G.H. Raisoni College of Engineering, Nagpur ISTE Member Designation:-Training and Placement Officer (Institute), I/C HOD, Electronics Dept.G.H. Raisoni Polytechnic, Nagpur.