Volume : 51, Issue 12, December 2022

# AN ANALYTICAL STUDY ON HARDWARE EFFICIENT PARALLEL PREFIX ADDER DESIGN AND IMPLEMENTATION USING XILINX ISE 

D.Rajani, Dept of Electronics and Communication Engineering, Sree Venkateswara College Of Engineering, Nellore (Dt), Andhra Pradesh, India.<br>E.Venkateswarlu, Dept of Electronics and Communication Engineering, Sree Venkateswara College Of Engineering, Nellore (Dt), Andhra Pradesh, India.<br>V.Venkata Sai Karthik, Dept of Electronics and Communication Engineering, Sree Venkateswara College Of Engineering, Nellore (Dt), Andhra Pradesh, India.<br>V. Mahesh Kumar, Dept of Computer Science and Engineering, Sree Venkateswara College Of Engineering, Nellore (Dt), Andhra Pradesh, India.


#### Abstract

The fastest PPA belongs to a type of parallel prefix adders called kogge-stone adders, which do multibit addition. Because of their complex circuitry, these adders are difficult to install. Modified parallel prefix adders are developed for 16 -bit and 32 -bit parallel prefix kogge-stone adders in order to streamline the hardware of the standard adders. In this project, a customised 64-bit parallel prefix kogge-stone adder with less hardware complexity than a standard 64-bit parallel prefix kogge-stone adder is designed and simulated using XILINX ISE 14.7.


## 1. INTRODUCTION

Every input in a prefix problem influences every output whose magnitude is equal to or greater, and every output whose magnitude is equal to or lower. The associative attribute of the prefix- operator allows for the execution of each operation in any order. In particular, group variables can be created by combining procedures to partially and simultaneously solve the prefix problem for sets of input bits. When group variable sequences are evaluated once again at higher levels, intermediate group variable levels are created, where the group variable reflects the prefix outcome of bits at level 1 . The last level's group variables must include all bits up to 0 and consequently reflect the solutions to the prefix problem. The most often used redundant arithmetic adders, carry-save adders, are crucial for the effective implementation of multi operand addition circuits. Due to the lack of carry-propagation channels and the simplicity of their construction, they are highly quick, but there is little room for further optimisation. The same is true for signed-digit adders, which employ an alternative redundant number representation. However, in order to proceed with processing the addition results, they typically need to be transformed into an irredundant integer representation. A carry-propagate adder is used for this process. The parallel prefix adder (PPA) is thought to be the most efficient adder for adding two multi-bit values..Circuit complexity and PPA speed are crucial factors when it comes to effective hardware implementation, hence numerous PPA variants with diverse parameter characteristics have been created in recent years. Three stages-preprocessing stage, prefix computation stage, and final processing stage-are used to compute the mechanism of PPA[1]. The well-known parallel prefix tree adders include Sklansky, Han-Carlson, Brent-Kung, and Kogge-Stone. The literature revealed that, when compared to other adders, the Kogge-stone adder is the quickest. In terms of worst-case latency, Ripple-Carry, Carry-Look-Ahead, Carry Select, and Kogge-Stone are shown to have the highest adder priorities. This is because there are so many "Reduced stages."The easiest to implement of all tree adders, the Kogge-Stone adder also has one of the shortest critical paths. The Kogge Stone adder's implementation has the disadvantages of taking up a lot of space and requiring more intricate connection routing (Fan-Out).

Volume : 51, Issue 12, December 2022

## 2. LITERATURE SURVEY

To mix binary data and calculate the sum, adders employ several logic gate configurations. The subgrouping of the adders is based on how well they can take and combine the digits. In microprocessors, DSPs, mobile devices, and other high speed applications, parallel addition is performed using parallel-prefix adders. The performance is improved by factors like latency and power thanks to the Parallel Prefix Adders' decrease in logic complexity and delay. The Parallel- Prefix adders are therefore the appropriate component in the high speed arithmetic circuits. The carry chain is the main issue when utilising RCA for binary addition. The length of the carry chain rises as the input operand width does as well. When the carry takes the longest route feasible from the Least Significant Bit (LSB) to the Most Significant Bit (MSB), this is the worst case scenario that can happen. We are thinking about using a carry look ahead adder to propagate the carry ahead or lessen the RCA delay. Basically, propagates and generates are the two operations this kind of adder uses. The intricacy of the carry rises as the adder bit width increases. Therefore, designing higher bit CLA becomes complex. There is a concept known as the Parallel Prefix Approach that may be used to quickly and simply compute the carry in the preceding.

## EXISTING METHOD PARALLEL PREFIX ADDER

The parallel prefix adder (PPA) is a multi-bit carry-propagate adder that combines two multi-bit values in parallel. PPA accelerate adding by extending the produced and propagated logic of the carry lookahead adder [4]. The pre-processing stage, prefix computation stage, and final processing stage comprise the three steps of the fundamental schematic structure of the different PPA, perspective architectures (Fig. 2.1)[5]. Let's take a closer look at each stage.

## OPERATION OF PARALLEL PREFIX ADDER

At the pre-processing stage, carry-generatedescribedbythefollowingcorrespondinglogicalequations:
$\mathrm{g}_{\mathrm{i}}=\mathrm{A}_{\mathrm{i}} \& \mathrm{~B}_{\mathrm{i}} ; \mathrm{i}=0,1,2, \ldots, \mathrm{n}-1$
$\mathrm{h}_{\mathrm{i}}=\mathrm{A}_{\mathrm{i}} \wedge \mathrm{B}_{\mathrm{i}} ; \mathrm{i}=0,1,2, \ldots, \mathrm{n}-1$


Fig 1: Architecture of parallel pre fixadder

## 3. PROPOSED METHOD

The implementation of today's apps is accelerating at an unstoppable rate.
The modules that support those should be replaced when the bus interface protocols and the width of the peripherals change. Since whatever implementation is used must also support legacy devices, it should be expanded to include next-generation applications and devices. Although the design is good for the applications, the industry does not support it if the implemented design lacks these features. In this project, the modified 32 bit parallel prefix adder is expanded to 64 bit. because the majority of

Industrial Engineering Journal
ISSN: 0970-2555
Volume : 51, Issue 12, December 2022
processors used today have 64 bits. The 32 bit adder requires two cycles to operate on that processor/application, increasing the overall duration and lowering system performance.

## MODIFIED 64-BITPARALLEL PREFIX KOGGE-STONEADDER

When compared to a 32 bit parallel prefix adder, the modified 64 bit parallel prefix kogge stone increases. The updated 64-bit parallel prefix adder's block structure is illustrated in Fig. 3.1 below; it has six levels, labelled level L1, level L2, level L3, level L4, level L5, and level L6, and the operation is carried out in three stages: preprocessing, prefixing, and final processing.

## OPERATION OF MODIFIED 64-BIT PARALLEL PREFIX KOGGE-STONE ADDER

Three phases make up the improved 64-bit parallel prefix adder's operation. This adder's two inputs are denoted by the symbols A0,A1,A2.......A63 and B0,B1,B2.......B63, respectively. Preprocessor stage is the initial step. In the preprocessing stage, carries are generated and propagated while adding two adjacent bits from two 64-bit inputs, namely (A0,B0),(A1,B1),....(A63,B63). Prefix computation stage is the second stage. Six stages of addition are performed in the prefix computation step. 63 black cells and 1 grey cell make up Level L1.A grey cell performs the addition of bits A1 and B1, and all subsequent additions of other bits are completed by black cells. In the level L2 two grey cells and thirty black cells are grouped together to form two schematic nodes. In the first grey cell, the bits (A2,B2) and the carry from the addition of ( $\mathrm{A} 0, \mathrm{~B} 0$ ) in level L1 are added. In the second grey cell, the bits ( $\mathrm{A} 3, \mathrm{~B} 3$ ) and the carry from the earlier addition of the bits (A1,B1) in level L1 are added.Similar to this, the bits (A6,B6), (A7,B7), (A10,B10), and (A,B).......(A,B), (A,B), (A,B), (A,B), adder is used to accommodate 32-bit apps, and $(A, B)$ are inserted in black cells alongside prior 64-bit applications. Here, the number of blocks is less than in the standard kogge-stone adder and one stage group carry generated in level L1's earlier stages, respectively.
Four schematic nodes are created at level L3, and addition is performed by skipping four bits. Four grey cells and 28 black cells make up level L3. The carry produced from (A2,B2) at level L2 is combined with the bits $(A 4, B 4)$ in the first grey cell.The bits (A5,B5), (A6,B6), and (A7,B7) are combined with the carry from level L2's (A3,B3) and (A2,B2) to fill the remaining grey cells. Similar to this, carry created from earlier bits in level L2 is inserted in 20 black cells together with (A12,B12), (A13,B13), (A14,B13), (A15,B15), (A20,B20), (A21,B21), (A22,B22), (A23,B23), and (A60,B60), (A61,B61), (A62,B62), and (A63,B63).
8 schematic node groups of level L4 are constructed. By omitting eight bits, the addition is completed. There are 24 black cells and 8 grey cells at level L4. The carries produced from earlier bits are added to the bits (A8,B8), (A9,B9), (A10,B10), (A11,B11), (A12,B12), (A13,B13), (A14,B14), and (A15,B15) at level L3.Similar group carries created from earlier bits in level L3 are inserted in black cells together with (A24,B24)....(A31,B31), (A40,B40), (A47,B47), (A56,B56), and (A63,B63).
Groups with up to 16 nodes are established at level L5. The increase is carried out by skipping 16 bits in the level L5's 16 grey and 16 black cells. The carry produced from earlier bits in level L4 are inserted in sixteen grey cells together with the bits (A16,B16), (A17,B17),......(A31,B31). In sixteen black cells, the bits (A48,B48), (A49,B49),... (A63,B63), along with carries produced from the previous bits in level L4 in the appropriate places, are added. When 32 bits are grouped, level L6 is created. There are 32 grey cells in it. In level L5, carries produced from earlier bits are combined with the bits (A32,B32), (A33,B33), and (A63,B63) in 32 grey cells.
The sums created by the bits skipped at each step are transmitted to the final processing stage in the third stage, where they are added to the carry P0, P1, and P63 using an XOR gate-based circle. S0, S1, and so on through S63 are the outputs of XOR gates.In this manner, a modified 64 -bit parallel prefix

Industrial Engineering Journal
ISSN: 0970-2555
Volume : 51, Issue 12, December 2022
adder is used to create the total and carry.

## IMPLEMENTATION OF64-BIT PARALLEL PREFIX KOGGE STONE ADDER

Modified 8 bit parallel prefix kogge stone adder implementation makes it simple to comprehend modified 64 bit parallel prefix kogge stone adder implementation. Three layers make up the modified 8 bit parallel prefix kogge stone adder. The modified 8 bit parallel prefix kogge stone adder circuit diagram is equivalent to $1 / 8$ th of the modified 64 bit parallel prefix adder circuit schematic. The modified 64-bit parallel prefix kogge stone adder consists of 64 group carries, numbered C 0 to C 63 , which are run concurrently in the prefix stage of the adder's second stage utilising the sums and carries produced in the preprocessing stage of the adder's first stage. All of these group carries are then added in the final processing stage. The modified 8 bit parallel prefix kogge stone adder is used to describe the carries execution from C 0 to C 8 . Fig. 4.1 displays the internal circuit diagram of the modified 8 bit parallel prefix kogge stone adder.
The white cells in the preprocessing step are arranged in 8 columns, numbered 0 to 7 .Take two 8 -bit inputs a0, a1, a2, a 3, a 4, a 5, a 6, and a 7 into consideration. Sums and carries are created during the preprocessing stage in the same
$\mathrm{A} 0 \mathrm{xb} 0=\mathrm{g} 0$ in column 0 , where $\mathrm{a} 0 \mathrm{xor} \mathrm{b} 0=\mathrm{h} 0$,
In column 1, a1 xorb1=h1, a1xb1=g1, in column 2, a2xorb2=h2, a2xb2=g3, in column 4, a4xorb4=g4, in column 5, a5xorb5=g5, in column 6, a6xorb6=g6, and in column 7, a7xorb7=h7, a7xb7=g7.

## Stage2: prefix calculation stage

In the second stage group carries are generated, the group carries are

```
\(\mathrm{C}_{0}=\mathrm{g}_{0}\)
\(\mathrm{C}_{1}=\mathrm{g}_{1}+\mathrm{h}_{1} \mathrm{xC} \mathrm{C}_{0}=\mathrm{g}_{1}+\mathrm{h}_{1} \mathrm{~g}_{0} \mathrm{C}_{2}=\mathrm{g}_{2}+\mathrm{h}_{2} \mathrm{xC}_{1}=\mathrm{g}_{2}+\mathrm{h}_{2}\left(\mathrm{~g}_{1}+\mathrm{h}_{1} \times \mathrm{g}_{0}\right)=\mathrm{g}_{2}+\mathrm{h}_{2} \mathrm{~g}_{1}+\mathrm{h}_{1} \mathrm{xh}_{2} \mathrm{xg}_{0} \mathrm{C}_{3}=\mathrm{g}_{3}+\mathrm{h}_{3} \mathrm{xC}_{2}=\mathrm{g}_{3}+\mathrm{h}_{3}\left(\mathrm{~g}_{2}+\mathrm{h}_{2} \mathrm{~g}_{1}+\mathrm{h}_{1}\right.\)
\(\left.\mathrm{h}_{2} \mathrm{~g}_{0}\right)=\mathrm{g}_{3}+\mathrm{h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{3} \mathrm{~h}_{2} \mathrm{~g}_{1}+\mathrm{h}_{3} \mathrm{~h}_{1} \mathrm{~h}_{2} \mathrm{~g}_{0} \mathrm{C}_{4}=\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{xC} \mathrm{C}_{3}=\mathrm{g}_{4}+\mathrm{h}_{4}\left(\mathrm{~g}_{3}+\mathrm{h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{3} \mathrm{~h}_{2} \mathrm{~g}_{1}+\mathrm{h}_{3} \mathrm{~h}_{1} \mathrm{~h}_{2} \mathrm{~g}_{0}\right)=\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{4}\)
\(\mathrm{h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0} \mathrm{C}_{5}=\mathrm{g}_{5}+\mathrm{h}_{5} \mathrm{xC} \mathrm{C}_{4}=\mathrm{g}_{5}+\mathrm{h}_{5}\left(\mathrm{~g}_{4}+\mathrm{h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}\right)\)
\(=g_{5}+g_{4} h_{5}+h_{5} h_{4} g_{3}+h_{5} h_{4} h_{3} g_{2}+h_{5} h_{4} h_{3} h_{2} g 1+\)
\(\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0} \mathrm{C}_{6}=\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{xC} \mathrm{C}_{5}=\mathrm{g}_{6}+\mathrm{h}_{6}\left(\mathrm{~g}_{5}+\mathrm{g}_{4} \mathrm{~h}_{5}+\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}\right)=\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{~g}_{5}+\mathrm{h}_{6} \mathrm{~g}_{4} \mathrm{~h}_{5}+\)
\(h_{6} h_{5} h_{4} g_{3}+h_{6} h_{5} h_{4} h_{3} g_{2}+h_{6} h_{5} h_{4} h_{3} h_{2} g 1+\)
\(\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0} \mathrm{C}_{7}=\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{x} \mathrm{C}_{6}=\mathrm{g}_{7}+\mathrm{h}_{7}\left(\mathrm{~g}_{6}+\mathrm{h}_{6} \mathrm{~g}_{5}+\mathrm{h}_{6} \mathrm{~g}_{4} \mathrm{~h}_{5}+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}\right)\)
\(=\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{~g}_{6}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~g}_{5}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~g}_{4} \mathrm{~h}_{5}+\mathrm{h}_{7} \quad \mathrm{~h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h} 7 \mathrm{~h} 6 \mathrm{~h} 5 \mathrm{~h} 4 \mathrm{~h} 3 \mathrm{~h} 2 \mathrm{~h} 1 \mathrm{~g} 0\)
```

These carries generated parallel in three levels level1, level2, level3 using7 columns.
Level1 results: The results of levell are denoted as $G[i: K]$ where I denotes the present column and $k$ Denotes the previous column from where carry is
$\mathrm{G}[5: 4]=\mathrm{g}_{5}+\mathrm{h}_{5} \mathrm{~g}_{4} \quad$ and $\quad \mathrm{H}[5: 4]=\mathrm{H}[5: 0]=\mathrm{h}_{5} \mathrm{Xh}_{4} \mathrm{G}[6: 5]=\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{~g}_{5} \quad$ and $\mathrm{H}[6: 5]=\mathrm{H}[6: 0]=\mathrm{h}_{6} \mathrm{xh}_{5} \mathrm{G}[7: 6]=\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{~g}_{6}$ andH[7:6]=H[7:0]= $\mathrm{h}_{7} \mathrm{xh}_{6}$
Hence the group carries Co and C1 are generated inthelevelland are denoted as G[0:0] and G[1:0] which are called as prefixes and the final sums are also generated which are added to these prefixes in final stage.

Level2 results: the results of level2 are also denoted as $G[i: K]$ but the execution starts from $2^{\text {nd }}$ column. $\mathrm{G}[2: 0]=\mathrm{G}[2: 1]+\mathrm{H}[2: 1] \mathrm{g}_{0}=\mathrm{g}_{2}+\mathrm{h}_{2} \mathrm{~g}_{1}+\mathrm{h}_{1} \mathrm{~h}_{2} \mathrm{~g}_{0}=\mathrm{C}_{3}=\mathrm{G}[2: 0]$ andH$[2: 0]=\mathrm{H}[2: 1] \mathrm{G}[3: 1]=\mathrm{G}[3: 2]+\mathrm{H}[3: 2] \mathrm{Xg}[1: 0]=$ $\mathrm{g}_{3}+\mathrm{h}_{3} \mathrm{~g}_{2}+\left(\mathrm{h}_{3} \mathrm{xh}_{2}\right)\left(\mathrm{g}_{1}+\mathrm{h}_{1} \mathrm{xg}_{0}\right)$
$=\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}=\mathrm{C} 4=\mathrm{G}[3: 0]$ andH[3:1]=H[3:2]=H[3:0]G[6:4]=G[6:5]+H[6:5]xG[ $4: 3]=\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{~g}_{5}+\left(\mathrm{h}_{6} \mathrm{~h}_{5}\right)\left(\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{xg}_{3}\right)=$ $\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{~g}_{5}+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~g}_{4}+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~g}_{3}$ and

## Industrial Engineering Journal

ISSN: 0970-2555
Volume : 51, Issue 12, December 2022
$\mathrm{H}[6: 4]=\mathrm{H}[6: 5] \mathrm{h}_{4}=\mathrm{h}_{6} \mathrm{xh}_{5} \mathrm{xh}_{4} \mathrm{G}[7: 5]=\mathrm{G}[7: 6]+\mathrm{H}[7: 6] \mathrm{xG}[5: 4]=\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{xg}_{6}+\left(\mathrm{h}_{7} \mathrm{xh}_{6}\right)\left(\mathrm{g}_{5}+\mathrm{h}_{5} \mathrm{xg}_{4}\right.$
$)=\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{xg}_{6}+\mathrm{h}_{7} \mathrm{xh}_{6} \mathrm{xg}_{5}+\mathrm{h}_{7} \mathrm{xh}_{6} \mathrm{xh}_{5} \mathrm{xg}_{4} \quad$ andH[7:5]=H[7:6] $\mathrm{xh}_{5}=\mathrm{h}_{7} \mathrm{xh}_{6} \mathrm{xh}_{5}$
Thusinlevel2thegroupcarriesC3andC4aregenerated which are denoted as G[2:0] and G[3:0]theseare also called prefixes.

Level 3 results: In level 3 the execution starts fromthe $4{ }^{\text {th }}$ columntakingthecarryof $2^{\text {nd }}$ columngenerated in the level 2 . Thus for the $4^{\text {th }}$ bit executionthe carry is generated at $2^{\text {nd }}$ bit itself which results inlessdelay.
$\mathrm{G}[4: 2]=\mathrm{G}[4: 3]+\mathrm{H}[4: 3] \mathrm{xG}[2: 0]=\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{xg}_{3}+\left(\mathrm{h}_{4} \mathrm{xh}_{3}\right)\left(\mathrm{g}_{2}+\mathrm{h}_{2} \mathrm{xg}_{1}+\mathrm{h}\right.$
takenforexecution.Itusestheg ${ }_{i}$ generated inpreprocessingstage.
andh $_{i}$
values
$\left.{ }_{1} \mathrm{xh}_{2} \mathrm{xg}_{0}\right)=\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}=$
C4=G[4:0]
$\mathrm{G}[0: 0]=\mathrm{C}=$ gand $\mathrm{H}[0: 0]=\mathrm{h}$
$\mathrm{G}[5: 3]=\mathrm{G}[5: 4]+\mathrm{H}[5: 4] \mathrm{xG}[3: 0]=\quad \mathrm{g}_{5}+\mathrm{h}_{5} \mathrm{xg}_{4}+\left(\mathrm{h}_{5} \mathrm{~h}_{4}\right)($
$0 \quad 0 \quad 0$
$\mathrm{G}[1: 0]=\mathrm{g}+\mathrm{hg}=\mathrm{C} 1$ and $\mathrm{H}[1: 0]=\mathrm{h}$
$\left.g_{4}+h_{4} g_{3}+h_{4} h_{3} g_{2}+h_{4} h_{3} h_{2} g 1+h_{4} h_{3} h_{2} h_{1} g_{0}\right)=g_{5}+g_{4} h_{5}+h_{5} h_{4} g_{3}+$
110
1
$\mathrm{G}[2: 1]=\mathrm{g}+\mathrm{hgand} \mathrm{H}[2: 1]=\mathrm{H}[2: 0]=\mathrm{hxh}$
$\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}=\mathrm{C} 5=\mathrm{G}[5: 0]$
22112
$\mathrm{G}[3: 2]=\mathrm{g}_{3}+\mathrm{h}_{3} \mathrm{~g}_{2}$ and $\mathrm{H}[3: 2]=\mathrm{H}[3: 0]=\mathrm{h}_{3} \mathrm{xh}_{2} \mathrm{G}[4: 3]=\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{~g}_{3}$ andH[4:3]=H[4:0]= $\mathrm{h}_{4} \mathrm{xh}_{3}$
$\mathrm{G}[6: 2]=\mathrm{G}[6: 4]+\mathrm{H}[6: 4] \mathrm{xG}[2: 0]=\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{Xg}_{5}+\mathrm{h}_{6} \mathrm{xh}_{5} \mathrm{xg}_{4}+\mathrm{h}_{6} \mathrm{xh}_{5} \mathrm{X}$
$\mathrm{g}_{3}+\left(\mathrm{h}_{6} \mathrm{xh}_{5} \mathrm{xh}_{4}\right) \mathrm{x}\left(\mathrm{g}_{2}+\mathrm{h}_{2} \mathrm{xg}_{1}+\mathrm{h}_{1} \mathrm{xh}_{2} \mathrm{xg}_{0}\right)=\mathrm{g}_{6}+\mathrm{h}_{6} \mathrm{~g}_{5}+\mathrm{h}_{6} \mathrm{~g}_{4} \mathrm{~h}_{5}+\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~g}_{3}+\quad \mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\quad \mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+$ $\mathrm{h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}=\mathrm{C} 6=\mathrm{G}[6: 0] \mathrm{G}[7: 3]=\mathrm{G}[7: 5]+\mathrm{H}[7: 5] \mathrm{xG}[3: 0]=\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{xg}_{6}+\mathrm{h}_{7} \mathrm{xh}_{6} \mathrm{xg}_{5}+\mathrm{h}_{7} \mathrm{xh}_{6} \mathrm{x}$
$h_{5} \mathrm{Xg}_{4}+\left(\mathrm{h}_{7} \mathrm{xh}_{6} \mathrm{xh}_{5}\right) \mathrm{x}\left(\mathrm{g}_{4}+\mathrm{h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~g} 1+\mathrm{h}_{4} \mathrm{~h}_{3} \mathrm{~h}_{2} \mathrm{~h}_{1} \mathrm{~g}_{0}\right)=$
$\mathrm{g}_{7}+\mathrm{h}_{7} \mathrm{~g}_{6}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~g}_{5}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~g}_{4} \mathrm{~h}_{5}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~g}_{3}+\mathrm{h}_{7} \mathrm{~h}_{6} \mathrm{~h}_{5} \mathrm{~h}_{4} \mathrm{~h}_{3} \mathrm{~g}_{2}+\mathrm{hhhhhhg} 1+\mathrm{hhhhhhhg}=\mathrm{c} 7=\mathrm{G7}: 0$
generated from $\mathrm{P}_{16}$ to $\mathrm{P}_{31}$, in level 6 remaining 32 aregenerated from $\mathrm{P}_{32}$ to $\mathrm{P}_{63}$ and these all are added infinal processing stage and 64 outputs $S_{0}, S_{1}$ to $S_{63}$ areobtained.
Theimplementationofmodified64-bit
parallelprefixadderisdoneusing XilinxVerilog
$\begin{array}{lllll}7 & 6 & 54 & 32 & 7654 \\ 32 & 10\end{array}$
Thus in level 3 the carries $\mathrm{C}_{4}, \mathrm{C}_{5}, \mathrm{C}_{6}, \mathrm{C}_{7}$ are generated anddenoted as $\mathrm{G}[4: 0], \mathrm{G}[5: 0], \mathrm{G}[6: 0]$, $\mathrm{G}[7: 0]$.
Stage3:Finalprocessingstage
G[0:0],G[1:0],G[2:0],G[3:0],G[4:0],G[5:0],G[6:0],G[
7:0]obtainedfromsecondstagearecalledasprefixesdenotedas $\mathrm{P}_{0}, \mathrm{P}_{1}, \mathrm{P}_{2}, \mathrm{P}_{3}, \mathrm{P}_{4}, \mathrm{P}_{5}, \mathrm{P}_{6}, \mathrm{P}_{7}$ respectivelyandthesesa readdedtofinalsumsgenerated in level 1 of second stage in the third stagei.e,final processingstage.
Inthefinalprocessingstageainputcarry $\mathrm{P}_{\text {in }}=0$ denotedasP $_{-1}$ isgivenandthefinaloutputsarecalculated asshownbelow
$\mathrm{S}_{0}=\mathrm{H}[0: 0]$ XOR $\mathrm{P}_{-1} \mathrm{~S}_{1}=\mathrm{H}[1: 0]$ XOR $\mathrm{P}_{0} \mathrm{~S}_{2}=\mathrm{H}[2: 0]$ XOR $\mathrm{P}_{1} \mathrm{~S}_{3}=\mathrm{H}[3: 0]$ XOR $\mathrm{P}_{2} \mathrm{~S}_{4}=\mathrm{H}[4: 0]$ XOR
$\mathrm{P}_{3} \mathrm{~S}_{5}=\mathrm{H}[5: 0]$ XOR $\mathrm{P}_{4} \mathrm{~S}_{6}=\mathrm{H}[6: 0]$ XOR $\mathrm{P}_{5} \mathrm{~S}_{7}=\mathrm{H}[7: 0]$ XOR $\mathrm{P}_{6} \mathrm{~S} 8=\mathrm{P} 7=\mathrm{POUT}$
Thusthefinalresultsareobtained.
In the case of modified 64 bit, the same procedure applies since the levels of the bits and the number of columns will both rise. Up to 8 columns operate similarly to modified 8 bit ppa prefixes P0 to P7 are generated using 3 levels, remaining 8 carries starting from P8 to P15 are generated in level 4, and the
final 16 carries are HDL software. The modified 64 bit ppa is made up of 64 columns ranging from 0 to 63, 6 levels from level 1 to level 6, and 64 carries from C0 to C63.

## 4. RESULT ANALYSIS

Before the design could be tested for accuracy by walking practical simulation or downloading into the prototype board, it needed to be synthesised and put to use.
a. For cell-based designs, carry-skip, carry-select, and prefix adders have been compared qualitatively and quantitatively.
b. This design is an alternative to carry-save adders with the option of having a constant or variable binary number as the third operand.
c. The suggested method performs on par with a multi-operand conventional adder.
d. Because the new logic is integrated into a standard adder, there is no longer a requirement for specialised adder units to carry out the operation.
e. Despite its high power and area consumption, the Kogge-Stone adder performs well in terms of speed.
f.In order to examine the impact and potential performance increase of the adder, it will be applied in real-world application designs like decimal math and multiplier units.

Fig 2: RTL schematic window


Fig3; Technology schematic window


Fig 4: simulation window

Industrial Engineering Journal
ISSN: 0970-2555
Volume : 51, Issue 12, December 2022

## CONCLUSION

We explore parallel prefix adders, which are highly favoured in arithmetic-logic components of contemporary processors including digital signal processors and microprocessors. This research looks at the fastest parallel prefix adder, the conventional 64-bit kogge-stone adder. The hardware complexity of the design makes it difficult to implement. A modified 64-bit parallel prefix Kogge-stone adder is created in this project.
Starting at level 2, the pairs of columns are grouped starting with level 2 (i.e., two pairs of columns are grouped in level 2 and carry propagated only from those groups)., While in the standard 64 bit parallel prefix adder no such grouping is done and carries are generated and propagated from every column, which requires extra gates to generate and propagate these carries from every column, level 3 groups four pairs of columns, level 5 groups eight pairs of columns, and level 6 groups 16 pairs of columns, all of which are grouped and prefixes are calculated. Thus, the hardware complexity of the standard 64 bit kogge stone adder was reduced by the modified 64 bit parallel prefix kogge adder. Verification is done on the simulation outcomes of a modified 64-bit parallel Kogge-Stone adder.

## REFERENCES

[1] Geeta Rani, Sachin Kumar.-Delay Analysis of Parallel-Prefix Addersll. International Journal of Science and Research (IJSR), ISSN: 2319-7064,Impact Factor(2012):3.358.Volume3Issue6,June, 2014.pp. 2339.
[2] Sunil.M,Ankith.R.D,Manjunatha.G.DandPremananda.B.S.Designandimplementationoffasterparallelp refixKoggeStoneadder.InternationalJournalofElectricalandElectronic
Engeering\&Telecommunications2014.ISSN2319 - 2518.Vol.3,No.1,January2014.pp. 116.
[3] AungMyoSan, AlexeyN.Yakunin-Reducing the Hardware Complexity of a Parallel Prefix Adder\| IEEE2018pp. 1350
[4] Athira.T.S, Divya.R, Karthik.M, Manikandan.A.DesignofKogge-Stoneforfastaddition.Proceedings of 34th IRF International Conference, $26^{\text {th }}$ February 2017, Bengaluru, India. ISBN: 978-93-86291639.pp. 27-28.
[5] CH.Sudha,Rani,CH.Ramesh.DesignandImplementation of High Performance Parallel PrefixAdders.InternationalJournalofInnovativeResearchinComputerandCommunicationEngineering. AnISO3297:2007CertifiedOrganization. Vol.2, Issue 9, September 2014. pp.5900.
[6] VibhutiDave.High-speedmultioperandaddition utilization flag bits. Submitted inpartialfulfillmentoftherequirementsforthedegreeofDoctorofPhilosophyinComputerEngineering,Chi cago.Illinois. May2007. pp. 38-39.
[7] RaghumanoharAdusumilli,VinodkumarK -DESIGNANDIMPLEMENT ATIONOFAHIGH SPEED64BITKOGGE-STONEADDERUSING VERILOG HDLI International journal of electricaland electronic engineering \& Telecommunications(IJEETC) ISSN 2319_2518, vol.41,January 2015pp.15-16.

