

# Energy Efficient high Performance Three INPUT EXCLUSIVE-OR/NOR Gate Design

# Aditya Mishra, Deepak Kumar

Department of Electronics & Communication, Vidhyapeeth Institute of Science & Technology, Bhopal, Madhya Pradesh, India

## ABSTRACT

This paper presents highly efficient three inputs Exclusive-OR/NOR gate (XOR/XNOR) cell. The Exclusive-OR/NOR gate (XOR/XNOR) logic gates are the essential blocks of various embedded arithmetic system such as binary full adder, parity generator/checker, binary comparator, and encryption processor. Many circuits have been proposed on full adder and XOR/XNOR gate design that can be categorised in two categories. First one offers full swing output and second one offers partial swing output. The Systematic Cell Design Methodology is partial swing based logic design method which offers less delay and low power consumption at weak logic '0' and logic '1' generation at output. The proposed design is based on pass transistor logic and transmission gate technology. The proposed design consumes 30%, 38%, 33%, 13% and 48% higher dynamic power than XO4, XO7, XO10, Hybrid and LPHSFA, respectively. While the delay time of proposed design is 40%, 40%, 33%, 75% and 81% lower than XRG1, XRG2, XRG3, Hybrid and LPHSFA, respectively. All simulation results have been obtained by using HSPICE schematic simulator based on TSMC 130nm CMOS technology at 1.2 V supply voltages.

Keywords : Hybrid CMOS logic; Exclusive-OR/NOR Gate; Transmission Gate; Pass Gate; Full Swing Logic.

## I. INTRODUCTION

In digital integrated circuits as the technology is getting advanced and innovative day by day, power consumption and delay is also increasing dramatically; saving power is high in demand as it will reduce the overall cost for mobile computing and higher integration density as well as reduction in delay offers higher computation speed. But poor voltage swing offered at output of logic block misguides or malfunction the next stage. With the explosive growth, the demand, and the popularity of portable electronic products, the designers are driven to strive for smaller silicon area, longer battery life, higher speed, and enhanced reliability is increases with technology scaling. As Technology scaling increases the transistor count and operating frequency, which push the market demand for more and more function in Integrated circuit. However scaling always increases leakage power dissipation. As channel length reduces which results in increase of power dissipation with respect to technology progresses. To reduce the power consumption different logic design techniques like CMOS complementary logic, Dynamic

CMOS, Pseudo NMOS, Dynamic CMOS, CMOS Domino logic, Cascade voltage switch logic (CVSL), Modified Domino logic, Pass Transistor Logic (PTL) have been proposed [1-3]. Power dissipation depend on device structure property like gate oxide thickness, channel length, doping profile etc. as they are due to different physical phenomena Although Static CMOS Logic has been the most popular design approach for the past three decades [4, 5]. By scaling down the feature size of MOSFET devices in nano-meter, the supply voltage should be scaled down to avoid hot- carrier effects in CMOS circuits. To enhance the speed of the CMOS circuits, threshold voltage of the circuits has to be scaled down.

The dynamic XOR/XNOR logic gate is one of the most primary building blocks of arithmetic and logic unit (ALU) used in microprocessor. This plays an important role in silicon on chips (SoCs) to design ALU in small die area that reduce manufacturing cost. This system inbuilt ALU occupy more area on silicon chip that dissipate more heat and elevate the temperature of chip. This thing degrades the performance of system. In order to save the chip heat sink is needed that release the internal heat to external environment. As the operating frequency of dynamic XOR improves, dynamic power consumption becomes dominant that introduce heating problem as mentioned above. To overcome this problem, dynamic XOR gate proposed with minimum delay and smaller power consumption.

The remaining paper is organized in the following order. Section II presents the Literature of Exclusive-OR/NOR gate (XOR/XNOR) Unit, Section III describe the proposed cell schematic, Section IV presents result and discussion[6] and Section V presents the concluding remarks.

# II. Literature of Exclusive-OR/NOR (XOR/XNOR) Gate

The recently published novel work can be divided into two categories as they are extracted from the topic: 1) traditional three input XOR gate and 2) its operating methodologies. Cell design methodology (CDM) has been presented to design some limited functions, such as two-input XOR/XNOR and carry-inverse carry in the hybrid-CMOS style [5, 7, 8]. The predominant results persuade us to improve Cell Design Methodology (CDM) through two stages: 1) generating more complex functions and 2) rectifying some remaining flaws. The flaws in previously published CDM include containing some manual steps in the design flow and generating a large number of designs in which the predominant ones would be determined after the completion of simulations. Therefore, in the first stage, a three-input XOR/XNOR as one of the most complex and all-purpose three-input basic gates in arithmetic circuits have been chosen. If the efficiency of the circuits is confirmed in such a competitive environment, it can show the strength of the methodology. In the second stage, CDM is matured as systematic CDM (SCDM) in designing the three-input XOR/XNORs for the first time. It systematically generates elementary basic cell (EBC) using binary decision diagram (BDD), and wisely chooses circuit components based on a specific target. This takes place when the mentioned features are not considered in the CDM. Therefore, after the systematic generation, the SCDM considers circuit optimization based on our target in three steps: 1) wise selection of the basic cell; 2) wise selection of the amend mechanisms; and 3)

transistor sizing. It should be noted that BDD can be utilized for EBC generation of other three-input functions. We consider the power-delay product (PDP) as the design target. It stands as a fair performance metric, precisely involving portable electronic system targets. The motivation to use this methodology is the presence of some unique features and the ability to produce some efficient circuits that enjoy all these advantages.

The SCDM divides a circuit structure into a main structure and optimization-correction mechanisms. In the main structure, it considers features including the least number of transistors in critical path, fairly balanced outputs, being power ground- free, and symmetry. The mechanisms have the duty of completing the functionality of the circuits, avoiding any degradation on the output voltage, and increasing the driving capability. The dynamic consumption optimization comes from the fact of well-balanced propagation delay. This feature is advantageous for applications in which the skew between arriving signals is critical for proper operation, and for cascaded applications to reduce the chance of making glitches [9]. Power-ground-free main structure leads to power reduction.

The degradation in all output voltage swing can thus be completely removed, which makes the design sustainable in low VDD operations and low static power dissipation. The methodology has high flexibility in target and systematically considers it in the three design steps. This can lead to efficient circuits in terms of performance, power, power delay product, energy delay product, layout area.

The fast evolution of microelectronics fabrication processes demands a new cell library generation or a library technology migration. The well-organized systematic methodology leads to automated flow, which can reduce design time and costs, provide consistency in the cell library generation process, increase the range of simulation capabilities at the characteristics step, as well as minimize the risk of errors [7, 10]. Recently published article on hybrid type systematic cell design methodology (SCDM) applied on three circuits as shown in Fig.1, Fig.2 and Fig.3. The Three input XOR/XNOR based transmission gate using 16 transistors (XO4)[11] shown in Fig.1. Another three input XOR/XNOR based transmission gate and mirror circuit using 16 transistors (XO7)[11] shown in Fig.2. next is three input XOR/XNOR based transmission gate and mirror circuit using 18 transistors (XO10)[11] shown in Fig.3. All three circuits have six inputs (A, B, C and compliment of all three inputs) and two outputs (XOR and its compliment).



Fig.1 Three input XOR/XNOR using transmission gate(XO4)[11]



Fig.2 Three input XOR/XNOR using transmission gate and mirror circuit (XO7)[11]



**Fig.3** Three input XOR/XNOR using transmission gate and mirror circuit (XO10)[11]

### III. Proposed Exclusive-OR/NOR (XOR/XNOR) gate

The proposed work is based on 16 transistor circuit as shown in Fig. 4. As like existing SCDM, The predominant results persuade us to improve Cell Design Methodology (CDM) through two stages: 1) generating more complex functions and 2) rectifying some remaining flaws. In this cell, two back to back connected inverter pair is introduced between node XOR3 and XNOR3 using transistor TPP1, TPP2, TNN1 and TNN2. The proposed cell offers smaller delay and power consumption by eliminating transistors Tp1 and Tp2 from SCDM based XRG2 and XRG3 network.



Fig.4 Proposed three input XOR/XNOR gate

## **IV. RESULTS AND DISCUSSION**

To review the performance of three input XOR/XNOR gate, it is necessary to study the transient analysis and process variation. This comparison have been performed

complete study using Hspice [12]. To investigate further several modules with complementary outputs of full adders (FAs), TFA, Hernandez1, Hernandez2, NEWHPSC, Hybrid, 18T\_new\_FS, and LPHS-FA have been analysed, whose excellence have been confirmed in [13] [14] and [15-17]. Therefore, an approximately fair comparison will take place by selecting them. Table-1 shows that XRG1[11], XRG2[11] and XRG3[11] technique have least PDP and delay than other existing conventional LPHS-FA and TF technique. The comparison of Average Delay (ns), Average Power ( $\mu$ W), Average PDP (fJ), EDP ( $\mu$ W × ns) and Transistor Count are shown in Fig. 5, Fig. 6, Fig. 7, Fig. 8 and Fig. 9 respectively. The proposed cell offer least delay due to back to back connected inverter at its output node. The proposed design offers 40%, 40%, 33%, 75% and 81% lower delay than XRG1, XRG2, XRG3, Hybrid and LPHSFA, respectively as shown in Fig. 5. While proposed design consumes 30%, 38%, 33%, 13% and 48% higher dynamic power than XRG1, XRG2, XRG3, Hybrid and LPHSFA, respectively as shown in Fig. 6.



Fig.5 Comparison of average delay (ns) at 130nm technology



Fig.6 Comparison of average power (µW) at 130nm technology

Similarly proposed design offers lower PDP (power delay product) and lower EDP (energy delay product) than XRG1, XRG2, XRG3, Hybrid and LPHSFA. The transistor count of proposed design is same as existing.



Average PDP (fJ)



Fig.8 Comparison of EDP ( $\mu W \times ns$ )) at 130nm technology



#### **Transistor Count**

#### **V. CONCLUSION**

Exclusive OR/NOR gates are frequently used in high performance data processing units. Logic gate with high speed and low power are most demanded in modern high speed integrated circuit. The proposed Exclusive OR/NOR gate offers least delay and power consumption with published work XRG1, XRG2, XRG3, Hybrid and LPHSFA. The proposed design consumes 30%, 38%, 33%, 13% and 48% higher dynamic power than XRG1, XRG2, XRG3, Hybrid and LPHSFA, respectively. While the delay time of proposed design is 40%, 40%, 33%, 75% and 81% lower than XRG1, XRG2, XRG3, Hybrid and LPHSFA, respectively. Therefore, proposed is excellent choice for low power and high speed application.

#### **VI. REFERENCES**

- I. S. Abu-Khater, A. Bellaouar, and M. Elmasry, "Circuit techniques for CMOS low-power highperformance multipliers," IEEE Journal of solidstate circuits, vol. 31, pp. 1535-1546, 1996.
- [2]. U. Ko, T. Balsara, and W. Lee, "Low-power design techniques for high-performance CMOS adders," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 3, pp. 327-333, 1995.
- [3]. A. Bellaouar and M. Elmasry, Low-power digital VLSI design: circuits and systems: Springer Science & Business Media, 2012.
- [4]. A. M. Shams and M. A. Bayoumi, "A novel highperformance CMOS 1-bit full-adder cell," IEEE Transactions on circuits and systems II: Analog and digital signal processing, vol. 47, pp. 478-481, 2000.
- [5]. D. Radhakrishnan, "Low-voltage low-power CMOS full adder," IEE Proceedings-Circuits, Devices and Systems, vol. 148, pp. 19-24, 2001.
- [6]. W. Zhao and Y. Cao, "Predictive technology model for nano-CMOS design exploration," ACM Journal on Emerging Technologies in Computing Systems (JETC), vol. 3, p. 1, 2007.
- [7]. H. T. Bui, A. K. Al-Sheraidah, and Y. Wang, "New 4-transistor XOR and XNOR designs," in ASICs, 2000. AP-ASIC 2000. Proceedings of the Second IEEE Asia Pacific Conference on, 2000, pp. 25-28.
- [8]. N. Gong, B. Guo, J. Lou, and J. Wang, "Analysis and optimization of leakage current characteristics in sub-65nm dual V t footed domino circuits," Microelectronics Journal, vol. 39, pp. 1149-1155, 2008.
- [9]. H. Kaul, M. A. Anders, S. K. Mathew, S. K. Hsu, A. Agarwal, R. K. Krishnamurthy, et al., "A 320 mv 56 μw 411 gops/watt ultra-low voltage motion estimation accelerator in 65 nm cmos," IEEE Journal of Solid-State Circuits, vol. 44, pp. 107-114, 2009.

- [10]. J.-M. Wang, S.-C. Fang, and W.-S. Feng, "New efficient designs for XOR and XNOR functions on the transistor level," IEEE Journal of solid-state Circuits, vol. 29, pp. 780-786, 1994.
- [11]. T. Nikoubin, M. Grailoo, and C. Li, "Energy and Area Efficient Three-Input XOR/XNORs With Systematic Cell Design Methodology," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 24, pp. 398-402, 2016.
- [12]. A. Eshra and A. El-Sayed, "An odd parity checker prototype using DNAzyme finite state machine," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, pp. 316-324, 2014.
- [13]. C.-K. Tung, S.-H. Shieh, and C.-H. Cheng, "Lowpower high-speed full adder for portable electronic applications," Electronics Letters, vol. 49, pp. 1063-1064, 2013.
- [14]. M. Aguirre-Hernandez and M. Linares-Aranda, "CMOS full-adders for energy-efficient arithmetic applications," IEEE transactions on very large scale integration (VLSI) systems, vol. 19, pp. 718-721, 2011.
- [15]. S. Goel, M. A. Elgamel, M. A. Bayoumi, and Y. Hanafy, "Design methodologies for highperformance noise-tolerant XOR-XNOR circuits," IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 53, pp. 867-878, 2006.
- [16]. S. Goel, A. Kumar, and M. A. Bayoumi, "Design of robust, energy-efficient full adders for deepsubmicrometer design using hybrid-CMOS logic style," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 14, pp. 1309-1321, 2006.
- [17]. A. Shams and G. M. Zhang, "A review of 0.18 μm full adder performances for tree structure arithmetic circuits," IEEE Trans. Very Large Scale Integr.(VLSI) Syst, vol. 13, pp. 686-695, 2005.