Design of High Performance Parallel Multiplication using FPGA
…
6 pages
Sign up for access to the world's latest research
Abstract
In the recent decade, decimal arithmetic has received a lot of attention. In existing research on decimal multiplication, latency and area are two key factors.In any case, today's computerized frameworks and DSP applications; energy/power utilization is a pivotal thought.For quick DSP, low power and high speed multipliers are required. Because of its regular structure and ease of design, the array multiplier is one of the fastest multipliers. To boost multiplier speed and improve power dissipation with the least amount of delay, adders and CMOS power gating based CLA are employed. In the paired number framework, the significant issue in numbercrunching relates to convey. A higher rad-ix number framework, Quaternary Signed Digit (QSD), is utilized to perform number juggling tasks without convey.
Key takeaways
AI
AI
- Quaternary Signed Digit (QSD) improves decimal multiplication efficiency by minimizing carry propagation issues.
- The proposed multiplier design achieves 11.5% increase in performance and 10.13% reduction in power consumption.
- Array multipliers are simple but can lead to high power dissipation and delays if not optimized.
- Utilization of carry save-ahead adders enhances speed in the multiplication process, reducing overall latency.
- Simulation results show a 53.43% increase in Area-Delay Product (ADP) with advanced adder designs.
Related papers
International Journal of Engineering Research and, 2016
With the recent rapid increase in scale of integration, many sophisticated signal processing as well as video processing systems are being implemented on VLSI chip in which multiplication is dominant operation. The performance of these systems is based on computation capacity and power consumption. This paper presents novel approach of multiplication scheme based on Radix 4 and its implementation on FPGA which results in great computational capacity and reduced power consumption. This system has been designed and simulated using Xilinx 13.4 for 8x8 bit numbers.
Microelectronics Journal, 2013
Floating point arithmetic (FPA) is a crucial basic building block in many application domains such as scientific, numerical and signal processing applications. Multiplication is one of the most commonly used one in FPA. This paper presents three architectures targeting Double Precision (D.P.) multiplier, with one being capable of performing run-time-reconfigurable (RTR) dual Single Precision (S.P.) multiplication operation. The first design is based on a novel block-level truncated multiplication, which is able to reduce 1/3 of multiplier blocks with high performance, and is within 1-ULP (unit in the last place) precision from IEEE-754 floating-point standard precision. The second design regains the accuracy lost from the first design, with the same amount of multiplier blocks but some extra hardware, is also able to achieve better performance with less latency than existing work. The third architecture in this paper is able to perform either, with the single double (extended) precision or dual single (extended) precision operands, without any pipeline stall, and with attractive area, speed and latency results. The first design is suitable for the applications with slightly less precision requirement, whereas the other two designs are fully compatible to the IEEE standard accuracy. Design-1 is able to achieve around 300 MHz and 450 MHz on Virtex-4 (V4) and Virtex-5 (V5), respectively, with only 6 DSP48, and latency of 9 cycles. Design-2 is capable of achieving about 325 MHz (V4) and 400 MHz (V5), with only 6 DSP48, with full precision support. The third design achieves more than 250 MHz (V4) and 325 MHz (V5) speed, providing on-the-fly dual precision support, with hardware requirement similar to only double precision supported implementations in the literature. Promising results are obtained by comparing the proposed designs with the best reported floating point multipliers in the literature.
Computers & Electrical Engineering, 2019
Due to the increasing demand for decimal calculations in the business, financial and economic world, decimal arithmetic circuits have been much considered by system designers. This is mainly because, these applications heavily depend on decimal arithmetic since the results must match exactly those obtained by human calculations. While decimal multiplication is one of the most frequent and complex-to-implement decimal operations, the special case of constant decimal multiplication is widely used in the economic and financial applications. In this paper, we propose two ideas, named "Constant Decimal TCSD" (CDT) and "Constant Decimal DDDS" (CDD), and their hardware implementations for realizing multiple constant decimal multiplication. In the CDT and CDD architectures, the partial products are generated using a set of positive multiplicand multiples coded in2 s complement signed-digit format (TCSD) and binary coded decimal (BCD), respectively. We also present two new (3:1) compressors to reduce the number of partial products in both designs, one based on a new Double Decimal Digit Set (DDDS), which is not only selfcomplementing but also its redundancy, allows carry-free addition. Finally, a redundant to non-redundant converter recodes the TCSD and DDDS product to BCD in the first and second schemes, respectively. Hardware synthesis evaluation shows that compared to the most recent 16 × 16 decimal multipliers, delay, area, power consumption and PDP of the proposed multiple constant multipliers improve up to 57%, 89%, 93% and 97%, respectively.
IEEE Transactions on Computers, 2010
The new generation of high-performance decimal floating-point units (DFUs) is demanding efficient implementations of parallel decimal multipliers. In this paper, we describe the architectures of two parallel decimal multipliers. The parallel generation of partial products is performed using signed-digit radix-10 or radix-5 recodings of the multiplier and a simplified set of multiplicand multiples. The reduction of partial products is implemented in a tree structure based on a decimal multioperand carry-save addition algorithm that uses unconventional (non BCD) decimal-coded number systems. We further detail these techniques and present the new improvements to reduce the latency of the previous designs, which include: optimized digit recoders for the generation of 2 n-tuples (and 5-tuples), decimal carry-save adders (CSAs) combining different decimal-coded operands, and carry-free adders implemented by special designed bit counters. Moreover, we detail a design methodology that combines all these techniques to obtain efficient reduction trees with different area and delay trade-offs for any number of partial products generated. Evaluation results for 16-digit operands show that the proposed architectures have interesting area-delay figures compared to conventional Booth radix-4 and radix-8 parallel binary multipliers and outperform the figures of previous alternatives for decimal multiplication.
18th IEEE Symposium on Computer Arithmetic (ARITH '07), 2007
This paper introduces two novel architectures for parallel decimal multipliers. Our multipliers are based on a new algorithm for decimal carry-save multioperand addition that uses a novel BCD-4221 recoding for decimal digits. It significantly improves the area and latency of the partial product reduction tree with respect to previous proposals. We also present three schemes for fast and efficient generation of partial products in parallel. The recoding of the BCD-8421 multiplier operand into minimally redundant signed-digit radix-10, radix-4 and radix-5 representations using new recoders reduces the complexity of partial product generation. In addition, SD radix-4 and radix-5 recodings allow the reuse of a conventional parallel binary radix-4 multiplier to perform combined binary/decimal multiplications. Evaluation results show that the proposed architectures have interesting area-delay figures compared to conventional Booth radix-4 and radix-8 parallel binary multipliers and other representative alternatives for decimal multiplication.
Cuadernos.info, 1990
J@ti Undip: Jurnal Teknik Industri, 2017
Pertumbuhan akses informasi pada saat sekarang ini berjalan secara pesat. Masyarakat atau user untuk mendapatkan data atau informasi yang akurat, aktual, serta cepat dan mudah sekarang bukanlah hal sulit. Perkembangan internet pada saat ini semakin pesat dan tidak dapat dipisahkan dari dunia pendidikan. Terdapatnya sistem informasi dan internet yang terus berkembang, maka diciptakan suatu cara penyampaian pendidikan secara elektronik dengan bantuan internet. Sistem informasi yang dimaksud pada saat sekarang lebih dikenal dengan istilah elektronik learning (e-learning). E-learning adalah suatu sistem informasi yang membantu pembelajaran manual menjadi pembelajaran secara elektronik. Pada dunia pendidikan untuk membantu dosen dan mahasiswa untuk memanajemen mata kuliah, saat sekarang ini terdapat Sistem Informasi Akademik yang lebih disingkat dengan SIA. SIA adalah suatu sistem informasi yang mengelola dan mengolah semua data-data akademik yang berada di bidang akademik. KRS online yang berada di SIA berhubungan erat dengan mata kuliah yang akan terisi pada e-learning. Dari pandangan kebutuhan tersebut, ada baiknya kedua sistem informasi tersebut digabungkan sehingga dengan user mengisi SIA, maka sistem e-learning yang berhubungan dengan mata kuliah yang diikuti akan terupdate dengan sendirinya dan perlu didesain sedemikian rupa agar kedua sistem informasi tersebut dapat berjalan dengan baik dan dapat digunakan oleh para user. Pendekatan ergonomis yang diperlukan untuk sistem tersebut adalah Human Computer Interaction (HCI) sehingga sistem yang dikembangkan akan nyaman, mudah dan aman digunakan oleh user atau dengan kata lain adalah user friendly. Jurnal ini disusun dengan tujuan untuk mengembangkan suatu sistem integrasi SIA dengan elearning sehingga dapat membantu dan mempermudah aktifitas pembelajaran di Perguruan Tinggi.
Jurnal gizi Indonesia, 2017
Background: Iodine and iron were micronutrient that highly related with formation of thyroid hormone. Iodine and iron deficiency during pregnancy, will affect fetal development. Weight and born baby's head circumference were early indicator of infant development, because it describe the result of interaction of several factors for fetus during pregnancy. Objectives: To determine correlation between iodine and iron status of pregnant women in third trimester with born baby's weight and head circumference. Methods: Cross sectional study with 49 pregnant women on third trimester and their baby born. Measurement of iron status used sTfR level by ELISA and iodine status used EIU. Meanwhile for the born baby, measurement was taken place on the baby's weight and head circumference. Results: The average of iron status is 22,15 nmol/L, meanwhile average of iodine status is 168,63 μg/L with urine median value 165 μg/L. Statistical analysis with multiple correlation test showed no significant relationship between iron and iodine status of third trimester pregnant women to birth weight (r = 0,286; p = 0,141) as well as the babies head circumference (r = 0,195; p = 0,412). Conclusion: Iron and iodine status of third trimester pregnant women have low correlation to the born babies weight or head circumference in endemic area.
Walisongo: Jurnal Penelitian Sosial Keagamaan, 2012
The relation between ritual and socio-political environment should become a deep reflection. Rituals which take place in order for salvation, harmony, and natural balance instead generate the ambigue and ironic situation. Rituals had been going on amazingly but the social as well as natural disasters seems go on continually. In Bali, religious rituals that formerly guarded by mantra-mantra (spiritual wordings) recently enstead by a group of Pecalang (tradition guardian in Bali) and metal detector (at the time of Pamarisudha Karipubhaya Bali Blast of 2002 and 2005). Nowadays Balinese are eager to perform rituals spectacularly. But instead, Bali now is struck by continous disaster, not only the disaster came from external sources but also the internal ones. Hubungan antara ritual dengan lingkungan sosial politik harus menjadi bahan renungan yang dalam. Ritual yang dilakukan untuk tujuan keselamatan, harmoni, dan keseimbangan alam bahkan menimbulkan kondisi ambigue dan ironis. Ritual berjalan secara mengesankan namun bencana sosial maupun bencana alam terus menerus terjadi. Di Bali, ritual agama yang sebelumnya diwarnai mantra-mantra kini diisi oleh Pecalang (pengawal tradisi Bali) dan metal detector (pada saat Pamarisudha Kariphaya Bom Bali 2002 dan 2005). Kini orang Bali cenderung melaksanakan ritual secara spektakuler. Akan tetapi Bali seringkali didera bencana, baik bencana yang berasal dari dalam maupun luar.
References (21)
- Muralidharan, V., and N. Sathish Kumar. "Design and implementation of low power and high speed multiplier using quaternary carry look- ahead adder." Microprocessors and Microsystems 75 (2020): 103054.
- Véstias, Mário P., and Horácio C. Neto. "Decimal Multiplication in FPGA with a Novel Decimal Adder/Subtractor." Algorithms 14.7 (2021): 198.
- Ghose, Debaprasad, Dharamvir Kumar, and Manoranjan Pradhan. "Comparative Analysis of Decimal Fixed-Point Parallel Multipliers Using Signed Digit Radix-4, 5 and 10 Encodings." Recent Trends in Electronics and Communication. Springer, Singapore, 2022. 345-353.
- Gorgin, Saeid, HeliaZareieNejad, and Jeong-A. Lee. "A Practical Energy/Power Reduction Approach for Parallel Decimal Multiplier." IEEE Access (2022).
- Malekpour, Amin, Alireza Ejlali, and SaeidGorgin. "A comparative study of energy/power consumption in parallel decimal multipliers." Microelectronics Journal 45.6 (2018): 775-780.
- P.L. Thangkhiew, R. Gharpinde, P.V. Chowdhary, K. Datta, I. Sengupta, Area efficient implementation of ripple carry adder using memristor crossbar arrays, in: 2016 11th International Design & Test Symposium (IDT), IEEE, 2016, December, pp. 142-147.
- J. Barman, V. Kumar, Approximate carry look ahead adder (CLA) for error tolerant applications, in: 2018 2nd International Conference on Trends in Electronics and Informatics (ICOEI), IEEE, 2018, May, pp. 730-734.
- R.A. Javali, R.J. Nayak, A.M. Mhetar, M.C. Lakkannavar, Design of high speed carry save adder using carry lookahead adder, in: International Conference on Circuits, Communication, Control and Computing, IEEE, 2014, November, pp. 33-36.
- P. Devi, A. Girdher, B. Singh, Improved carry select adder with reduced area and low power consumption, Int. J. Comput. Appl. 3 (4) (June 2010) 14-18.
- J. Ramasamy, N.S. Kumar, B. Sivasankari, Approximate carry select adder based multiplierless multiplication (AMLM), Middle-East J. Sci. Res. 24 (9) (2016) 2772 -2777.
- Afreen, N. Fahmina, M. Mahaboob Basha, and S. Mohan Das. "Design and implementation of area-delay-power efficient CSLA based 32-bit array multiplier." 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT). IEEE, 2017.
- J. Kaur, L. Sood, Comparison between various types of adder topologies, IJCST 6 (1) (2015) 62-66.
- P. Paliwal, J.B. Sharma, V. Nath, Comparative study of FFA architectures using different multiplier and adder topologies, Microsyst. Technol. (2019) 1-8.
- B. Ramkumar and H.M. Kittur, "Low-power and area efficient carry select adder" IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 20, no. 2, pp. 371-375, Feb. 2012.
- M. NeelaSupraja, M. Mahaboob Basha "Low area Energy efficient Carry select Adder at 65nm technology," ICRAESIT., ISBN:978-93- 85100-47-5, Dec-2015.
- Silparaj.P, Punitha.V, "Design of Improved Array Multiplier by Carry Select Logic", IJRASET, ISSN: 2321-9653, vol. 3, issue 4, April- 2015.
- J. Barman, V. Kumar, Approximate carry look ahead adder (CLA) for error tolerant applications, in: 2018 2nd International Conference on Trends in Electronics and Informatics (ICOEI), IEEE, 2018, May, pp. 730-734.
- S. Gorgin, G. Jaberipur, and B. Parhami, ''Design and evaluation of decimal array multipliers,'' in Proc. 43th Asilomar Signals, Syst., Comput., 2009, pp. 1782-1786.
- L. Han and S.-B. Ko, ''High-speed parallel decimal multiplication with redundant internal encodings,'' IEEE Trans. Comput., vol. 62, no. 5, pp. 956-968, May 2013.
- M. de la GuiaSolaz and R. Conway, Comparative study on word length reduction and truncation for low power multipliers, in: Proceedings of the 33rd International Convention Mipro 2010, pp. 84-88.
- B. Kayalvizhi, N. Anies Fathima and T. Kavitha "Booth Recoded Wallace Tree Multiplier Using and Based Digitally Controlled Delay Lines" ARPN Journal of Engineering and Applied Sciences, pp, 2707- 2713, vol. 10, no. 6, April2015.
FAQs
AI
What benefits does the proposed QSD method provide compared to existing multipliers?add
The proposed Quaternary Signed Digit (QSD) multiplication method shows an 11.5% increase in effectiveness and 10.13% improvement in power efficiency compared to traditional multipliers.
How does parallel multiplication impact energy consumption in DSP applications?add
Research indicates that parallel multiplication algorithms significantly reduce power consumption, with the utilization of optimized carry select adders showing improvements of 46.04% in Power-Delay Product.
What novel approaches were used to generate partial products in the proposed methodology?add
The methodology utilizes both traditional multiplication methods and binary-coded decimal digit multiplier cells for partial product generation, achieving a 4% improvement in pipeline clock cycles.
What simulation results support the effectiveness of the carry look-ahead adder design?add
Simulation results reveal a 63% reduction in power and a 42% decrease in delay compared to conventional carry look-ahead adders, enhancing performance in DSP applications.
How does power consumption compare across different adder designs implemented?add
A study found that using a Han-Carlson adder based Vedic multiplier resulted in 53.43% increased Area-Delay Product versus traditional designs, indicating better resource usage.
R JENILA