Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

A Twofold Lookup Table Architecture for Efficient Approximation of Activation Functions (CROSBI ID 282139)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Xie, Yusheng ; Raj, Alex Noel Joseph ; Hu, Zhendong ; Huang, Shaohaohan ; Fan, Zhun ; Joler, Miroslav A Twofold Lookup Table Architecture for Efficient Approximation of Activation Functions // IEEE transactions on very large scale integration (VLSI) systems, 28 (2020), 12; 2540-2550. doi: 10.1109/TVLSI.2020.3015391

Podaci o odgovornosti

Xie, Yusheng ; Raj, Alex Noel Joseph ; Hu, Zhendong ; Huang, Shaohaohan ; Fan, Zhun ; Joler, Miroslav

engleski

A Twofold Lookup Table Architecture for Efficient Approximation of Activation Functions

In this article, we propose a novel approach to reduce hardware resource consumption when neural networks (NNs) are deployed on field- programmable gate array (FPGA) boards. Rather than using a classical approach with lookup tables (LUTs) to approximate the activation functions of an NN, the proposed solution is based on a twofold LUT (t-LUT) architecture, which comprises an error-LUT (e-LUT) and a data-LUT (d-LUT), in order to achieve high precision and speed as well as low hardware resource consumption. The efficiency of the proposed approach was tested against multiple earlier approaches. Our solution showed that the compressibility of the previously referenced works, which were based on single LUTs, could be improved by up to 94.44% and those that were based on a range addressable LUT (RALUT) by up to 6.35% in the examined case of a hyperbolic tangent (tanh) activation function. Moreover, when RALUT and our architecture were combined, it improved the compressibility of the RALUT-based result by up to additional 10.21% for a tanh activation function. The designed architecture had an initial latency of 39.721 ns, when tested with a 50-MHz clock, to simultaneously retrieve data from the d-LUT and t-LUTs.

Activation functions ; field-programmable gate array (FPGA) ; twofold lookup table (t-LUT)

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

28 (12)

2020.

2540-2550

objavljeno

1063-8210

1557-9999

10.1109/TVLSI.2020.3015391

Povezanost rada

Elektrotehnika, Računarstvo

Poveznice
Indeksiranost