Transformer-based models have great potential to deliver higher accuracy for object recognition applications than convolutional neural networks (CNNs). However, a transformer-based model exhibits significantly less weight sharing than a CNN, so a different dataflow is needed to reduce memory access. This brief proposes a transformer accelerator with an output block stationary (OBS) dataflow that minimizes repeated memory access through block-level and vector-level broadcasting while preserving a high digital signal processor (DSP) utilization rate, leading to higher energy efficiency. It also lowers the memory access bandwidth for inputs and outputs. Verified on an FPGA, the proposed accelerator evaluates a transformer-in-transformer (TNT) model with a throughput of 728.3 GOPs, corresponding to an energy efficiency of 58.31 GOPs/W.
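As a rough software illustration of the output-block-stationary idea (not the brief's actual hardware design), the sketch below shows a tiled matrix multiplication in which each output block is held "stationary" in a local accumulator while the corresponding input and weight blocks are streamed past it, so every output block is written back to memory only once. The function name, block size `TB`, and the matrix shapes are illustrative assumptions.

```python
import numpy as np

def obs_matmul(A, W, TB=16):
    """Output-block-stationary tiled matmul sketch.

    A: (M, K) input activations, W: (K, N) weights.
    Each (TB x TB) output block stays in a local accumulator while
    the A and W blocks needed to produce it are streamed in, so the
    output is written to main memory only once per block.
    """
    M, K = A.shape
    K2, N = W.shape
    assert K == K2
    C = np.zeros((M, N), dtype=A.dtype)
    for i in range(0, M, TB):            # output block row
        for j in range(0, N, TB):        # output block column
            acc = np.zeros((min(TB, M - i), min(TB, N - j)), dtype=A.dtype)
            for k in range(0, K, TB):    # stream input/weight blocks past the stationary output block
                a_blk = A[i:i + TB, k:k + TB]
                w_blk = W[k:k + TB, j:j + TB]
                acc += a_blk @ w_blk
            C[i:i + TB, j:j + TB] = acc  # single write-back per output block
    return C

# Example usage (hypothetical sizes):
# C = obs_matmul(np.random.rand(64, 128).astype(np.float32),
#                np.random.rand(128, 96).astype(np.float32))
```

In hardware, the same principle lets the accelerator broadcast each fetched input or weight block to all processing elements that need it, which is the repeated-access reduction the OBS dataflow targets.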
Software Implementation:
Modelsim
Xilinx
An FPGA-Based Transformer Accelerator Using Output Block Stationary Dataflow for Object Recognition Applications