A Resource Reuse Strategy for Large-Scale Matrix Operations
A Resource Reuse Strategy for Large-Scale Matrix Operations in HLS-Based FPGA Design
Abstract:
Matrix operations (MOPs) are essential for various computational tasks, particularly in deep learning models, which have grown increasingly complex. As these models expand, their demand for computational resources increases significantly, making deployment on resource-limited hardware platforms, such as field-programmable gate arrays, more challenging, especially in balancing re-source allocation and computation latency. Reuse-control techniques have been employed to optimize resource allo-cation by enabling multiple operations to share the same computational units, such as digital signal processors. Yet, this approach presents a tradeoff between resource utiliza-tion and latency. In this study, we tackle this challenge by thoroughly analyzing existing reuse-control mechanisms and introducing a novel integer linear programming (ILP)-based strategy. Our experimental results demonstrate that the proposed approach not only improves resource utiliza-tion for large-scale MOPs but also significantly reduces latency compared to existing methods. In the best case, on the ResNet model, our ILP-based method achieves up to 2.21× lower latency and 2.14× lower energy consumption per inference, demonstrating significantly improved perfor-mance and energy efficiency. In addition, our work provides a new optimization perspective for hardware design based on high-level synthesis.
” Thanks for Visit this project Pages – Register This Project and Buy soon with Novelty “
A Resource Reuse Strategy for Large-Scale Matrix Operations in HLS-Based FPGA Design