Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lai, Bo-Cheng | en_US |
dc.contributor.author | Pan, Jyun-Wei | en_US |
dc.contributor.author | Lin, Chien-Yu | en_US |
dc.date.accessioned | 2019-08-02T02:18:37Z | - |
dc.date.available | 2019-08-02T02:18:37Z | - |
dc.date.issued | 2019-05-01 | en_US |
dc.identifier.issn | 1063-8210 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1109/TVLSI.2019.2897052 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/152414 | - |
dc.description.abstract | Although the existing single-instruction-multiple-data-like (SIMD) accelerators can handle the compressed format of sparse convolutional neural networks, the sparse and irregular distributions of nonzero elements cause low utilization of multipliers in a processing engine (PE) and imbalanced computation between PEs. This brief addresses the above issues by proposing a data screening and task mapping (DSTM) accelerator which integrates a series of techniques, including software refinement and hardware modules. An efficient indexing module is introduced to identify the effectual computation pairs and skip unnecessary computation in a fine-grained manner. The intra-PE load imbalance is alleviated with weight data rearrangement. An effective task sharing mechanism further balances the computation between PEs. When compared with the state-of-the-art SIMD-like accelerator, the proposed DSTM enhances the average PE utilization by 3.5x. The overall processing throughput is 59.7% higher than the previous design. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Load balance | en_US |
dc.subject | machine learning | en_US |
dc.subject | single-instruction-multiple-data (SIMD) architecture | en_US |
dc.subject | sparse convolutional neural networks (CNNs) | en_US |
dc.title | Enhancing Utilization of SIMD-Like Accelerator for Sparse Convolutional Neural Networks | en_US |
dc.type | Article | en_US |
dc.identifier.doi | 10.1109/TVLSI.2019.2897052 | en_US |
dc.identifier.journal | IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS | en_US |
dc.citation.volume | 27 | en_US |
dc.citation.issue | 5 | en_US |
dc.citation.spage | 1218 | en_US |
dc.citation.epage | 1222 | en_US |
dc.contributor.department | 電子工程學系及電子研究所 | zh_TW |
dc.contributor.department | Department of Electronics Engineering and Institute of Electronics | en_US |
dc.identifier.wosnumber | WOS:000466226400020 | en_US |
dc.citation.woscount | 0 | en_US |
Appears in Collections: | Articles |