METHODS FOR OPTIMIZING NEURAL NETWORK MODELS
Annotation
Methods for building optimized deep learning accelerators are discussed. Traditional approaches to fault-tolerant deep learning accelerators are shown to rely on redundant computation, which incurs substantial overheads in training time, power consumption, and integrated circuit area. A method is proposed that accounts for differences in the vulnerability of individual neurons and of the individual bits within each neuron, partially eliminating the need for blanket computational redundancy. The method enables selective protection of model components at the architectural and circuit levels, reducing overhead without compromising model reliability. It is further shown that quantizing the accelerator's model allows data to be represented with fewer bits, lowering hardware resource requirements.
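As a purely illustrative sketch (not code from the article), the fragment below shows the two techniques the annotation names: per-tensor symmetric int8 quantization, which represents weights with fewer bits, and selective bit-level protection, where only the bits assumed to be most vulnerable (here, the high-order bits of each quantized value) are guarded by triple modular redundancy with majority voting rather than by replicating the entire computation. The function names quantize_int8 and protect_high_bits, and the choice of which bits to protect, are assumptions made for illustration.

import numpy as np

def quantize_int8(weights: np.ndarray):
    """Per-tensor symmetric quantization of float weights to int8 (sketch)."""
    scale = np.max(np.abs(weights)) / 127.0
    q = np.clip(np.round(weights / scale), -128, 127).astype(np.int8)
    return q, scale

def protect_high_bits(q: np.ndarray, n_protected: int = 2) -> np.ndarray:
    """Majority-vote only the n_protected most significant bits of each
    int8 value; lower-order bits stay unprotected, saving redundancy."""
    u = q.view(np.uint8)                        # raw bit pattern
    copies = [u.copy(), u.copy(), u.copy()]     # triple modular redundancy
    # ... in a real system, faults may corrupt the copies here ...
    mask = np.uint8(((1 << n_protected) - 1) << (8 - n_protected))
    voted = ((copies[0] & copies[1]) | (copies[1] & copies[2])
             | (copies[0] & copies[2])) & mask  # per-bit majority, high bits
    repaired = (u & ~mask) | voted              # splice voted bits back in
    return repaired.view(np.int8)

w = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_int8(w)
dequant = q.astype(np.float32) * scale          # reconstruction error is small
print(np.max(np.abs(w - dequant)))
q_safe = protect_high_bits(q)

In this selective scheme the redundancy cost scales with the number of protected bits, not with full duplication of the datapath, which reflects the overhead reduction the annotation describes.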