HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks

Last update: Sep 11, 2022

Overview

Approximate Multiplier by HEAM

What's HEAM?

HEAM is a general optimization method for generating high-efficiency approximate multipliers tailored to specific applications.
This project contains an 8×8 unsigned approximate multiplier produced with HEAM for Deep Neural Network (DNN) accelerators, together with the corresponding Design Compiler (DC) script. The exact WallaceTree multiplier is also included for comparison.
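
To make the term "approximate multiplier" concrete, the sketch below compares an exact 8×8 unsigned product with a deliberately simple approximation that drops the partial-product bits in the lowest columns. This is only an illustration of the accuracy trade-off: it is not the HEAM design, and the names truncated_mul and dropped_columns are introduced here for the example.

```python
def exact_mul(a: int, b: int) -> int:
    """Exact 8x8 unsigned product."""
    return a * b


def truncated_mul(a: int, b: int, dropped_columns: int = 4) -> int:
    """Sum the partial-product bits a[i] & b[j], skipping every bit whose
    column index i + j is below dropped_columns (illustrative scheme only)."""
    total = 0
    for i in range(8):
        for j in range(8):
            if i + j >= dropped_columns and (a >> i) & 1 and (b >> j) & 1:
                total += 1 << (i + j)
    return total


# Exhaustive error statistics over all 256 * 256 input pairs.
errors = [
    exact_mul(a, b) - truncated_mul(a, b)
    for a in range(256)
    for b in range(256)
]
mean_error = sum(errors) / len(errors)
max_error = max(errors)

print(f"mean error: {mean_error}")
print(f"max error:  {max_error}")
```

Dropping columns removes hardware (fewer AND gates and adder cells) at the cost of a bounded error; an optimization method such as HEAM chooses the approximation for a target application rather than by a fixed rule like this one.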

Optimization Procedure of the 8×8 Unsigned Approximate Multiplier

How to compile them?

Make sure that Design Compiler (DC) is installed and that your library files are prepared.

Compile approximate_multiplier.v

Step 1: Set TOP_LEVEL, all_src, and TOP in scripts/top.tcl on lines 1, 11, and 15, respectively:

set TOP_LEVEL approximate_multiplier
set all_src "approximate_multiplier.v"
set TOP approximate_multiplier

Step 2: Run the following commands in a terminal (the second is entered at the dc_shell prompt):

dc_shell
source scripts/top.tcl

Compile wallacetree.v

Step 1: Set TOP_LEVEL, all_src, and TOP in scripts/top.tcl on lines 1, 11, and 15, respectively:

set TOP_LEVEL wallacetree
set all_src "wallacetree.v"
set TOP wallacetree

Step 2: Run the following commands in a terminal (the second is entered at the dc_shell prompt):

dc_shell
source scripts/top.tcl
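
Because the two builds differ only in the three set lines, the edit in step 1 can be scripted. The following is a minimal sketch in Python, assuming each of the three variables is assigned by a line that starts with "set <name> " as shown above. The helper name retarget and the sample script text are hypothetical; the remaining contents of scripts/top.tcl are not reproduced here.

```python
import re


def retarget(tcl_text: str, design: str) -> str:
    """Return tcl_text with TOP_LEVEL, all_src, and TOP pointing at design."""
    new_values = {
        "TOP_LEVEL": design,
        "all_src": f'"{design}.v"',
        "TOP": design,
    }
    out = []
    for line in tcl_text.splitlines():
        # Match lines of the form: set <name> <value>
        m = re.match(r"^set\s+(\w+)\s", line)
        if m and m.group(1) in new_values:
            name = m.group(1)
            line = f"set {name} {new_values[name]}"
        out.append(line)
    return "\n".join(out) + "\n"


# Hypothetical stand-in for scripts/top.tcl: only the three set lines
# come from the README, the comment lines are placeholders.
sample = (
    "set TOP_LEVEL approximate_multiplier\n"
    "# ... other settings ...\n"
    'set all_src "approximate_multiplier.v"\n'
    "# ... other settings ...\n"
    "set TOP approximate_multiplier\n"
)

result = retarget(sample, "wallacetree")
print(result)
```

To apply this to the real file, read scripts/top.tcl, pass its text through retarget, and write the result back, keeping a backup in case the script's layout differs from the assumption above.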

Experiments with the Approximate Multiplier and the Exact WallaceTree Multiplier in Design Compiler (DC) at 3 GHz with the 7-nm ASAP7 Predictive Process Design Kit (PDK) [1]

Metric	Ours	WallaceTree	Reduction
Area (μm²)	17.52516	42.98184	59.23%
Power (μW)	76.2003	151.9432	49.85%
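
The Reduction column can be recomputed from the other two columns as (WallaceTree - Ours) / WallaceTree. A short Python check, with the function name reduction_percent introduced here for illustration:

```python
def reduction_percent(ours: float, baseline: float) -> float:
    """Relative saving of ours against baseline, in percent."""
    return (baseline - ours) / baseline * 100.0


# Values copied from the table above.
area = reduction_percent(17.52516, 42.98184)
power = reduction_percent(76.2003, 151.9432)

print(f"Area reduction:  {area:.2f}%")
print(f"Power reduction: {power:.2f}%")
```

Rounded to two decimals, this reproduces the 59.23% and 49.85% figures reported in the table.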

Future

Add several reproduced approximate multipliers for comparison.
Add DNN accelerator results.

Reference

[1] Clark, Lawrence T., et al. "ASAP7: A 7-nm finFET predictive process design kit." Microelectronics Journal 53 (2016): 105-115.
