GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs

Last update: Dec 13, 2022

Overview

GraphLily is the first FPGA overlay for graph processing. It supports a rich set of graph algorithms by adopting the GraphBLAS programming interface, which formulates graph algorithms as sparse linear algebra kernels. By co-designing the data layout and the accelerator architecture, GraphLily uses the high bandwidth of HBM to accelerate SpMV and SpMSpV, two widely used kernels in GraphBLAS. GraphLily also provides a middleware for runtime support, so users can port existing GraphBLAS programs from CPUs/GPUs to FPGAs with little effort.
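To make the GraphBLAS formulation concrete, here is a minimal pure-Python sketch of BFS written as repeated SpMV over the Boolean (OR, AND) semiring on a CSR matrix. It is an illustration only, not GraphLily's API: the function names `spmv_bool` and `bfs_levels` are hypothetical, and the sketch assumes that row i of the matrix lists the vertices with an edge into vertex i (for an undirected graph this is simply the adjacency matrix).

```python
def spmv_bool(indptr, indices, x):
    """Compute y = A x over the Boolean (OR, AND) semiring, A given in CSR form."""
    num_rows = len(indptr) - 1
    y = [False] * num_rows
    for row in range(num_rows):
        # OR together x[col] for every nonzero column in this row.
        for k in range(indptr[row], indptr[row + 1]):
            if x[indices[k]]:
                y[row] = True
                break
    return y


def bfs_levels(indptr, indices, source):
    """Return the BFS level of every vertex (-1 if unreachable from source)."""
    num_vertices = len(indptr) - 1
    levels = [-1] * num_vertices
    levels[source] = 0
    frontier = [False] * num_vertices
    frontier[source] = True
    depth = 0
    while any(frontier):
        depth += 1
        reached = spmv_bool(indptr, indices, frontier)
        # Mask out vertices that were already visited in an earlier iteration.
        frontier = [reached[v] and levels[v] == -1 for v in range(num_vertices)]
        for v in range(num_vertices):
            if frontier[v]:
                levels[v] = depth
    return levels


# Undirected path 0-1-2-3 plus an isolated vertex 4.
indptr = [0, 1, 3, 5, 6, 6]
indices = [1, 0, 2, 1, 3, 2]
print(bfs_levels(indptr, indices, 0))
```

Each BFS iteration is one matrix-vector product followed by a mask. SpMSpV is the same product with the frontier stored as a sparse vector, which matters when only a few vertices are active.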

For more information, refer to our ICCAD'21 paper.

@article{hu2021graphlily,
  title={GraphLily: Accelerating Graph Linear Algebra on HBM-Equipped FPGAs},
  author={Hu, Yuwei and Du, Yixiao and Ustun, Ecenur and Zhang, Zhiru},
  journal={International Conference On Computer Aided Design},
  year={2021}
}

Prerequisites

Platform: Xilinx Alveo U280
Tool: Xilinx Vitis 2019.2

Run Benchmarking

Clone the repo

git clone git@github.com:cornell-zhang/GraphLily.git
export GRAPHLILY_ROOT_PATH=/path/to/GraphLily

Get the bitstream

A pre-compiled bitstream (166 MHz) is provided here.
To generate a new bitstream:

cd GraphLily/generate_bitstream
make synthesize

Prepare datasets

The input is an adjacency matrix in CSR (compressed sparse row) format, stored as a SciPy npz file. Please install cnpy, which is required for data loading.
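As a rough sketch of what such an input looks like, the following Python snippet converts an edge list to CSR arrays and, optionally, writes them out with SciPy. The helper names `edges_to_csr` and `save_csr_npz` are hypothetical and not part of GraphLily. The value type and any preprocessing the GraphLily loader expects are not stated here, so treat the saved file as an assumption to check against the benchmark code.

```python
def edges_to_csr(num_vertices, edges):
    """Convert a directed edge list [(src, dst), ...] into CSR arrays."""
    # Count the outgoing edges of each vertex, then prefix-sum into indptr.
    counts = [0] * num_vertices
    for src, _ in edges:
        counts[src] += 1
    indptr = [0] * (num_vertices + 1)
    for v in range(num_vertices):
        indptr[v + 1] = indptr[v] + counts[v]

    # Fill column indices row by row; sorting keeps each row's columns ordered.
    indices = [0] * len(edges)
    cursor = list(indptr[:-1])
    for src, dst in sorted(edges):
        indices[cursor[src]] = dst
        cursor[src] += 1
    data = [1.0] * len(edges)  # unweighted graph: every stored entry is 1
    return indptr, indices, data


def save_csr_npz(path, num_vertices, edges):
    """Write the graph as a SciPy .npz file (needs numpy and scipy installed)."""
    import numpy as np
    from scipy import sparse

    indptr, indices, data = edges_to_csr(num_vertices, edges)
    # Assumption: default float64 values; adjust the dtype if the loader needs another.
    matrix = sparse.csr_matrix(
        (np.array(data), np.array(indices), np.array(indptr)),
        shape=(num_vertices, num_vertices),
    )
    sparse.save_npz(path, matrix)


# Undirected path 0-1-2, stored as two directed edges per link.
print(edges_to_csr(3, [(0, 1), (1, 0), (1, 2), (2, 1)]))
```

Calling `save_csr_npz("tiny_graph.npz", 3, [(0, 1), (1, 0), (1, 2), (2, 1)])` would produce a file that `scipy.sparse.load_npz` can read back. The file name is only an example.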

Our ICCAD'21 paper evaluated the following six graph datasets:

Run

In the GraphLily/benchmark folder, set the cnpy path in the Makefile and the bitstream and dataset paths in run_bfs.sh, then run:

bash run_bfs.sh

Owner

Cornell Zhang Research Group
