Efficient Message Passing Architecture for GCN Training on HBM-based FPGAs with Orthogonal Topology On-Chip Networks
cs.AR
cs.LG
Released Date: November 6, 2024
Authors: Qizhe Wu¹, Letian Zhao¹, Yuchen Gui¹, Huawen Liang, Xiaotian Wang
Aff.: ¹University of Science and Technology of China, Hefei, Anhui, China
| | | GPU | HP-GNN (Lin et al., 2022) | Ours |
|---|---|---|---|---|
| Platform | Device | Nvidia A100 | Alveo U250 | VCU128 |
| Platform | Peak Perf. | 19.5 TFLOPS | 1.8 TFLOPS | 2 TFLOPS |
| Platform | On-chip Mem. | 40 MB | 54 MB | 43 MB |
| NS-GCN | Flickr | 0.21 | 0.16 | 0.09 |
| NS-GCN | | 6.59 | 1.09 | 1.05 |
| NS-GCN | Yelp | 2.90 | 1.35 | 1.11 |
| NS-GCN | AmazonP. | 5.06 | 3.49 | 1.92 |
| NS-SAGE | Flickr | 0.29 | 0.22 | 0.12 |
| NS-SAGE | | 3.05 | 1.56 | 1.37 |
| NS-SAGE | Yelp | 3.51 | 1.85 | 1.64 |
| NS-SAGE | AmazonP. | 6.83 | 4.83 | 3.65 |
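
One way to read the table is to divide each baseline's value by the corresponding value in the Ours column. The sketch below does this for the rows that name a dataset. It assumes the numbers are copied correctly from the table; the page does not state the metric or its unit, so the ratios are relative magnitudes only. The names `rows` and `ratios` are illustrative and do not come from the paper.

```python
# Values copied from the table: (GPU, HP-GNN, Ours).
# The source page does not state the metric or unit, so the results
# below are plain ratios with no claim about what they measure.
rows = {
    ("NS-GCN", "Flickr"): (0.21, 0.16, 0.09),
    ("NS-GCN", "Yelp"): (2.90, 1.35, 1.11),
    ("NS-GCN", "AmazonP."): (5.06, 3.49, 1.92),
    ("NS-SAGE", "Flickr"): (0.29, 0.22, 0.12),
    ("NS-SAGE", "Yelp"): (3.51, 1.85, 1.64),
    ("NS-SAGE", "AmazonP."): (6.83, 4.83, 3.65),
}


def ratios(gpu, hp_gnn, ours):
    """Return (gpu / ours, hp_gnn / ours), each rounded to two decimals."""
    return round(gpu / ours, 2), round(hp_gnn / ours, 2)


if __name__ == "__main__":
    for (model, dataset), values in rows.items():
        gpu_ratio, hp_ratio = ratios(*values)
        print(f"{model:8s} {dataset:9s} GPU/Ours={gpu_ratio:.2f}  HP-GNN/Ours={hp_ratio:.2f}")
```

The two rows whose dataset name is missing from the table are left out of `rows` rather than guessed.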