Program

Paper Session: Machine Learning 1

Date: July 24 (Wed)
Session: 11:00-11:40
Session Chair: Tao Wei

11:00Yannick Braatz, Taha Soliman, Shubham Rai, Dennis Sebastian Rieber and Oliver BringmannCoNAX: Towards Comprehensive Co-Design Neural Architecture Search Using HW Abstractions
11:20Hanqing Liu, Xiaole Cui, Sunrui Zhang, Mingqi Yin, Yuanyuan Jiang and Xiaoxin CuiA Convolutional Spiking Neural Network Accelerator with the Sparsity-aware Memory and Compressed Weights

Paper Session: Arithmetic

Date: July 24 (Wed)
Session: 15:00-16:30
Session Chair: Chester Sungchung Park

15:00Andreas Boettcher and Martin KummMultiplier Design Addressing Area-Delay Trade-offs by using DSP and Logic resources on FPGAs
15:20Georg Rutishauser, Joan Mihali, Moritz Scherer and Luca BeninixTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems
15:40Jens Karrenbauer, Sven Schönewald, Simon Klein and Holger BlumeEnhancing a Hearing Aid Processor with ISA Extensions Supporting Flexible Fixed-Point Formats
16:00Yasong Cao, Mei Wen, Junzhong Shen and Zhongxing LiBitShare: An Efficient Precision-Scalable Accelerator with Combining-Like-Terms GEMM

Paper Session: Architectures and Applications

Date: July 25 (Thu)
Session: 11:00-12:00
Session Chair: Qiang Liu

11:00Dhruv Gajaria, Tosiron Adegbija and Kevin GomezCHIME: Energy-Efficient STT-RAM-based Concurrent Hierarchical In-Memory Processing
11:20Songqiao Cui and Josep BalaschConfigurable Loop Shuffling via Instruction Set Extensions
11:40Salim Khemira, Xinyuan Wang, Anh Nguyen, Yutaka Tamiya, Makoto Taiji, Takahide Yoshikawa and Jason AndersonRaising Compute Density of Molecular Dynamics Simulation Through Approximate Memoization

Paper Session: Tools

Date: July 25 (Thu)
Session: 13:30-14:30
Session Chair: Matthew Tang

13:30Yuanhai Zhang, Shuai Zhao, Gang Chen and Kai HuangFault-tolerant DAG Scheduling with Runtime Reconfiguration on Multicore Real-Time Systems
13:50Tianyi Yu, Omar Ragheb, Stephen Wicklund and Jason AndersonMLIR-to-CGRA: A Versatile MLIR-Based Compiler Framework for CGRAs
14:10Yunfeng Deng and Haifeng SunA DRL-based multi-priority task division scheduling strategy in IIoT

Paper Session: Crypto & Security

Date: July 25 (Thu)
Session: 15:00-15:40
Session Chair: Minoru Watanabe

15:00Zhuoheng Ran, Muhammad A.A. Abdelgawad, Zekai Zhang, Ray C.C. Cheung and Hong YanRO-SVD: A Reconfigurable Digital Copyright Protection Framework for AIGC Applications
15:20Yang Yang, Rajgopal Kannan and Viktor PrasannaA Low Latency FPGA Framework for Homomorphic Encryption Operations

Paper Session: Machine Learning 2

Date: July 26 (Fri)
Session: 11:00-12:00
Session Chair: Martin Kumm

11:00Haoran Su and Nan WuDeoxys: Defensive Approximate Computing for Secure Graph Neural Networks
11:20Zehuan Zhang, Matej Genci, Hongxiang Fan, Andreas Wetscherek and Wayne LukAccelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network
11:40Zhenyu Xu, Miaoxiang Yu, Jillian Cai, Qing Yang and Tao WeiTwinStep Network (TwNet): a Neuron-Centric Architecture Achieving Rapid Training

Paper Session: GPUs

Date: July 26 (Fri)
Session: 13:30-14:10

13:30Paul Delestrac, Jonathan Miquel, Debjyoti Bhattacharjee, Diksha Moolchandani, Francky Catthoor, Lionel Torres and David NovoAnalyzing GPU Energy Consumption in Data Movement and Storage
13:50Shinya Miura, Qiong Chang and Jun MiyazakiK-way In-place Merge by CPU-GPU Cooperative Processing

Paper Session: Linear Algebra Acceleration

Date: July 26 (Fri)
Session: 14:40-15:40
Session Chair: Jun Miyazaki

14:40Valentin Isaac-Chassande, Adrian Evans, Yves Durand and Frédéric RousseauSpDCache: Region-Based Reduction Cache for Outer-Product Sparse Matrix Kernels
15:00Dominik Walter, Thomas Adamtschuk, Frank Hannig and Jürgen TeichAnalysis and Optimization of Block LU Decomposition for Execution on Tightly Coupled Processor Arrays
15:20Shengbai Luo, Sheng Ma, Bo Wang, Yihao Shi, Qingshan Xue and Xueyi ZhangSparm: A Sparse Matrix Multiplication Accelerator Supporting Multiple Dataflows

Poster Presentations 1

Date: July 24 (Wed)
Session: 11:40-12:00

Frederik Kautz, Sven Gesper, Gia Bao Thieu, Hans-Martin Bluethgen, Holger Blume and Guillermo Payá-VayáMulti-Level Prototyping of a Vertical Vector AI Processing System
Yazhou Yan, Jiangnan Li, Guowei Zhu, Wenbo Yin and Lingli WangAn End-to-End Agile Design Framework to Improve Energy Efficiency on CGRAs
Oliver Renke, Christoph Riggers, Jens Karrenbauer and Holger BlumeDesign space exploration of semantic segmentation CNN SalsaNext for constrained architectures
Hongbin Wang, Zou Yi, Guohua Wen and Junfeng HuMemory Access Acceleration Through Architecture Design for Edge SoCs
Tenghao Zhao and Zhaohui YeZeroVex: A Scalable and High-performance RISC-V Vector Processor Core for Embedded Systems
Yingchang Mao and Qiang LiuMSCA: A Multi-grained Sparse Convolution Accelerator for DNN Training
Yao Liu, Shiyang Chen, Guolong Yang, Long Ma and Kun WanReal-time Order Book Building and Snapshot Generating for High Frequency Trading on FPGA
Yuchen Gui, Qizhe Wu, Wei Yuan, Huawen Liang, Xiaotian Wang and Xi JinA FPGA-HBM-based Hardware Streaming Accelerator for GNN Sampling
Shipeng Yue, Honghao Liang, Xinpeng Xing and Haigang FengSLICE Matrix: A Memory Access Scheduling Policy for Multicore Network Processors

Poster Presentations 2

Date: July 25 (Thu)
Session: 10:00-10:30

Yude Fang, Junhui Wang, Libo Huang, Yongwen Wang and Weixia XuOut-of-Order and Recursive RAS: A Return Address Stack Design On High Performance Processor
Riadh Benabdelhamid, Vladislav Valek and Dirk KochSPARKLE: A 1024-Core/16,384-Thread single FPGA many-core RISC-V barrel processor Overlay
Uyong Lee, Yeji Park, Junsu Heo, Sungkyung Park and Chester Sungchung ParkDesign Space Exploration of FFT Accelerators for IEEE 802.11ax Using High-Level Synthesis
Junzhe Huang, Qiang Dou and Li ShenExtending the RISC-V Instruction Set for High Performance Data Compression Hardware Acceleration
Shaoyang Sun, Boyin Jin, Jiahang Lou, Jiangnan Li, Yuhang Cao, Jingyuan Li, Chen Shen, Yuan Dai, Wenbo Yin and Lingli WangMDCRA: A Reconfigurable Accelerator Framework for Multiple Dataflow Lanes
Yiyang Lin, Yi Zou and Yanfeng YangCSIFA: A Configurable SRAM-based In-memory FFT Accelerator
Matteo Perotti, Michele Raeber, Mattia Sinigaglia, Matheus Cavalcante, Davide Rossi and Luca BeniniSpatzformer: An Efficient Reconfigurable Dual-Core RISC-V V Cluster for Mixed Scalar-Vector Workloads
Dong-eon Won, Yeeun Kim, Janghwan Lee, Minjae Lee, Jonghyun Bae, Jongjoo Park, Jeongyong Song and Jungwook ChoiISP2DLA: Automated Deep Learning Accelerator Design for On-Sensor Image Signal Processing
Tao Cai, Jianfei Dai, Dejiao Niu, Lei Li, Zeyu Huang and Qiangqiang NiA LLC-Friendly LSM-tree

PhD Forum


Mohamed Bouaziz and Suhaib A. FahmyLeveraging MLIR for efficient irregular-shaped CGRA overlay design
Qingchen Zhai, Zhiwei Zhang and Ruozhou XiaoLLM Based End-to-end Branch Predictor Optimization Generator
Annina Gutermann and Juergen BeckerA Full-System Approach to Multi-Valued Logic Design
Mateusz Wygrzywalski and Robert SzczygiełLightweight Extension of RISC-V Core for NTT-like Algorithms
Xiqin Tang and Delong ShangDesign of High-performance while Energy-efficient Microprocessor with Novel Asynchronous Techniques
Zekai Zhang and Ray Chak Chung CheungDesign of Light-weight Encryption Algorithm Based on RISC-V Platform
Yuki Shimamura, Minoru Watanabe and Nobuya WatanabeVoltage range evaluation of an optically reconfigurable gate array VLSI
Kangli Zhao, Anping He and Di ZhaoResearch on High-Efficiency Asynchronous Superscalar Processors