High-performance matrix multiplication remains a cornerstone of numerical computing, underpinning a wide array of applications from scientific simulations to machine learning. Researchers continually ...
Abstract: Emerging applications, e.g., machine learning, large language models (LLMs), and graphic processing, are rapidly developing and are both compute-intensive and memory-intensive. Computing in ...
Abstract: In this paper, a table lookup-based computing technique is proposed to perform convolutional neural network (CNN) inference without multiplication, and its FPGA implementation is ...
Numerical data analysis has become a nearly indispensable tool in modern neuroscience. This course aims to equip graduate students with the fundamental mathematical skills in quantitative modeling and ...
This repository contains a comprehensive empirical study analyzing the architectural evolution and performance transition of GPUs from the 2000s era to modern accelerators. Through systematic ...