A light-speed large language model accelerator with optical stochastic computing

Abstract

To address the increasingly intensive computational demands of attention-based large language models (LLMs), there is growing interest in developing energy-efficient and high-speed hardware accelerators. To that end, photonics is being considered as an alternative technology to digital electronics. This work introduces a novel optical hardware accelerator that leverages stochastic computing principles for LLMs. Our proposed accelerator incorporates full-range optical stochastic multipliers and stochastic-analog compute-capable optical-to-electrical transducer units to efficiently handle both static and dynamic tensor computations in attention-based models. Our analysis shows that our accelerator achieves at least a 7.6× speedup and 1.3× lower energy consumption compared to state-of-the-art LLM hardware accelerators.
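The abstract describes the multipliers as "full-range," which in stochastic computing conventionally corresponds to the bipolar encoding of values over [-1, 1], where the product of two independent bitstreams reduces to a per-bit XNOR. As a minimal software sketch of that principle (an illustration of conventional stochastic multiplication, not the paper's optical implementation; the function names encode_bipolar, decode_bipolar, and stochastic_multiply are hypothetical):

import numpy as np

def encode_bipolar(x, n_bits, rng):
    # Bipolar encoding: x in [-1, 1] becomes a bitstream with P(bit = 1) = (x + 1) / 2.
    return rng.random(n_bits) < (x + 1) / 2

def decode_bipolar(bits):
    # Invert the encoding: estimate x as 2 * (fraction of ones) - 1.
    return 2.0 * bits.mean() - 1.0

def stochastic_multiply(x, y, n_bits=4096, seed=0):
    # XNOR of two independent bipolar bitstreams encodes the product x * y.
    rng = np.random.default_rng(seed)
    a = encode_bipolar(x, n_bits, rng)
    b = encode_bipolar(y, n_bits, rng)
    return decode_bipolar(~(a ^ b))

print(stochastic_multiply(0.5, -0.6))  # approximately -0.30

The estimate's error shrinks as the bitstream length grows (on the order of 1/sqrt(n_bits)), which is the accuracy-versus-cost trade-off that stochastic computing hardware exploits to reduce the energy of each multiply.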

Subject

transformer neural networks
silicon photonics
inference acceleration
stochastic computing
optical computing
