A scalable optical neural network architecture using coherent detection
Alexander Sludds, Liane Bernstein, Ryan Hamerly, Marin Soljacic, Dirk Englund
Proceedings Volume 11299, AI and Optical Data Sciences; 112990H (24 February 2020); https://doi.org/10.1117/12.2546940
Event: SPIE OPTO, 2020, San Francisco, California, United States
Abstract
Storing, processing, and learning from data is a central task in both industrial practice and modern science. Recent advances in statistical learning, particularly deep neural networks (DNNs), have delivered record-breaking performance on tasks in game playing [1, 2], natural language processing [3], computer vision [4], computational biology [5, 6], and many others. The rapid growth of the field has been driven by an increase in the number of public datasets [7], improvements to algorithms [8], and substantial growth in computing power [9]. To perform well on these tasks, networks have had to grow in size, learning more complicated statistical features. The training and deployment of these large networks has spurred the creation of many neural network accelerators [10-12].

Existing general-purpose computing devices such as CPUs and GPUs are limited both by thermal dissipation per unit area and by the yield associated with large chips [13, 14]. Application-specific integrated circuits (ASICs) have substantially decreased energy consumption per workload by limiting the operations supported on chip. An example is the first-generation tensor processing unit (TPU) [15], which performs inference of large convolutional neural networks in a datacenter in <10 ms with an idle power of 28 W and a workload power of 40 W. It may seem counterintuitive, then, that the limiting factor for the implementation of DNNs is not computation, but rather the energy and bandwidth associated with reading and writing data from memory, as well as the energy cost of moving data inside the ASIC [15, 16]. Several emerging technologies, such as in-memory computing [17] and memristive crossbar arrays [18], promise increased performance, but these architectures suffer from calibration issues and limited accuracy [19].
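As a rough illustration of these figures, the sketch below estimates an upper bound on the TPU's energy per multiply-accumulate (MAC) from the power and latency quoted above; the assumed workload size (~2×10⁹ MACs per inference, typical of a ResNet-scale CNN) is illustrative and is not taken from the paper.

```python
# Back-of-envelope estimate of energy per MAC from the TPU figures above.
# Assumption (illustrative, not from the paper): ~2e9 MACs per inference,
# roughly the size of a ResNet-scale convolutional network.
workload_power_w = 40.0    # active power quoted above
latency_s = 10e-3          # "<10 ms" inference latency quoted above
macs_per_inference = 2e9   # assumed workload size

energy_per_inference_j = workload_power_w * latency_s          # 0.4 J
energy_per_mac_j = energy_per_inference_j / macs_per_inference
print(f"<= {energy_per_inference_j:.2f} J per inference")
print(f"<= {energy_per_mac_j * 1e12:.0f} pJ per MAC")          # ~200 pJ/MAC
```

Since the quoted latency is an upper bound and inferences may be batched, the true energy per MAC is lower; the point is that it sits orders of magnitude above the optical limits discussed below.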

Photonics as a field has had tremendous success in improving the energy efficiency of data interconnects [20]. This has motivated the creation of optical neural networks (ONNs) based on 3D-printed diffractive elements [21], spiking neural networks utilizing ring resonators [22], reservoir computing [23], and nanophotonic circuits [24]. However, these architectures have several issues. 3D-printed diffractive networks and schemes requiring spatial light modulators are non-programmable, meaning that they are unable to perform the task of training. Nanophotonic circuits allow an O(N²) array of interferometers to be programmed, providing passive matrix-vector multiplication. However, the large (≈1 mm²) footprint of on-chip electro-optic interferometers means that scaling to a 100×100 array would require 10,000 mm² of silicon, demonstrating the limits of scaling this architecture. To date, no architecture has demonstrated high-speed (GHz) computation with N ≥ 10,000 neurons.
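The quadratic area argument above is easy to make concrete. A minimal sketch, using the ≈1 mm² per-interferometer footprint quoted above (the helper name and loop values are illustrative):

```python
# Area scaling of an N x N programmable interferometer mesh.
# An N x N unitary requires O(N^2) interferometers; with ~1 mm^2 per
# electro-optic interferometer (figure quoted above), area grows as N^2.
def mesh_area_mm2(n: int, area_per_interferometer_mm2: float = 1.0) -> float:
    """Approximate silicon area for an N x N interferometer mesh."""
    return n * n * area_per_interferometer_mm2

for n in (10, 100, 1000):
    print(f"N = {n:>4}: ~{mesh_area_mm2(n):,.0f} mm^2")
# N = 100 already needs ~10,000 mm^2, far beyond a single reticle (~850 mm^2).
```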

Here we present an architecture that is scalable to N ≥ 10⁶ neurons. The key mechanism of this architecture is balanced homodyne detection. By scaling the architecture to such a large size, we show that the energy cost per operation of the optical component can be drastically reduced, reaching a bound set by shot noise on the receiving photodetectors, below which shot noise leads to classification errors. We call this bound a standard quantum limit (SQL); it reaches 100 zJ/MAC on problems such as MNIST. We also analyze the energy consumption using existing technologies and show that sub-fJ/MAC energy consumption should be possible.
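To put the 100 zJ/MAC figure in perspective, the sketch below converts it into a photon count; the telecom-band wavelength of 1550 nm is an assumption for illustration, not a parameter stated in this abstract.

```python
# Convert the ~100 zJ/MAC standard quantum limit into photons per MAC.
# Assumption: operation at 1550 nm (illustrative; not stated in the abstract).
h = 6.626e-34            # Planck constant, J*s
c = 2.998e8              # speed of light, m/s
wavelength_m = 1550e-9   # assumed wavelength

photon_energy_j = h * c / wavelength_m    # ~1.3e-19 J per photon
sql_energy_j = 100e-21                    # 100 zJ/MAC quoted above

print(f"photon energy: {photon_energy_j:.2e} J")
print(f"photons per MAC at the SQL: {sql_energy_j / photon_energy_j:.2f}")
# Under one photon per MAC: accuracy comes from shot-noise statistics
# averaged over many operations, not from resolving each product optically.
```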

This paper is organized as follows: in Section 1 we discuss the function of this architecture as a matrix-matrix processor; in Section 2 we analyze the energy consumption of the architecture; in Section 3 we discuss methods for training and for extending the accelerator to a broader scope of problems, namely convolutional neural networks (CNNs).
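As a toy illustration of the coherent-detection principle named in the title (not the authors' implementation), the sketch below encodes inputs and weights as field amplitudes, interferes them on an ideal 50:50 beamsplitter, and shows that the balanced-detector difference current accumulates to a dot product; the real-amplitude encoding and noiseless detectors are simplifying assumptions.

```python
import numpy as np

# Toy model of a MAC via balanced homodyne detection (idealized, real-valued
# field amplitudes; losses, phase noise, and detector noise are ignored).
rng = np.random.default_rng(0)
x = rng.normal(size=8)   # input activations encoded as field amplitudes
w = rng.normal(size=8)   # weights encoded as field amplitudes

# Ideal 50:50 beamsplitter outputs for each element pair (x_i, w_i):
out_plus = (x + w) / np.sqrt(2)
out_minus = (x - w) / np.sqrt(2)

# Photocurrents are proportional to intensity |field|^2; the balanced
# difference cancels the |x|^2 and |w|^2 terms, leaving 2 * x_i * w_i.
diff_current = out_plus**2 - out_minus**2

# Accumulating the difference currents yields the dot product x . w.
mac_result = diff_current.sum() / 2.0
print(np.allclose(mac_result, np.dot(x, w)))  # True
```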
KEYWORDS: Neural networks, Computer architecture, Convolutional neural networks, Homodyne detection, Electronics, Matrix multiplication, Photodetectors
