English

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

219
2023-09-27 15:24:41
See translation

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

Related Recommendations
  • Researchers use blurry light to 3D print high-quality optical components

    Canadian researchers have developed a new 3D printing method called Blur Tomography, which can quickly produce micro lenses with commercial grade optical quality. The new method can make designing and manufacturing various optical devices easier and faster.Daniel Webber from the National Research Council of Canada stated, "We have intentionally added optical blurring to the beams used in this 3D p...

    2024-05-11
    See translation
  • Optical Capture of Optical Nanoparticles: Fundamentals and Applications

    A new article published in Optoelectronic Science reviews the basic principles and applications of optical capture of optical nanoparticles. Optical nanoparticles are one of the key elements in photonics. They can not only perform optical imaging on various systems, but also serve as highly sensitive remote sensors.Recently, the success of optical tweezers in separating and manipulating individual...

    2023-11-25
    See translation
  • NSF funding for the world leading EP-OPAL laser multi mechanism design in Rochester

    The National Science Foundation (NSF) of the United States has awarded the University of Rochester nearly $18 million for three years to design and prototype key technologies for EP-OPAL, a new facility dedicated to studying the interaction between ultra-high intensity lasers and matter.After the design project is completed, the facility can be built at the Laser Energy Laboratory (LLE). This fund...

    2023-09-26
    See translation
  • The wide application of TORNOS mind machine in diversified industrial fields

    TORNOS walking machine, also known as walking CNC lathe or spindle box mobile CNC automatic lathe, occupies an important position in the field of precision manufacturing due to its excellent performance and wide application areas. This machine tool not only integrates mechanical and electrical technologies, but also becomes an indispensable processing equipment in many industrial fields due to its...

    2024-07-24
    See translation
  • Statsndata predicts that the light detection and ranging market will experience vigorous development globally in 2029

    The Light Detection and Ranging (LiDAR) market embodies the technology of remote sensing, surveying, and the use of laser pulses to measure distance and generate detailed three-dimensional models of objects, terrain, and environment.The LiDAR system emits a laser beam and measures the time required for the light to return to the surface, creating accurate and high-resolution digital representation...

    2023-08-31
    See translation