Português

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

536
2023-09-27 15:24:41
Ver tradução

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

Recomendações relacionadas
  • New Method - Observing how materials emit polarized light

    Many materials emit light in ways that encode information in its polarization. According to researchers at École Polytechnique Fédérale de Lausanne (EPFL), Switzerland, polarization is key for future technologies, from quantum computers to secure communication and holographic displays.Among such phenomena is a form known as circularly polarized luminescence (CPL), a special type of light emission ...

    07-04
    Ver tradução
  • Amazemet uses Siemens Xcelerator software for scaling metal 3D printing

    Polish metal 3D printing company Amazemet uses the Xcelerator software combination from industrial manufacturing company Siemens.The spin off company of Warsaw University of Technology is using Siemens workflow management software to develop its metal powder atomizer and 3D printing post-processing equipment.Amazemet was founded in 2016, and its ultrasonic atomization device is capable of producin...

    2024-04-18
    Ver tradução
  • Tiny yet Powerful: How Lasers on Chips Change the Game Rules of Photonics

    Chip level ultrafast mode-locked laser based on nanophotonic lithium niobate.Researchers have created a compact mode-locked laser integrated into a nanophotonic platform, capable of generating high-power and ultrafast optical pulses. The breakthrough in miniaturization of MLL technology can significantly expand the application of photonics.Innovation in mode-locked laser technologyTo improve the t...

    2023-12-27
    Ver tradução
  • New progress in research on laser cleaning and improving the damage threshold of fused quartz components at Shanghai Optics and Machinery Institute

    Recently, the research team of the High Power Laser Element Technology and Engineering Department of the Shanghai Institute of Optics and Mechanics, Chinese Academy of Sciences, has made new progress in the study of improving the damage threshold of fused quartz elements through laser cleaning. The study proposes for the first time the use of microsecond pulse CO2 laser cleaning to enhance the dam...

    2024-07-08
    Ver tradução
  • The emergence of laser engraving glass technology injects exquisite and vivid artistic quality into glass works

    The emergence of laser inner glass carving technology has brought new forms and possibilities of artistic expression to glass art. It not only showcases advanced technology and innovative craftsmanship, but also endows glass works with unique artistry.Firstly, laser engraved glass can achieve very fine and complex carving effects. By penetrating the interior of glass with a laser beam for carving,...

    2023-09-15
    Ver tradução