Ελληνικά

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

931
2023-09-27 15:24:41
Δείτε τη μετάφραση

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

Σχετικές προτάσεις
  • Jenoptik announces record high preliminary performance for 2024

    Recently, Jenoptik, a German company, released its preliminary performance for 2024, delivering a record high in both revenue and profit, but also revealing hidden concerns amidst industry cyclical fluctuations. Against the backdrop of weak demand in the semiconductor equipment market and increasing global economic uncertainty, this company with laser and optical technology as its core is attempti...

    02-14
    Δείτε τη μετάφραση
  • Ortel launches advanced 1550nm laser to enhance LiDAR and optical sensing functions

    Ortel belongs to the Photonics Foundries group and has launched its latest innovative product - the 1786 1550 nm laser module, aimed at significantly improving optical sensing in various applications. This laser module is designed specifically for continuous wavelength operation and is a key component of systems that require coherent light sources for precise sensing in environments with fluctuati...

    2024-03-16
    Δείτε τη μετάφραση
  • Solar cell laser processing deserves attention

    Laser processing is a relatively emerging non-contact processing method that utilizes the high energy of a beam of light to interact with materials and instantly vaporize or change their properties to achieve the expected manufacturing effect. It has gradually been promoted and applied in China in the past 20 years. Due to the different types, pulse widths, and wavelengths of laser generators, the...

    2023-10-31
    Δείτε τη μετάφραση
  • 3D printed chocolate: a delicious fusion of innovation and sustainable development

    In the era of sustainable development and cutting-edge technology, the integration of 3D printing and culinary art is not only an innovation, but also a proof of human creativity. Imagine in such a world, your desserts are not just coming out of the kitchen, but carefully designed and printed layer by layer. This is not a glimpse of the distant future, but the reality of today, as developers have ...

    2024-02-19
    Δείτε τη μετάφραση
  • Application of Multipurpose Femtosecond Laser Interferometry in High Precision Silicon Nanostructures

    Researchers from the Laser Processing Group of the IO-CSIC Institute of Optics in Spain report on the application of multi-purpose femtosecond laser interference in high-precision silicon nanostructures. The related research was published in Optics&Laser Technology with the title "Versatile femtosecond laser interference pattern applied to high precision nanostructured of silicon".Highlights:...

    2024-07-10
    Δείτε τη μετάφραση