简体中文

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

536
2023-09-27 15:24:41
查看翻译

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

相关推荐
  • Gas reduction technology of fiber laser helps to improve the cutting quality of low-carbon steel

    The Mitsubishi GX-F Advanced series of artificial intelligence enabled fiber lasers now use patented gas and burr reduction technology to help improve cutting quality while reducing gas consumption when cutting low-carbon steel.Mitsubishi Laser's proprietary Agr Mix nozzle technology does not require an external mixing tank or high-pressure oxygen. The combination of low-pressure air and nitrogen ...

    2024-02-14
    查看翻译
  • Professor Wu Dong's team at the University of Science and Technology of China created a "dancing microrobot" using femtosecond laser composite materials.

    It was learned from the University of Science and Technology of China that the team of Professor Wu Dong of the Micro and Nano Engineering Laboratory of the school proposed a femtosecond laser two-in-one multi-material processing strategy, manufactured a micromechanical joint composed of temperature-sensitive hydrogel and metal nanoparticles, and then developed a multi-joint humanoid micromachine ...

    2023-08-11
    查看翻译
  • Ecken develops a new type of iron silicon powder for 3D printing of motors

    Through the SOMA project funded by the European Union, organic silicon material expert Aiken has collaborated with research partners and clients to develop a new specialized iron silicon powder that can more efficiently 3D print motor components.Yesterday's electric motor was usually made by cutting and shaping parts from a metal plate. 3D printing can fundamentally improve efficiency and...

    2024-01-20
    查看翻译
  • Tower Semiconductor is preparing to add laser integrated PIC for Scintil

    Grenoble stated that in the context of growing demand driven by artificial intelligence and 5G, "key" milestones have strengthened its supply chain.Scantil Photonics, a subsidiary of CEA Leti that focuses on silicon photonics, has stated that its integrated laser design is now being produced by Tower Semiconductor, a wafer foundry partner.This method describes this development as a "crucial step f...

    2024-02-29
    查看翻译
  • University of Science and Technology of China realizes quantum elliptical polarization imaging

    Recently, the team led by Academician Guo Guangcan from the University of Science and Technology of China has made significant progress in the research of quantum elliptical polarization imaging. The research group of Professor Shi Baosen and Associate Professor Zhou Zhiyuan combined high-quality polarization entangled light sources with classical polarization imaging technology to observe the bir...

    04-14
    查看翻译