English

Scientists demonstrate a new optical neural network training method that can crush electronic microprocessors

729
2023-09-27 15:24:41
See translation

The current deep neural network system (such as ChatGPT) can quickly improve energy efficiency by 100 times in training, and "future improvements will greatly increase by several orders of magnitude. Scientists from MIT and other institutions have demonstrated a new optical neural network training method that can crush state-of-the-art electronic microprocessors.

Moreover, the computational density of the demonstrated system is about two orders of magnitude higher than that of Nvidia, Google, or Graphcore systems.

Basically, this means that the most advanced models can be trained with 100 times less energy and occupy less space at the same speed.

Artificial neural networks mimic the way biological brains process information. These artificial intelligence systems aim to learn, combine, and summarize information from big datasets, reshaping the field of information processing. Current applications include images, objects, speech recognition, games, medicine, and physical chemistry.

The current artificial intelligence model has reached hundreds of billions of artificial neurons, showing exponential growth and posing challenges to current hardware capabilities.

This paper demonstrates that optical neural network (ONN) methods with high clock speed, parallelism, and low loss data transmission can overcome current limitations.

Our technology opens up a path for large-scale optoelectronic processors to accelerate machine learning tasks from data centers to decentralized edge devices, "the paper wrote.

The ONN method is expected to alleviate the bottlenecks of traditional processors, such as the number of transistors, data mobility energy consumption, and semiconductor size. ONN uses light, which can carry a large amount of information simultaneously due to its wide bandwidth and low data transmission loss. In addition, many photonic circuits can be integrated to expand the system.

In order to move light for calculation, the team led by MIT utilized many laser beams, which were described as "using mass-produced micrometer scale vertical cavity surface emitting lasers for neuron coding".

The researchers explained, "Our scheme is similar to the 'axon synapse dendrite' structure in biological neurons
They believe that the demonstrated system can be expanded through mature wafer level manufacturing processes and photon integration.

Dirk Englund, Associate Professor of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology and the head of this work, explained to SciTechDaily that the size of models such as ChatGPT is limited by the capabilities of today's supercomputers. Therefore, training larger models is not economically feasible.

He claimed, "Our new technology can make it possible to cross machine learning models, otherwise it would not be possible in the near future.

This paper titled "Deep Learning Using Coherent VCSEL Neural Networks" was published by a large team of scientists. This work has received support from the Army Research Office, NTT Research, and NTT Netcast Awards, as well as financial support from the Volkswagen Foundation. The three researchers of the team have applied for patents related to this technology.

Source: Laser Network

Related Recommendations
  • New discoveries bring progress in photon calculation

    International researchers led by Philip Walther from the University of Vienna have made significant breakthroughs in the field of quantum technology, successfully demonstrating quantum interference between multiple single photons using a new resource-saving platform. This work, published in Science Advances, represents a significant advancement in the field of quantum computing and paves the way f...

    2024-04-27
    See translation
  • New technology can efficiently heal cracks in nickel based high-temperature alloys manufactured by laser additive manufacturing

    Recently, Professor Zhu Qiang's team from the Department of Mechanical and Energy Engineering at Southern University of Science and Technology published their latest research findings in the Journal of Materials Science. The research team has proposed a new process for liquid induced healing (LIH) laser additive manufacturing of cracks. By controlling micro remelting at grain boundaries to introdu...

    2024-03-15
    See translation
  • AEROTECH releases updated AUTOMATION1 motion control platform

    Aerotech is a global leader in precision motion control and automation, and every release has made the Automation1 motion control platform even stronger and more user-friendly. Version 2.5 brings TCP socket interface (test version), Automation1 MachineApps HMI development, new auxiliary module for motor settings, and improved machine settings for galvanometer laser scanning heads.Automation1 conti...

    2023-08-14
    See translation
  • Lumentum revenue growth due to increased demand for artificial intelligence

    Photonic component manufacturer Lumentum says that its sales revenues will exceed half a billion dollars in the current quarter - and surpass $600 million this time next year, as demand from artificial intelligence (AI) data centers continues to accelerate. CEO Michael Hurlston announced a sales figure of just under $481 million for the quarter that ended June 28, up 56 per cent year-on-year and...

    08-15
    See translation
  • The University of California has developed a pioneering chip that can simultaneously carry lasers and photonic waveguides

    A team of computer and electrical engineers at UC Santa Barbara, in collaboration with several colleagues at Caltech and another colleague at Anello Photonics, has developed a first-of-its-kind chip that can carry both laser and photonic waveguides. In a paper published in the journal Nature, the team describes how they made the chip and how it worked during testing.With the advent of integrated c...

    2023-08-10
    See translation