Science Light-powered computer chip can train AI much faster than components powered by electricity - In designing their chip, the scientists set out to build a light-based platform that could perform calculations known as vector-matrix multiplications. This is one of the key mathematical operations used to train neural networks

  • Want to keep track of this thread?
    Accounts can bookmark posts, watch threads for updates, and jump back to where you stopped reading.
    Create account

Light-powered computer chip can train AI much faster than components powered by electricity​

New chip design uses photons rather than electrons to perform calculations, and scientists hope to integrate the technology into future graphics cards to train AI.​

By Keumars Afifi-Sabet published [March 25th, 2024]

Scientists have designed a new microchip that's powered by light rather than electricity. The tech has the potential to train future artificial intelligence (AI) models much faster and more efficiently than today's best components, researchers claim.

By using photons to perform complex calculations, rather than electrons, the chip could overcome the limitations of classic silicon chip architecture and vastly accelerate the processing speed of computers, while also reducing their energy consumption, scientists said in a new study, published Feb. 16 in the journal Nature Photonics.

Silicon chips have transistors — or tiny electrical switches — that turn on or off when voltage is applied. Generally speaking, the more transistors a chip has, the more computing power it has — and the more power it requires to operate.

Throughout computing history, chips have adhered to Moore's Law, which states the number of transistors will double every two years without a rise in production costs or energy consumption. But there are physical limitations to silicon chips, including the maximum speed transistors can operate at, the heat they generate from resistance, and the smallest size chip scientists can make.

It means stacking billions of transistors onto increasingly small silicon-electronic chips might not be feasible as the demand for power increases in the future — particularly for power-hungry AI systems.

Using photons, however, has many advantages over electrons. Firstly, they move faster than electrons — which cannot reach the speed of light. While electrons can move at close to these speeds, such systems would need an extraordinary — and unfeasible — amount of energy. Using light would therefore be far less energy-intensive. Photons are also massless and do not emit heat in the same way that electrons carrying an electrical charge do.

In designing their chip, the scientists set out to build a light-based platform that could perform calculations known as vector-matrix multiplications. This is one of the key mathematical operations used to train neural networks — machine-learning models arranged to mimic the architecture of the human brain. AI tools like ChatGPT and Google's Gemini are trained in this way.

Instead of using a silicon wafer of uniform height for the semiconductor, as conventional silicon chips do, the scientists made the silicon thinner — but only in specific regions.

"Those variations in height — without the addition of any other materials — provide a means of controlling the propagation of light through the chip, since the variations in height can be distributed to cause light to scatter in specific patterns, allowing the chip to perform mathematical calculations at the speed of light," co-lead author Nader Engheta, professor of physics at the University of Pennsylvania, said in a statement.

The researchers claim their design can fit into pre-existing production methods without any need to adapt it. This is because the methods they used to build their photonic chip were the same as those used to make conventional chips.

They added the design schematics can be adapted for use in augmenting graphics processing units (GPUs), for which demand has skyrocketed in recent years. That's because these components are central to training large language models (LLMs) like Google's Gemini or OpenAI's ChatGPT.

"They can adopt the Silicon Photonics platform as an add-on," co-author Firooz Aflatouni, professor of electrical engineering at the University of Pennsylvania, said in the statement. "And then you could speed up [AI] training and classification."
 
Honestly, this reads like first page of a bad grant proposal. Extremely vague description of the research and a specious connection to whatever is trending - in this case AI.

Also "vector-matrix multiplications" sounds unnatural when the common notation is for operators to act from the left instead of from the right (e.g. we describe a linear system by writing Ax=b instead of xA=b).
 
Converting all transistor-based computers to optics-based would be a huge pain in the ass, and it would be smart to get ahead of the curve before the big switch
 
Honestly, this reads like first page of a bad grant proposal. Extremely vague description of the research and a specious connection to whatever is trending - in this case AI.

Yeah, it's bullshit. Electrons still move incredibly fast (upwards of 2/3 the speed of light) in wire, so our current CPUs aren't going to see much benefit from increasing the rate that electrons move. Consider that even the fastest clock speeds that top of the line gaming CPUs reach would require that the electrons travel more than 2.5 cm during the time span before there would be any need for them to travel faster. Although that seems tiny, it's still farther than the long side of of the die of 14900k. Nothing in the chip needs to move that far in that time window, not by a long shot. The chips doing machine learning aren't running anywhere near 6 GHz, so the distance constraints are even more relaxed.

Furthermore, photons aren't going to travel as fast when they don't have a straight path, which they aren't going to get in any kind of design they implement. There's also the matter of how small they can make whatever the hell they're actually proposing as a substitute for transistors. The odds that they can compete for density with the bleeding edge process nodes from TSMC or Intel is laughable at best. Even if they could design something that operates at 3/4 the speed of light or above, if you can only fit 1% of the logic gates as conventional photolithography, there's not really any point.

The only area where there could be some conceivable advantage is power usage. Current fabrication technologies are a fucking marvel of engineering and it's amazing that the teams working on them have continually been able to break past all of the various barriers that stood in their way. However, the biggest problems have been that power decreases haven't scaled for a while now and the designs have had to become even more exotic to enable density increases without those chips turning into tiny furnaces. For high performance compute workloads and the massive data centers that run them, power consumption and cooling costs to shed all of the waste heat generated by the machines can be a bigger cost factor over time than some of the chips themselves.

But this is just some grade-A bullshit that has no actual promise. These are opportunistic assholes hoping that a stupid congressman will throw a few million dollars their way. It's not like the latest spending bill didn't include far more money earmarked for even more dubious projects.
 
Yeah, it's bullshit. Electrons still move incredibly fast (upwards of 2/3 the speed of light) in wire, so our current CPUs aren't going to see much benefit from increasing the rate that electrons move. Consider that even the fastest clock speeds that top of the line gaming CPUs reach would require that the electrons travel more than 2.5 cm during the time span before there would be any need for them to travel faster. Although that seems tiny, it's still farther than the long side of of the die of 14900k. Nothing in the chip needs to move that far in that time window, not by a long shot. The chips doing machine learning aren't running anywhere near 6 GHz, so the distance constraints are even more relaxed.

Photonic integrated circuits are very real, and Synopsys sells design tools for engineers who make them.


Lightelligence turned heads a couple years ago with a photonic chip that was about 100x faster than a conventional GPU:


Eventually, we're going to figure out how to manufacture these things at scale, and silicon chips will be as relevant as coal-fired steam trains are today.

By the way, current travels via the movement of electromagnetic fields and moves much faster than the electrons move in a wire. Clock speed has nothing to do with moving electrons from one end of the chip to the other, but with how fast state can be changed without melting the chip or destroying circuit integrity, whichever comes first.
 
I've heard about this idea before.. never really got a good explanation for why though.. The AI angle is new though. And once again, less than a clear explanation for why as well.
 
I've heard about this idea before.. never really got a good explanation for why though.. The AI angle is new though. And once again, less than a clear explanation for why as well.

The main advantage is it takes far less energy to move a photon than an electron. The main disadvantage is it's extremely expensive to fabricate photonic devices. You need microscopic crystal structures for them to work. It's the same principle behind the advantages of fiber optic over copper wire.

It's really big area of research right now. IBM's been working on photonic chips for years now. Intel's already selling photonic transceivers, and showed off a hybrid electronic/photonics chip at Hot Chips last year. The fact is we're running into the limits of what electricity can do, so whoever figures out how to make light work economically is going to make enormous amounts of money.
 
Back
Top Bottom