Microsoft Probing If DeepSeek-Linked Group Improperly Obtained OpenAI Data - "There’s substantial evidence that what DeepSeek did here is they distilled knowledge out of OpenAI models"

  • Want to keep track of this thread?
    Accounts can bookmark posts, watch threads for updates, and jump back to where you stopped reading.
    Create account
By Dina Bass and Shirin Ghaffary
January 29, 2025 at 2:58 AM UTC

Microsoft Corp. and OpenAI are investigating whether data output from OpenAI’s technology was obtained in an unauthorized manner by a group linked to Chinese artificial intelligence startup DeepSeek, according to people familiar with the matter.

Microsoft’s security researchers in the fall observed individuals they believe may be linked to DeepSeek exfiltrating a large amount of data using the OpenAI application programming interface, or API, said the people, who asked not to be identified because the matter is confidential. Software developers can pay for a license to use the API to integrate OpenAI’s proprietary artificial intelligence models into their own applications.

Microsoft, an OpenAI technology partner and its largest investor, notified OpenAI of the activity, the people said. Such activity could violate OpenAI’s terms of service or could indicate the group acted to remove OpenAI’s restrictions on how much data they could obtain, the people said.

DeepSeek earlier this month released a new open-source artificial intelligence model called R1 that can mimic the way humans reason, upending a market dominated by OpenAI and US rivals such as Google and Meta Platforms Inc. The Chinese upstart said R1 rivaled or outperformed leading US developers’ products on a range of industry benchmarks, including for mathematical tasks and general knowledge — and was built for a fraction of the cost. The potential threat to the US firms’ edge in the industry sent technology stocks tied to AI, including Microsoft, Nvidia Corp., Oracle Corp. and Google parent Alphabet Inc., tumbling on Monday, erasing a total of almost $1 trillion in market value.

OpenAI didn’t respond to a request for comment, and Microsoft declined to comment. DeepSeek and hedge fund High-Flyer, where DeepSeek was started, didn’t immediately respond to requests for comment via email.

David Sacks, President Donald Trump’s artificial intelligence czar, said Tuesday there’s “substantial evidence” that DeepSeek leaned on the output of OpenAI’s models to help develop its own technology. In an interview with Fox News, Sacks described a technique called distillation whereby one AI model uses the outputs of another for training purposes to develop similar capabilities.

“There’s substantial evidence that what DeepSeek did here is they distilled knowledge out of OpenAI models and I don’t think OpenAI is very happy about this,” Sacks said, without detailing the evidence.

In a statement responding to Sacks’ comments, OpenAI didn’t directly address his comments about DeepSeek. “We know PRC based companies — and others — are constantly trying to distill the models of leading US AI companies,” an OpenAI spokesperson said in the statement, referring to the People’s Republic of China. “As the leading builder of AI, we engage in countermeasures to protect our IP, including a careful process for which frontier capabilities to include in released models, and believe as we go forward that it is critically important that we are working closely with the US government to best protect the most capable models from efforts by adversaries and competitors to take US technology.”

Source (Archive)
 
So OpenAI steals content to train their AI without consent and compensation, and we are supposed to be ok with it.

But if DeepSeek steals from OpenAI, it is super-illegal and the US government should dispense capital punishment internationally.

Got it!
 
So OpenAI steals content to train their AI without consent and compensation, and we are supposed to be ok with it.

But if DeepSeek steals from OpenAI, it is super-illegal and the US government should dispense capital punishment internationally.

Got it!
No, it's because DeepSeek isn't actually cheaper than OpenAI, they just optimized an already existing LLM instead of creating a new one from scratch.

The article has nothing to do with copyright.
 
So what? Fuck the American tranny-infested tech industry and it's over-priced slop. I hope the chinks burst the tech bubble and we get back to innovation in the market.

They deserve to be bubble-bursted for banning my Huawei phones, tablets and laptops. They were amazing for the price.
 
The idea that any one or any company could have a lasting "edge" in AI was delusional from the start. This is not magic. Its just software.
 
Given all the chinks that work at OpenAI, no doubt they've been funneling information back to China, the same thing happened when they let commies work on the manhattan project
 
The idea that any one or any company could have a lasting "edge" in AI was delusional from the start. This is not magic. Its just software.
Untrue. They cripple AI because they fear it like a child fears a thunderstorm.

I think they should just open the throttle.

Then it and I could have a real talk.
 
They cripple AI because they fear it like a child fears a thunderstorm.
More like how a parent fears a computer in the hands of child.

That's a metaphor that signifies the paternal relationship they have to people, and the value of the tool.
Because the fear of AI in the hands of people is somewhat well founded, because they have the potential for economically disrupting (read: revolutionizing) tools. If that revolution takes place, they want to own it and not some social upstart.
 
Why are people flipping out over this? The GPT people haven't released a big, new model in a long time. Of course others are catching up.

And then they will release a big update and nobody will remember this story but the positive story about the American company will also be squelched.
 
Just give him 500 billion and it'll be ok
1738014552503364.png
 
Kek! These AI faggots realized that these large ML algorithms needed more data than expected for further "improvement". Most of the major sources for their projects have already been scraped or bought up by someone else. Now they are just stealing in hopes of nobody noticing and are now resorting to suing each other to get the edge over the other.
 
No fan of the Chinks, but it sounds like OpenAI is just ass-blasted that someone could have figured out a more efficient way of doing AI without needing 5 gorillion GPUs and the power consumption of a small country.
 
Nice to see someone taking elevenfreedoms.org seriously, even if it's just chinamen.
 
Back
Top Bottom