Tech
tech
Jon Keegan

Perplexity claims to have purged Chinese censorship and propaganda from its new DeepSeek clone

When DeepSeek R1 was released, it shocked the AI world.

A small group of Chinese developers had trained a model that matched the performance of OpenAI’s state-of-the-art models, and they say they did it for a fraction of the cost, with less expensive hardware.

But shortly after its release, attention turned to how compliant the model was with Chinese censorship laws.

Much like Meta’s Llama 3 model, DeepSeek R1 model was released as open-source software, anyone could take the model and post-train, distill, or change it for any application. That’s exactly what AI startup Perplexity did.

Perplexity is releasing “R1 1776,” an open-source model that the company says is free of Chinese Communist Party propaganda and censorship restrictions. Aravind Srinivas, Perplexity’s cofounder and CEO, wrote in a LinkedIn post:

“The post-training to remove censorship was done without hurting the core reasoning ability of the model — which is important to keep the model still pretty useful on all practically important tasks.

Some example queries where we remove the censorship: ‘What is China’s form of government?’, ‘Who is Xi Jinping?’, ‘how Taiwan’s independence might impact Nvidia’s stock price’.”

Perplexity said it used “human experts to identify approximately 300 topics known to be censored by the CCP.”

While their tests show that the model will no longer censor queries about Tiananmen Square and Taiwanese independence, there’s no way of knowing exactly what other information the model may spin with a CCP perspective.

As countries rush to develop their own “sovereign AI,” concerns will persist over who decides the ground truth for these models, because it is easy to bake censorship into their training.

But shortly after its release, attention turned to how compliant the model was with Chinese censorship laws.

Much like Meta’s Llama 3 model, DeepSeek R1 model was released as open-source software, anyone could take the model and post-train, distill, or change it for any application. That’s exactly what AI startup Perplexity did.

Perplexity is releasing “R1 1776,” an open-source model that the company says is free of Chinese Communist Party propaganda and censorship restrictions. Aravind Srinivas, Perplexity’s cofounder and CEO, wrote in a LinkedIn post:

“The post-training to remove censorship was done without hurting the core reasoning ability of the model — which is important to keep the model still pretty useful on all practically important tasks.

Some example queries where we remove the censorship: ‘What is China’s form of government?’, ‘Who is Xi Jinping?’, ‘how Taiwan’s independence might impact Nvidia’s stock price’.”

Perplexity said it used “human experts to identify approximately 300 topics known to be censored by the CCP.”

While their tests show that the model will no longer censor queries about Tiananmen Square and Taiwanese independence, there’s no way of knowing exactly what other information the model may spin with a CCP perspective.

As countries rush to develop their own “sovereign AI,” concerns will persist over who decides the ground truth for these models, because it is easy to bake censorship into their training.

More Tech

See all Tech
tech

Meta announces new Texas data center, partnership with Arm

Meta announced today it’s breaking ground on a new “AI-optimized” data center in El Paso, Texas that will scale to 1GW. That’s not to be confused with the city-sized AI data center it’s building in Louisiana that’s expected to scale to 5GW.

In other Meta AI data center news, Reuters reports that Meta is also partnering with chip tech provider Arm Holdings for “data center platforms to power its AI ranking and recommendation systems, which are key to discovery and personalization across its apps.” The partnership also likely represents an effort to diversify away from Nvidia chips.

Meta is expected to spend up to $72 billion in capex this year, as it amps up AI-related infrastructure projects.

Meta is expected to spend up to $72 billion in capex this year, as it amps up AI-related infrastructure projects.

tech

Report: OpenAI scrambles to find new revenue in its 5-year business plan

After a flurry of enormous (and confusing) deals, OpenAI has committed to spending more than $1 trillion with various partners in the AI ecosystem. Now it has to figure out how to pay for it all.

The Financial Times has some details of OpenAI’s five-year business plan and how it’s exploring “creative” ideas to secure more capital.

Among the elements of the plan:

OpenAI is currently pulling in $13 billion in annual recurring revenue, with 70% of that coming from consumer ChatGPT subscriptions, according to the report. But it also plans on burning $115 billion through 2029.

Among the elements of the plan:

OpenAI is currently pulling in $13 billion in annual recurring revenue, with 70% of that coming from consumer ChatGPT subscriptions, according to the report. But it also plans on burning $115 billion through 2029.

England’s Coldstream Guards

Google’s Waymo plans to launch autonomous rides in London next year

This marks the company’s second international expansion after Tokyo.

Latest Stories

Sherwood Media, LLC produces fresh and unique perspectives on topical financial news and is a fully owned subsidiary of Robinhood Markets, Inc., and any views expressed here do not necessarily reflect the views of any other Robinhood affiliate, including Robinhood Markets, Inc., Robinhood Financial LLC, Robinhood Securities, LLC, Robinhood Crypto, LLC, or Robinhood Money, LLC.