Amazon CEO Andy Jassy at AWS re:Invent 2024 (Noah Berger/Getty Images)

super mega ultra

Amazon’s AI plans: custom chips, an Anthropic “ultracluster,” and its own foundation model

While Amazon’s new models appear to be competitive in terms of features and performance, that isn’t the main thing that the company is touting — it’s the cost.

Jon Keegan

12/4/24 2:11PM

This week at Amazon’s AWS re:Invent conference in Las Vegas, the company fleshed out its plans to both serve and compete with the larger AI industry.

AWS is largely AI agnostic. Customers can use pretty much any of the major AI models on the cloud-computing platform, running on servers that use chips from Nvidia, AMD, Qualcomm, and others.

But Amazon has also been building and selling computing powered by its purpose-built AI chips, including its latest Trainium2 chip, which Amazon is now making widely available on AWS’s EC2 service. Amazon says these new Trainium2 instances are built for training and deploying jumbo-sized large language models with better price performance than its current offerings.

Amazon also deepened its partnership with AI startup Anthropic, announcing that it’s building an “ultracluster” of “hundreds of thousands” of Trainium2 servers to train Anthropic’s next-generation LLM. Amazon recently doubled its investment in Anthropic, bringing the total to $8 billion.

Probably the most significant announcement was Amazon’s late entry to the foundational AI-model club. Named “Amazon Nova,” the new LLM comes in four flavors: a text-only Micro and three multimodal models, Lite, Pro, and Premier. Amazon touted benchmark scores for the Nova models that place them in the same class as OpenAI’s GPT-4o and Meta’s Llama 3. Amazon’s multimodal Nova models can ingest and generate images and videos, like many of the other top models out there today.

While Amazon’s new models appear to be competitive in terms of features and performance, that isn’t the main thing that the company is touting — it’s the models’ low, low cost.

Running Amazon models on Amazon servers, powered by Amazon chips, yields significant cost savings and low latency. Amazon says its Nova models are “at least 75% less expensive” than the best-performing models available on AWS today.

Rani Molla

15h

Microsoft is reportedly building a super app to tame product sprawl — and finally crack mobile

Super apps are very 2010s, but they might be the future for Microsoft. The enterprise giant is working on combining its sprawling and often confusing product suite into a single super app expected by late summer, Fortune reports.

By unifying the tools, Microsoft is hoping that the massive popularity of some of its offerings — particularly GitHub Copilot — will rub off on its other, slower-growing products.

The app will merge Microsoft’s coding assistant GitHub Copilot, its chat function Copilot, its Copilot Cowork tool, and a new agentic workflow called Autopilot. The move, known internally as “Delivering one Copilot,” will have the dual purpose of simplifying Microsoft’s fragmented desktop AI offerings and finally helping the office software giant gain a foothold on mobile, where competing tools have dominated.

Microsoft is taking a page from frenemy OpenAI’s playbook. In March, OpenAI announced plans for its own desktop super app to combine ChatGPT, Codex, and its Atlas browser into one central workstation.

Exclusive: Microsoft is building a super app that combines coding, chat, and other Copilot AI tools | Fortune

Rani Molla

21h

Forty-two is the answer to life, the universe, and everything in Douglas Adams’ classic “The Hitchhiker’s Guide to the Galaxy.” It’s also the number of unsupervised Robotaxis Tesla has on the road in Texas, the only state where it’s operating autonomous service, according to records from a newly required government database in the state.

That’s far fewer than CEO Elon Musk had hoped, as the company struggles to ready its camera-only autonomous vehicles for commercial scale. In 2025, Musk said that the service would be available to “half the population of the US by the end of the year.”

Even smaller competitors have more: Avride has 317 and Nuro has 47. Meanwhile, Tesla’s chief rival, Alphabet subsidiary Waymo, has 577 in operation in the state. Nationwide, Waymo’s fleet currently numbers more than 3,000.

Unfortunately for Tesla, figuring out how to actually scale its robotaxi fleet remains the ultimate question.

Anthropic raises $65 billion at a $965 billion valuation, releases a more “honest” Claude Opus 4.8

Anthropic’s monster $965 billion valuation puts it firmly ahead of OpenAI’s $850 billion valuation as the rivals head toward expected IPOs later this year.

Jon Keegan

5/28/26

Report: Microsoft tries to get back in the AI coding game with new model

Microsoft wants to fight its way back into the AI coding field by releasing a new model next week at its annual Microsoft Build developer conference, The Information reports.

The company is expected to announce a new family of models as Microsoft AI CEO Mustafa Suleyman seeks to shore up the company’s own AI offerings and gradually wean it off OpenAI’s technology over the remainder of their $13 billion partnership.

Microsoft was initially well positioned to meet software developers with AI-enhanced tools. It owns GitHub, the most popular platform for hosting and sharing code, and GitHub’s Copilot AI-powered coding tool was released months before OpenAI’s ChatGPT debuted in 2022.

But it fumbled one of the biggest first-mover advantages in history as Anthropic’s Claude Code, OpenAI’s Codex, and Cursor rolled out coding tools that developers loved.

Microsoft to Release New Coding Model Next Week in Comeback Attempt

Waymo to launch free robotaxi rides in its new Ojai vans

The new vehicles are less expensive — which is important for the service to really scale.

Rani Molla

5/28/26