Article archive
Browse all 75 published articles

Microsoft and OpenAI amend partnership, ending cloud exclusivity while preserving Azure-first status
Microsoft and OpenAI amended their 2019 partnership on April 27. OpenAI can now serve products through any cloud provider while Microsoft remains the

DeepSeek V4 ships with open weights and Huawei Ascend support, narrowing the gap with closed AI models
DeepSeek shipped its V4 series on April 24 — a 1.6-trillion-parameter MIT-licensed MoE flagship and a 284-billion-parameter sibling, both with 1M-toke

Google commits up to $40 billion to Anthropic, doubling down on its top AI rival
Alphabet will put $10 billion into Anthropic now at a $350 billion valuation, with another $30 billion tied to performance milestones — and Google Clo

Gemma 4 on an iPhone: here's what a 2B model can actually do
Google shipped a real LLM to your phone. We tested Gemma 4 E2B fully offline (math, code, ethics, and history) to find where on-device AI shines and w

Netflix VOID: we tested their physics-aware video eraser with our own clips
Netflix open-sourced a production VFX model that reasons about physical interactions during object removal. We ran it on an H100 to see if the claims

Gemma 4 E4B benchmark: 53 tests on a MacBook Pro M4 (24GB)
53 practical tests — vision benchmarks, 14 UI recreations, and 30 text challenges covering math, logic, essays, creative writing, and translation. All

Veo 3.1 Lite vs Fast: what you actually lose at a third of the price
Google launched Veo 3.1 Lite today at $0.05 per second. I ran 7 text-to-video prompts and one image-to-video test on both models to find out where Lit

I cloned my voice with Mistral's Voxtral TTS in under a minute, then tested the quantized local model
I recorded 43 seconds of audio on a MacBook, sent it to Mistral's Voxtral API, and got back natural-sounding voice clones in minutes. Then I ran the s

Cohere Transcribe, the most accurate open-source speech recognition model currently available
Cohere's first speech recognition model tops the Open ASR Leaderboard as well as our own independent benchmark of four providers.

Search is becoming a conversation, and Google just went first at scale
Google's expansion of Search Live to more than 200 countries this week is easy to frame as a product update. A voice feature got wider availability. B

Google's new voice model powers Search Live expansion
Google released Gemini 3.1 Flash Live on March 26, positioning it as the company's most capable real-time audio model to date.

Machine learning model predicts liver cancer risk from routine blood tests and health records
Researchers have developed a machine learning model that predicts an individual's risk of hepatocellular carcinoma using routine clinical information

Karpathy says LLM memory features "trying too hard"
In a pair of posts on X on March 25, Andrej Karpathy noted that the memory feature has an issue: models can treat stale, one-off queries as deep ongoi

Nvidia-backed Reflection AI in talks to raise $2.5 billion at $25 billion valuation
Reflection AI, a startup backed by Nvidia that is building freely available AI models, is in talks to raise $2.5 billion at a pre-money valuation of $

Anthropic has introduced auto mode for Claude Code
Anthropic has introduced auto mode for Claude Code, a new permissions setting that lets it approve or block actions on the developer's behalf.

Google's Lyria 3 Pro generates full songs up to three minutes long
Google DeepMind's Lyria 3 Pro music generation model can create complete tracks up to three minutes long, a sixfold increase over the 30-second clips

OpenAI kills Sora as compute costs force a strategic retreat
OpenAI is discontinuing Sora, its AI video generation platform, just six months after the standalone app launched.

Cursor publishes Composer 2 technical report, formally crediting Kimi K2.5 as base model
Cursor released a technical report on March 24 detailing how Composer 2 was trained, formally crediting Moonshot AI's Kimi K2.5 as the base model for

Google finds a way to shrink AI memory usage by 4.5x without losing accuracy
A new compression technique from Google Research could make AI models cheaper to run, faster to respond, and capable of handling much longer conversat

Cursor's new coding model was built on top of Kimi K2.5, a Chinese open-source base
A developer discovered that Cursor's Composer 2 model was built on top of Kimi K2.5, an open-source model from Beijing-based Moonshot AI, a detail Cur

Nvidia DLSS 5, fusing traditional rendering with gen AI
Nvidia DLSS 5 introduces a real-time neural rendering model that uses generative AI to enhance game visuals with photorealistic lighting and materials

Moonshot AI proposes new method for how LLM layers share information, claims 1.25x compute advantage
Moonshot AI's Kimi Team has published a technical report proposing a change to a fundamental component of modern AI models called residual connections

How ChatGPT and AlphaFold helped an Australian engineer design a personalized cancer vaccine for his dog
A Sydney tech entrepreneur with no background in biology used ChatGPT, Google DeepMind's AlphaFold, and his own machine learning expertise to design a

Musk sets March 21 for Tesla's Terafab chip factory
Elon Musk set a March 21 date for what may well be one of the most ambitious semiconductor ventures ever attempted by a company outside the establishe

Anthropic commits $100 million to new Claude Partner Network for enterprise adoption
Anthropic announced on March 12 that it is investing $100 million in 2026 to launch the Claude Partner Network, a program designed to help consulting

Claude's 1M context window is now generally available with no long-context pricing premium
Anthropic has moved the 1 million token context window for Claude Opus 4.6 and Sonnet 4.6 from beta to general availability, and eliminated the long-c

Google DeepMind celebrates AlphaGo's 10th anniversary as Pentagon AI crisis consumes the industry
Google DeepMind CEO Demis Hassabis published a warm retrospective about AlphaGo's 10th anniversary in what may very well be an act of narrative positi

Cursor Automations turns AI agents into background processes
Cursor's new Automations let AI agents run in the background, triggered by PRs, PagerDuty alerts, Slack messages, and timers, with no prompt required.

GPT-5.4 arrives with native computer use, experimental 1M context, and a new tool search capability
OpenAI released GPT-5.4 today, billing it as its most capable model to date for professional work, coding, and automated tasks.

NotebookLM launches Cinematic Video Overviews, powered by a three-model AI pipeline
Google's AI research tool upgrades from narrated slides to source-grounded cinematic video using a three-model pipeline of Gemini 3, Nano Banana Pro,

Google expands Canvas in AI Mode to all US users in Search
Google has made its Canvas feature available to all users in the United States who access AI Mode in Search, the company announced on March 4, 2026.

Google releases Gemini 3.1 Flash-Lite
Google has released a preview of Gemini 3.1 Flash-Lite, positioning it as the fastest and most cost-efficient model in the Gemini 3 series.

NotebookLM adds customizable infographic styles
Google's research AI ships ten style presets, a sign the competition for AI tool loyalty is moving beyond accuracy.

Alibaba launches Qwen 3.5 small model series with sub-1B edge options
Alibaba's Qwen team has released four new small language models — Qwen3.5-0.8B, 2B, 4B, and 9B — alongside their base model counterparts, extending th

The pressure test: What February 2026 revealed about AI's safety promises
In one week, Anthropic dismantled its hardest safety promise and absorbed a federal blacklisting rather than cross its ethical red lines. Both things

Claude hits #1 on the US App Store after Pentagon dispute
After the Pentagon banned Anthropic for refusing to allow Claude to be used for mass surveillance and autonomous weapons, Claude climbed to #1 on the

OpenAI shares Pentagon contract language
OpenAI published excerpts of its agreement with the Department of Defense on Saturday morning. The language is more detailed than expected, yet more a

OpenAI strikes Pentagon deal hours after Anthropic blacklisted — with seemingly the same terms Anthropic was punished for requesting
OpenAI CEO Sam Altman announced a deal to deploy AI on the Pentagon's classified network just hours after the administration blacklisted Anthropic for

Amazon invests $50 billion in OpenAI as part of $110 billion funding round
OpenAI announced $110 billion in new investment at a $730 billion pre-money valuation, anchored by $50 billion from Amazon, $30 billion from SoftBank,

The Pentagon wants Claude without guardrails. Anthropic said no. Here's where it stands
The U.S. Department of War gave Anthropic a Friday deadline to remove AI safety guardrails from Claude or face contract cancellation, a supply chain r

Claude Code now remembers what it learns between sessions
Anthropic has rolled out an automatic memory feature for Claude Code that writes persistent notes about project patterns, debugging discoveries, and u

Meta signs AMD AI chip deal with 160 million-share warrant tied to 6GW deployment
Meta and AMD announced a five-year agreement to deploy up to six gigawatts of AMD Instinct GPUs, alongside a performance-based warrant that could let

Google launches Nano Banana 2, bringing pro-level image generation to its Flash model
Google DeepMind releases Nano Banana 2 (Gemini 3.1 Flash Image), delivering Pro-tier image generation quality at Flash speeds. The model rolls out acr

Anthropic launches Remote Control for Claude Code, enabling mobile access
Claude Code's new 'Remote Control' feature, launched February 24, 2026, allows devs to continue local terminal sessions from their phone, tablet, or a

Claude Code Security, an AI-powered vulnerability scanner
Anthropic announced Claude Code Security on February 20, 2026, a tool that scans codebases for security vulnerabilities and suggests patches for human

Gemini 3.1 Pro claims top-tier reasoning gains
Google has released Gemini 3.1 Pro, a new version of its Pro model line that it says upgrades the core reasoning capabilities behind recent Gemini 3 a

Google rolls out Lyria 3 music generation in the Gemini app
Google has begun rolling out Lyria 3 in the Gemini app, adding a built-in tool for generating short 30-second music tracks.

Meta signs multiyear deal for NVIDIA GPUs
Meta and NVIDIA announced a multiyear strategic partnership deploying millions of Blackwell and Rubin GPUs, standalone Grace CPUs, and Spectrum-X netw

Anthropic releases Claude Sonnet 4.6, bringing near-flagship performance to its mid-tier model
Anthropic released Claude Sonnet 4.6, upgrading its mid-tier AI model with improved coding, computer use, and long-context reasoning capabilities.

OpenAI adds Lockdown Mode and “Elevated Risk” labels
OpenAI introduced Lockdown Mode in ChatGPT and added “Elevated Risk” labels to reduce prompt injection data exfiltration risk.

Manus brings its AI agent into Telegram chats
Manus has launched “Manus Agents,” a way to use its AI agent directly inside messaging apps, starting with Telegram.

Vatican introduces AI-assisted live translations for Mass at St. Peter’s Basilica
The Vatican is rolling out an AI-assisted live translation service intended to help attendees follow principal celebrations at St. Peter’s Basilica in

OpenClaw creator Peter Steinberger joins OpenAI as OpenClaw shifts to a foundation

Speed, the rising product feature in AI
Developments in early 2026 show a change in how AI companies compete. Response speed appears to be moving from a background engineering concern to a v

Manus adds Project Skills to its AI agent platform
Manus has introduced a feature called Project Skills that lets teams curate and lock sets of reusable AI workflows at the project level.

Google Docs adds Gemini-powered audio summaries
Google is rolling out a new Gemini feature in Google Docs that lets you listen to a short audio summary of a document.

Airbnb says AI now handles nearly 30% of English-language support tickets in North America

OpenAI unveils ultra-fast “Codex-Spark” model for real-time coding
OpenAI has launched a research preview of GPT-5.3-Codex-Spark, a new version of its Codex coding model designed to respond quickly enough for real-tim

DeepMind reports research-level results from Gemini Deep Think
Gemini Deep Think is being used to assist with research-level problems in mathematics, computer science, physics, and economics.

OpenAI upgrades ChatGPT deep research with GPT-5.2 and new controls

Alibaba launches Qwen-Image-2.0, a unified image model for 2K generation and editing
Alibaba’s Qwen team has launched Qwen-Image-2.0, a new image generation model positioned as a single system for both text-to-image generation and imag

Study suggests LLM leaderboards may be more fragile than they appear

OpenAI begins testing ads in ChatGPT, turning "Go" into the budget tier while Plus stays ad-free
OpenAI said on Feb. 9, 2026 that it is beginning a U.S. test of ads in ChatGPT, positioning ads as a way to support broader access while keeping answe

"Fast mode" for Claude Opus 4.6, 2.5x speed at 6x the price
A new "fast mode" for Claude Opus delivers up to 2.5 times higher output token generation speed at a significant premium: six times the standard.

The Cowork selloff: how an Anthropic plugin update rattled software stocks
A product update billed as a modest expansion triggered the worst week for enterprise software in years, then markets snapped back. Here's what actual

4-minute picture task helps AI screen for addiction risk
A machine-learning model uses picture ratings to predict substance use disorder behaviors.

Goldman Sachs is building AI agents with Anthropic for accounting and compliance

OpenAI drops GPT-5.3-Codex right after Anthropic's Opus 4.6

Anthropic releases Claude Opus 4.6 with 1M context window

International AI Safety Report warns oversight is lagging
A major international report released this week finds that artificial intelligence systems are advancing faster than the methods used to test, monitor

Roblox launches AI 4D generation beta for game objects
Roblox has launched the beta release of 4D generation, a new AI system that allows creators to generate functional, interactive 3D objects from text p

Without voice, the ChatGPT macOS app falls short
On January 15, 2026, OpenAI removed the Voice feature from the ChatGPT macOS desktop app. While voice remains available on the web, mobile, and Window

OpenAI releases standalone Codex app for macOS
OpenAI released a dedicated macOS application for Codex, its AI-powered coding assistant. The app is designed to serve as a "command center" that make

OpenAI shares how its internal data agent works
A look at OpenAI’s in-house data agent, including how it uses context, access controls, and evals to support internal analytics.

Inside Microsoft’s Maia 200: Why This Chip Matters More Than You Think
Microsoft’s Maia 200 is not trying to be the fastest chip on paper. It is trying to be the cheapest place to run AI at scale. That distinction matters